202 Commits (e9f5f020c6f00e0833edc84399d4eabf4a028780)

Author SHA1 Message Date
Ronald S. Bultje e9f5f020c6 Move VP3 IDCT functions from inline ASM to YASM. This fixes part of the VP3/5/6 15 years ago
Ronald S. Bultje 7e7c4b6008 Put ff_ prefix on non-static {put_signed,put,add}_pixels_clamped_mmx() 15 years ago
Loren Merritt 19d929f9a3 cosmetics in imdct_sse 15 years ago
Ronald S. Bultje 4eca52ed19 Fix typos when converting inline asm to yasm, fixes MMX-only fate-ea-vp61. 15 years ago
Ronald S. Bultje 6697bc33e2 Revert r24931, it broke Win32 and some BSD compiles (yay fate). 15 years ago
Ronald S. Bultje 72f642400b Mark xmm6 and xmm7 as clobbered in ff_vp3_idct_sse2(), which is contributing 15 years ago
Måns Rullgård 69dad87c48 VP6: fix vp6_filter_diag4_mmx/sse on 64-bit 15 years ago
Ronald S. Bultje 89fa3504ed Move vp6_filter_diag4() x86 SIMD code from inline ASM to YASM. This should 15 years ago
Ronald S. Bultje 3a0885146c Move vp6_filter_diag4() from DSPContext to VP56DSPContext. 15 years ago
Måns Rullgård c0ec9918b0 Remove global mm_flags variable 15 years ago
Ronald S. Bultje 3611c45ab7 Mark xmm registers as clobbered in simple loopfilter. Should fix the last 15 years ago
Alex Converse cb4f12466b imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". 15 years ago
Ronald S. Bultje 684d608bde Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures). 15 years ago
Alex Converse 78b5c97d3e Convert ff_imdct_half_sse() to yasm. 15 years ago
Jason Garrett-Glaser 05c04cdf54 VP5/6/8: ~7% faster arithmetic decoding 15 years ago
Jason Garrett-Glaser 4a384de5b8 Split h264dsp and h264pred in configure. 15 years ago
Jason Garrett-Glaser 98fe09df7b Add file missing in r24702 15 years ago
Eli Friedman c12d6955e2 H.264: SSE2/SSSE3 weighted prediction asm 15 years ago
Måns Rullgård f079a64aea Move cavs dsp functions to their own struct 15 years ago
Jason Garrett-Glaser 8b9b5e085f VP5/6/8: add one inline missed in r24677 15 years ago
Jason Garrett-Glaser 827d43bb9d VP8: move zeroing of luma DC block into the WHT 15 years ago
Ronald S. Bultje 6341838f3c Use word-writing instead of dword-writing (with two cached but otherwise 15 years ago
Vitor Sessak fa738b3ad1 Remove x86/mmx.h. It is not used anymore and has been deprecated for years. 15 years ago
Vitor Sessak de4bc44abb Convert deinterlacing MMX code to YASM 15 years ago
Vitor Sessak 740dfe7012 Fix compilation in x86_64. I broke it with r24580. 15 years ago
Vitor Sessak 2c3dda6838 Translate libmpeg2 MMX IDCT to plain asm 15 years ago
Ronald S. Bultje ab4d031889 Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster. 15 years ago
Jason Garrett-Glaser e25dee602f VP8: Much faster SSE2 MC 15 years ago
Ronald S. Bultje 48adb7e7a4 Enable no-loop memory/register saving for ssse3/sse4 also. 15 years ago
Ronald S. Bultje 2a180c69ea Save a register (or regsize of stackspace for x86-32) for the no-loop 15 years ago
Ronald S. Bultje bcd4aa6498 Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. this 15 years ago
Ronald S. Bultje 2208053bd3 Split pextrw macro-spaghetti into several opt-specific macros, this will make 15 years ago
Ronald S. Bultje 6de5b7c6b8 Fix obvious bug in assignment. Somehow, the test vectors don't test this... 15 years ago
Ronald S. Bultje e3f7bf774c Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so this 15 years ago
Eli Friedman 3611e7a309 Inline asm for VP56 arith coder 15 years ago
Jason Garrett-Glaser 3ae079a3c8 VP8: optimize DC-only chroma case in the same way as luma. 15 years ago
Jason Garrett-Glaser 51c9156438 VP8 asm: cosmetics (spacing) 15 years ago
Jason Garrett-Glaser 8a467b2d44 VP8: 30% faster idct_mb 15 years ago
Jason Garrett-Glaser c25c776708 VP8: clear DCT blocks in iDCT instead of using clear_blocks. 15 years ago
Ronald S. Bultje dc5eec8085 Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles on 15 years ago
Ronald S. Bultje 003243c3c2 Fix and enable horizontal >=SSE2 mbedge loopfilter. 15 years ago
Loren Merritt c7b1d9768c relicense h264 deblock sse2 to lgpl 15 years ago
Loren Merritt 532e769701 sync yasm macros from x264 15 years ago
Jason Garrett-Glaser 8731dbd890 Eliminate one instruction in VP8 dc_add_sse4 15 years ago
Jason Garrett-Glaser 7dd224a42d Various VP8 x86 deblocking speedups 15 years ago
Jason Garrett-Glaser b8b231b5dc Make mmx VP8 WHT faster 15 years ago
David Conrad af521abc28 Add header declarations for mmx/sse constants missing them 15 years ago
David Conrad c7eec58170 Move ff_pw_* from vc1dsp_mmx.c to dsputil_mmx.c 15 years ago
Ronald S. Bultje e9e456d850 VP8 MBedge loopfilter MMX/MMX2/SSE2 functions for both luma (width=16) 15 years ago
Ronald S. Bultje 268821e76e Chroma (width=8) inner loopfilter MMX/MMX2/SSE2 for VP8 decoder. 15 years ago