408 Commits (d2f119a1f2d2d72b0001fcdf2cc051b022bf6528)

Author SHA1 Message Date
Ronald S. Bultje a33a2562c1 Rename h264_weight_sse2.asm to h264_weight.asm; add 16x8/8x16/8x4 non-square 14 years ago
Ronald S. Bultje 14bc1f2485 Split h264dsp_mmx.c (which was #included in dsputil_mmx.c) in h264_qpel_mmx.c, 14 years ago
Ronald S. Bultje 5929b3a651 Fix vertical align. 14 years ago
Ronald S. Bultje 79ce0f002e Fix compilation failure if yasm is disabled (missing vp3 symbols). 14 years ago
Ronald S. Bultje de1c253bab Split intra prediction initialization (i.e. assigning of function pointers) 14 years ago
Ronald S. Bultje d0eb5a1174 Move H264 chroma MC from inline asm to yasm. This fixes VP3/5/6 and VC-1 14 years ago
Ronald S. Bultje e9f5f020c6 Move VP3 IDCT functions from inline ASM to YASM. This fixes part of the VP3/5/6 14 years ago
Ronald S. Bultje 7e7c4b6008 Put ff_ prefix on non-static {put_signed,put,add}_pixels_clamped_mmx() 14 years ago
Loren Merritt 19d929f9a3 cosmetics in imdct_sse 14 years ago
Ronald S. Bultje 4eca52ed19 Fix typos when converting inline asm to yasm, fixes MMX-only fate-ea-vp61. 14 years ago
Ronald S. Bultje 6697bc33e2 Revert r24931, it broke Win32 and some BSD compiles (yay fate). 14 years ago
Ronald S. Bultje 72f642400b Mark xmm6 and xmm7 as clobbered in ff_vp3_idct_sse2(), which is contributing 14 years ago
Måns Rullgård 69dad87c48 VP6: fix vp6_filter_diag4_mmx/sse on 64-bit 14 years ago
Ronald S. Bultje 89fa3504ed Move vp6_filter_diag4() x86 SIMD code from inline ASM to YASM. This should 14 years ago
Ronald S. Bultje 3a0885146c Move vp6_filter_diag4() from DSPContext to VP56DSPContext. 14 years ago
Måns Rullgård c0ec9918b0 Remove global mm_flags variable 14 years ago
Ronald S. Bultje 3611c45ab7 Mark xmm registers as clobbered in simple loopfilter. Should fix the last 14 years ago
Alex Converse cb4f12466b imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". 14 years ago
Ronald S. Bultje 684d608bde Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures). 14 years ago
Alex Converse 78b5c97d3e Convert ff_imdct_half_sse() to yasm. 14 years ago
Jason Garrett-Glaser 05c04cdf54 VP5/6/8: ~7% faster arithmetic decoding 14 years ago
Jason Garrett-Glaser 4a384de5b8 Split h264dsp and h264pred in configure. 14 years ago
Jason Garrett-Glaser 98fe09df7b Add file missing in r24702 14 years ago
Eli Friedman c12d6955e2 H.264: SSE2/SSSE3 weighted prediction asm 14 years ago
Måns Rullgård f079a64aea Move cavs dsp functions to their own struct 14 years ago
Jason Garrett-Glaser 8b9b5e085f VP5/6/8: add one inline missed in r24677 14 years ago
Jason Garrett-Glaser 827d43bb9d VP8: move zeroing of luma DC block into the WHT 14 years ago
Ronald S. Bultje 6341838f3c Use word-writing instead of dword-writing (with two cached but otherwise 15 years ago
Vitor Sessak fa738b3ad1 Remove x86/mmx.h. It is not used anymore and has been deprecated for years. 15 years ago
Vitor Sessak de4bc44abb Convert deinterlacing MMX code to YASM 15 years ago
Vitor Sessak 740dfe7012 Fix compilation in x86_64. I broke it with r24580. 15 years ago
Vitor Sessak 2c3dda6838 Translate libmpeg2 MMX IDCT to plain asm 15 years ago
Ronald S. Bultje ab4d031889 Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster. 15 years ago
Jason Garrett-Glaser e25dee602f VP8: Much faster SSE2 MC 15 years ago
Ronald S. Bultje 48adb7e7a4 Enable no-loop memory/register saving for ssse3/sse4 also. 15 years ago
Ronald S. Bultje 2a180c69ea Save a register (or regsize of stackspace for x86-32) for the no-loop 15 years ago
Ronald S. Bultje bcd4aa6498 Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. this 15 years ago
Ronald S. Bultje 2208053bd3 Split pextrw macro-spaghetti into several opt-specific macros, this will make 15 years ago
Ronald S. Bultje 6de5b7c6b8 Fix obvious bug in assignment. Somehow, the test vectors don't test this... 15 years ago
Ronald S. Bultje e3f7bf774c Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so this 15 years ago
Eli Friedman 3611e7a309 Inline asm for VP56 arith coder 15 years ago
Jason Garrett-Glaser 3ae079a3c8 VP8: optimize DC-only chroma case in the same way as luma. 15 years ago
Jason Garrett-Glaser 51c9156438 VP8 asm: cosmetics (spacing) 15 years ago
Jason Garrett-Glaser 8a467b2d44 VP8: 30% faster idct_mb 15 years ago
Jason Garrett-Glaser c25c776708 VP8: clear DCT blocks in iDCT instead of using clear_blocks. 15 years ago
Ronald S. Bultje dc5eec8085 Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles on 15 years ago
Ronald S. Bultje 003243c3c2 Fix and enable horizontal >=SSE2 mbedge loopfilter. 15 years ago
Loren Merritt c7b1d9768c relicense h264 deblock sse2 to lgpl 15 years ago
Loren Merritt 532e769701 sync yasm macros from x264 15 years ago
Jason Garrett-Glaser 8731dbd890 Eliminate one instruction in VP8 dc_add_sse4 15 years ago