200 Commits (80581e9887ce9968516ec4d7a6a450eeec6e4ade)

Author SHA1 Message Date
Loren Merritt d2bb7db135 sort H.264 mmx dsp functions into their own file 20 years ago
Michael Niedermayer c26ae41db2 adding a few const 20 years ago
Michael Niedermayer 435b0720a8 100l for myself (breaking amd64) 20 years ago
Michael Niedermayer 6510f43cf3 merge a few asm blocks so gcc cant unoptimize it (658->631 dezicycles on duron) 20 years ago
Michael Niedermayer 987ae784e6 get rid of 2 movq (680 -> 658 dezicycles on duron) 20 years ago
Michael Niedermayer e4b36d4434 avoid one transpose (730->680 dezicycles on duron) 20 years ago
Loren Merritt 85bbfcd4ee 10l (symbol mangling) 20 years ago
Michael Niedermayer 1f3dbc09b1 add rounding bias before the horizontal idct (765->730 dezicyles on duron) 20 years ago
Loren Merritt 1d62fc8560 MMX for H.264 iDCT (adapted from x264) 20 years ago
Zoltán Hidvégi 3072f0cb2e MMX code for (put|avg)_h264_chroma_mc8 20 years ago
Loren Merritt 5cf08f2393 H.264 deblocking optimizations (mmx for chroma_bS4 case, convert existing cases to 8-bit math) 20 years ago
Michael Niedermayer 5773a74669 porting the mmx&sse2 (sse2 untested) vp3 idcts to the lavc idct API 20 years ago
Michael Niedermayer b178f758fa disabling vp3 mmx&mmx2 idcts, they must be ported over to the lavc idct API, ill port the vp3 c idct 20 years ago
Michael Niedermayer c998bdd9a0 fix PIC 20 years ago
Loren Merritt 42251a2a4f MMX for H.264 deblocking filter 20 years ago
Michael Niedermayer 4e492bf107 read 32bit instead of 64bit to avoid overreading and missalignments 20 years ago
Martin Drab 4d9ae03b09 optimization and gcc 4.0 bug workaround patch by (Martin Drab >drab kepler.fjfi.cvut cz<) 20 years ago
Michael Niedermayer e42a152bbc x86 cpu capabilities detection rewrite / cleanup 20 years ago
Matthieu Castet e7ddb0cf4c detect sse on athlon-xp patch by (matthieu castet <castet >.< matthieu >at< free >.< fr>) 20 years ago
Michael Niedermayer 1c3990dbba H.261 encoder by (Maarten Daniels <maarten dot daniels at luc dot ac dot be>) 20 years ago
Aurelien Jacobs 8b4d077fc9 gcc 3.4.3 preversions do not appreciate invalid instruction and operand combinations anymore patch by (Aurelien Jacobs <aurel at gnuage dot org>) 20 years ago
Aurelien Jacobs 053dea12f2 adapting existing mmx/mmx2/sse/3dnow optimizations so they work on x86_64 patch by (Aurelien Jacobs <aurel at gnuage dot org>) 20 years ago
Michael Niedermayer 178fcca848 1/2 resolution decoding 20 years ago
Michael Niedermayer e69538fa60 h264_qpel8_hv_lowpass_mmx2/3dnow 20 years ago
Michael Niedermayer e772bb8a82 h264_qpel4_hv_lowpass_mmx2/3dnow 20 years ago
Michael Niedermayer 56d8bd5659 optimization 20 years ago
Michael Niedermayer a6e39f45a2 optimization 20 years ago
Michael Niedermayer ed8ffdf46c optimization 20 years ago
Michael Niedermayer 437525c473 h264 luma motion compensation in mmx2/3dnow 20 years ago
Michael Niedermayer d6af6b0350 10000l fix and use more mmx2/3dnow code for mpeg4 qpel which has been written and commited long time ago but appearently never used, qpel motion compensation is 5% faster 20 years ago
Michael Niedermayer 6206f8c920 10l and better MMX/SSE detection for VIA1000 21 years ago
Michael Niedermayer 61b3b639e7 dc_precission=11 fix 21 years ago
Michael Niedermayer 4c99f2cdbe transmeta crusoe mmx detection by "Nobody/Anonymous - nobody" / sf feature request tracker 21 years ago
Michael Niedermayer 1ec4df0fa8 sse8 and nsse in mmx 21 years ago
Michael Niedermayer e96682e6f4 some of the warning fixes by (Michael Roitzsch <mroi at users dot sourceforge dot net>) 21 years ago
Michael Niedermayer 77d0e165b1 mmx dct optimization 21 years ago
Mike Melanson 7daabccb5d move the 0x80 vector outside of the function, thus saving the compiler 21 years ago
Mike Melanson f9ed9d8584 separate out put_signed_pixels_clamped() into its own function and 21 years ago
Mike Melanson 75b4f3c6d6 yes, that's nice... 21 years ago
Mike Melanson 116824d0aa reorganize and simplify the VP3 IDCT stuff 21 years ago
Mike Melanson 6186b94c13 hush... 21 years ago
Mike Melanson 38acbc3cb9 hook up support for SSE2-optimized VP3 IDCT 21 years ago
Mike Melanson 78a16bfec6 add selected SSE2 ASM macros 21 years ago
Mike Melanson 5f9379007c SSE2-optimized variant of VP3 IDCT 21 years ago
Mike Melanson daae869903 correct MMX-optimized variant of VP3 IDCT, with comments (thank you 21 years ago
Michael Niedermayer 19f82bc824 fixing alignment problems -> SSE2 support enabled again in libavcodec (from ffdshow / milan_cutka) 21 years ago
Gildas Bazin 68951ecf0c fft_*() renamed into ff_fft_*() patch by (Gildas Bazin <gbazin at altern dot org>) 21 years ago
Mike Melanson 01456e8e86 use optimized VP3 functions where appropriate 21 years ago
Mike Melanson 44cb64ee89 seperated out the C-based VP3 DSP functions into a different file; also 21 years ago
Dmitry Baryshkov 5c0513bda7 attribute used patch by (mitya at school dot ioffe dot ru (Dmitry Baryshkov)) 21 years ago