Michael Niedermayer
be3b22f9ca
remove idiotc double subtraction from the sse2 code (untested, no sse2 here)
...
Originally committed as revision 10158 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Michael Niedermayer
8510c519e7
fixme note
...
Originally committed as revision 10157 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Michael Niedermayer
3a9f44d5d5
and of course the unneeded double subtractions were blindly put in the
...
mmx code
this also makes the affected code 4% faster
Originally committed as revision 10156 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Aurelien Jacobs
73f51a4d3e
help some gcc version to optimize out those functions
...
Originally committed as revision 9785 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Aurelien Jacobs
674eeb5f57
cosmetics: indentation
...
Originally committed as revision 9582 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Aurelien Jacobs
eb75a69818
Avoid linking with h263.c functions when the relevant codecs
...
are not compiled in.
Originally committed as revision 9581 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Måns Rullgård
a00177a952
make arguments to ssd_int8_vs_int16() const
...
Originally committed as revision 9548 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Diego Biurrun
09a6682648
typos
...
Originally committed as revision 9486 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Diego Biurrun
e5a389a1b7
license header consistency cosmetics
...
Originally committed as revision 9484 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Diego Biurrun
7b94177e37
Group all copyright and author notices together.
...
Originally committed as revision 9483 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Guillaume Poirier
efb775777f
add a comment to indicate which #endif belong to which #define
...
Originally committed as revision 9356 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Måns Rullgård
699b3f99d0
add multiple inclusion guards to headers
...
Originally committed as revision 9345 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Måns Rullgård
8657d5e549
correct type for {put,avg}_h264_chroma_mc2_mmx2
...
Originally committed as revision 9333 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Diego Biurrun
90b5b51eab
misc typo fixes
...
Originally committed as revision 9291 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Diego Biurrun
755bfeabcc
misc spelling fixes
...
Originally committed as revision 9289 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Graham Booker
ee387b57fd
work around issues with the old version of Gnu Assembler shipped on
...
Intel / OSX 10.4.9
patch by Graham Booker % gbooker A cod3r P com %
Original thread:
Date: May 25, 2007 11:51 PM
Subject: [FFmpeg-devel] r9017 breaks WMA decoding on Intel Macs
The patch:
Date: Jun 3, 2007 4:37 PM
Subject: Re: [FFmpeg-devel] r9017 breaks WMA decoding on Intel Macs
Originally committed as revision 9260 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Diego Biurrun
34933442ac
Preprocessor #defines starting with __ are reserved for the system.
...
Originally committed as revision 9251 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Ronald S. Bultje
3d4e3e560f
Use FFmpeg-specific CPU feature definitions.
...
taken from a patch by Ronald S. Bultje, rbultje ronald.bitfreak net
Date: Thu, 07 Jun 2007 08:57:46 -0400
Subject: Re: [FFmpeg-devel] [PATCH] Coremake support - ffmpeg_nommx.patch (1/1)
- ffmpeg-nommx.patch (1/1) - ff-cputest-noavconfig.patch (1/1)
Originally committed as revision 9250 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Carl Eugen Hoyos
154e30f6c2
rename attribute_unused to av_unused and moves its declaration to common.h
...
patch by Carl Eugen Hoyos cehoyos chez ag or at
original thread: [FFmpeg-devel] [PATCH] attribute_unused -> av_unused
date: 05/29/2007 01:23 PM
Originally committed as revision 9155 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Zuxy Meng
663deb54af
Remove incorrect comment; MMX2 is preferred over 3DNow! on Athlon
...
Originally committed as revision 9079 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Zuxy Meng
038bfcf9d6
3DNow! and SSSE3 optimization to QNS DSP functions; use pmulhrw/pmulhrsw instead of pmulhw
...
Originally committed as revision 9053 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Aurelien Jacobs
5b0b7054b4
better separation of vp3dsp functions from dsputil_mmx.c
...
Originally committed as revision 9039 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Ronald S. Bultje
b550bfaa61
Add libavcodec to compiler include flags in order to simplify header
...
include paths in the source files.
mostly from a patch by Ronald S. Bultje, rbultje ronald.bitfreak net
Originally committed as revision 9034 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Panagiotis Issaris
9b5dc86746
Make vp3dsp*.c compilation optional.
...
Originally committed as revision 9025 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Reimar Döffinger
e36d79c837
Change some leftover __attribute__((unused)) and __attribute__((used)) to
...
attribute_unused and attribute_used respectively to ease compiling on non-gcc.
Originally committed as revision 9024 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Zuxy Meng
25e4f8aaee
Faster SSE FFT/MDCT, patch by Zuxy Meng %zuxy P meng A gmail P com%
...
unrolls some loops, utilizing all 8 xmm registers. fft-test
shows ~10% speed up in (I)FFT and ~8% speed up in (I)MDCT on Dothan
Originally committed as revision 9017 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
ff506a906e
sse2 & ssse3 versions of dct_quantize.
...
core2: mmx2=154 sse2=73 ssse3=66 (cycles)
k8: mmx2=179 sse2=149
p4: mmx2=284 sse2=194
Originally committed as revision 9003 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
1edbfe1994
factor sum_abs_dctelem out of dct_sad, and simd it.
...
sum_abs_dctelem_* alone:
core2: c=186 mmx2=39 sse2=21 ssse3=13 (cycles)
k8: c=163 mmx2=33 sse2=31
p4: c=370 mmx2=60 sse2=60
dct_sad including sum_abs_dctelem_*:
core2: c=405 mmx2=258 sse2=240 ssse3=232
k8: c=624 mmx2=394 sse2=392
p4: c=849 mmx2=556 sse2=556
Originally committed as revision 9001 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
561f940c03
sse2 & ssse3 versions of hadamard. unroll and inline diff_pixels.
...
core2: before mmx2=193 cycles. after mmx2=174 sse2=122 ssse3=115 (cycles).
k8: before mmx2=205. after mmx2=184 sse2=180.
p4: before mmx2=342. after mmx2=314 sse2=309.
Originally committed as revision 9000 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
ba53071acb
10l, r8991 broke mmx1 sad
...
Originally committed as revision 8993 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
72946825fa
sse2 version of fullpel sad.
...
16% faster on core2, 5% faster on p4. 10% slower (and thus disabled) on k8.
Originally committed as revision 8992 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
164d75ebf3
tweak mmx2 sad.
...
40% faster on core2, 18% faster on k8, 5% faster on p4.
Originally committed as revision 8991 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
eca3810e31
tweak mmx2 sad.
...
6% faster on core2 and k8, no change on p4.
Originally committed as revision 8984 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
7c3a9fe2a3
sse2 version of fdct_col.
...
k8: 72->61 cycles, core2: 51->26 cycles.
Originally committed as revision 8966 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
5adf43e47e
cosmetics: remove code duplication in hadamard8_diff_mmx
...
Originally committed as revision 8946 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
bba5293bb7
cosmetics: remove duplicate transpose macro
...
Originally committed as revision 8939 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Reimar Döffinger
a1ce61108b
Fix parts missed in clip -> av_clip rename
...
Originally committed as revision 8760 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Diego Biurrun
fe0372296a
typos
...
Originally committed as revision 8642 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Loren Merritt
5900637219
mmx 16-bit ssd. 2.3x faster svq1 encoding.
...
Originally committed as revision 8559 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Diego Biurrun
d42f88025a
Fix wrong conditional, Snow decoding, not encoding, was SIMD-accelerated.
...
Originally committed as revision 8116 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
58e31fb1d5
reorder a few more paddws to reduce dependancy chains
...
chroma mc4 put 2480 -> 2460 dezicyles on duron
Originally committed as revision 8098 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
b4fe97696c
reorder paddws to reduce dependancy chain
...
put_h264_chroma_mc2_mmx2() 927 -> 902 dezicyles on duron
Originally committed as revision 8097 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
0c67082e02
shortening dependancy chain in chroma mc2
...
Originally committed as revision 8095 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
af26516261
remove now wrong comment
...
Originally committed as revision 8094 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
61240ae556
fix chroma mc2 bug, this is based on a patch by (Oleg Metelitsa oleg hitron co kr)
...
and does slow the mc2 chroma put down, avg interrestingly seems unaffected speedwise on duron
this of course should be rather done in a way which doesnt slow it down but its better a few %
slower but correct then incorrect
Originally committed as revision 8093 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
470d2d03cc
gcc 2.95 fix
...
Originally committed as revision 8059 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Måns Rullgård
459022f504
fix for x86-64
...
Originally committed as revision 8022 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
b21e0b6dfc
rewrite H264_CHROMA_MC4_TMPL (20% faster)
...
Originally committed as revision 8012 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
2a115873af
add a few asserts to ensure alignment
...
Originally committed as revision 7994 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago
Michael Niedermayer
00e210ddbb
prevent h.264 MC related functions from being inlined (yes this is much faster the code just doesnt fit in the code cache otherwise)
...
Originally committed as revision 7993 to svn://svn.ffmpeg.org/ffmpeg/trunk
18 years ago