Zuxy Meng
ecb24904fe
add SSE2 version of vp6_filter_diag
...
original patch by Zuxy Meng zuxy.meng _at_ gmail _dot_ com
Originally committed as revision 17195 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Sebastien Lucas
6af3c226c3
add MMX version of vp6_filter_diag
...
original patch by Sebastien Lucas sebastien.lucas _at_ gmail _dot_ com
Originally committed as revision 17194 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Aurelien Jacobs
5110b25e1e
convert ff_pw_64 into an xmm_reg for future use in vp6 sse code
...
Originally committed as revision 17192 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
d3a4b4e09c
Add check whether the compiler/assembler supports 10 or more operands.
...
thanks to Loren for some help with the asm statements
Originally committed as revision 17151 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Loren Merritt
3daa434a40
ff_add_hfyu_median_prediction_mmx2
...
overall ffvhuff decoding speedup: 28% on core2, 25% on k8.
Originally committed as revision 17059 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
137ae32760
Workaround for gcc 3.4 to align sh properly
...
Originally committed as revision 16797 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
406792e7b0
cosmetics: Remove pointless period after copyright statement non-sentences.
...
Originally committed as revision 16684 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Aurelien Jacobs
49fb20cb8a
replace all occurrence of ENABLE_ by the corresponding CONFIG_, HAVE_ or ARCH_
...
and remove all ENABLE_ definitions.
Originally committed as revision 16600 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Aurelien Jacobs
b250f9c66d
Change semantic of CONFIG_*, HAVE_* and ARCH_*.
...
They are now always defined to either 0 or 1.
Originally committed as revision 16590 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
c47d146be8
Add missing 'void' keyword to parameterless function declarations.
...
Originally committed as revision 16436 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Mathieu Velten
21ff7689da
Use H264 MMX chroma functions to accelerate RV40 decoding.
...
Patch by Mathieu Velten (matmaul A gmail)
Originally committed as revision 16419 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Jason Garrett-Glaser
37fed10087
Add x264 SSE2 iDCT functions to H.264 decoder.
...
Originally committed as revision 16409 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Carl Eugen Hoyos
2c67c65963
Fix h264 decoding on SSE2 cores with icc compilation.
...
Originally committed as revision 16373 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Jason Garrett-Glaser
c1fc70362f
Fix compilation without optimization under 64-bit with x264 deblock asm enabled.
...
Originally committed as revision 16313 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
a6493a8fbd
Rename libavcodec/i386/ --> libavcodec/x86/.
...
It contains optimizations that are not specific to i386 and
libavutil uses this naming scheme already.
Originally committed as revision 16270 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
ebb160a0b0
Use ARCH_X86_32 instead of !ARCH_X86_64, it is more straightforward.
...
Originally committed as revision 16262 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Jason Garrett-Glaser
a22eff36f9
Port x264 deblocking code to libavcodec.
...
This includes SSE2 luma deblocking code and both MMXEXT and SSE2 luma
intra deblocking code for H.264 decoding. This assembly is available
under --enable-gpl and speeds decoding of Cathedral by 7%.
Originally committed as revision 16239 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
ac22385931
H.264 idct functions that include the chroma, inter luma and intra16 luma loops
...
thus avoiding the calling overhead.
New functions are not yet used.
Originally committed as revision 16206 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Aurelien Jacobs
5e6604490a
avoid POSIX reserved _t suffix
...
Originally committed as revision 16117 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Loren Merritt
5fecfb7d58
clear_block mmx
...
Originally committed as revision 16045 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
9686df2be5
Delete unnecessary 'extern' keywords.
...
Originally committed as revision 15990 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Dominik Mierzejewski
82d1605fe7
Remove duplicated MM_* macros for CPU capabilities from dsputil.h.
...
Add missing one for FF_MM_ALTIVEC to avcodec.h.
Rename all the occurences of MM_* to the corresponding FF_MM_*.
Originally committed as revision 15770 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Pettenò
782fc0c36f
Rename template included sources from .h to _template.c.
...
There are multiple source files that are #include'd rather than
compiled, as they are used as template for generation of similar code,
like asm-optimised code. Some of these files are right now named with
a .h extension, although they are not header in any reasonable sense.
Rename them so that instead of being named with .h extension they are
named with _template.c as final part.
Originally committed as revision 15730 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Reimar Döffinger
31c4f07017
Use x86_reg type instead of long in float_to_int16 MMX/SSE functions.
...
Fixes compilation on MinGW64.
Originally committed as revision 15655 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
0dba1995bc
Cosmetics: reindent
...
Originally committed as revision 15644 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
ca4a4ac1b3
Combine non-bitexact sections
...
Originally committed as revision 15643 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
daa1ea049a
VP3 loop filter is mmx2 not mmx
...
Originally committed as revision 15642 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
357f45d9bc
MMX VP3 Loop Filter
...
Originally committed as revision 15630 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Pettenò
be449fca79
Convert asm keyword into __asm__.
...
Neither the asm() nor the __asm__() keyword is part of the C99
standard, but while GCC accepts the former in C89 syntax, it is not
accepted in C99 unless GNU extensions are turned on (with -fasm). The
latter form is accepted in any syntax as an extension (without
requiring further command-line options).
Sun Studio C99 compiler also does not accept asm() while accepting
__asm__(), albeit reporting warnings that it's not valid C99 syntax.
Originally committed as revision 15627 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
8cfd78ce8f
Ensure MMX/SSE2 VP3 IDCT selection isn't disabled when only Theora is enabled
...
Originally committed as revision 15350 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
David Conrad
ccd3ec82b8
MMX/SSE2 VP3 IDCT are bitexact now that the dequantization matrices are permutated correctly
...
Originally committed as revision 15345 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
David Conrad
b4c3d83584
Use ff_vp3_idct_data in vp3dsp_mmx.c rather than duplicating it
...
Originally committed as revision 15118 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
David Conrad
21383da8c4
Let ff_pw_8 be used as an SSE constant
...
Originally committed as revision 15052 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
ebceaa1cd5
gcc chokes on the 7 registers needed for float_to_int16_interleave6 (even inside HAVE_7REGS), so write it in yasm
...
Originally committed as revision 14749 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
ee46753739
gcc chokes on xmm constraints, so pessimize int32_to_float_fmul_scalar_sse a little
...
Originally committed as revision 14748 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
675872382f
special case 6 channel version of float_to_int16_interleave
...
5% faster ac3
Originally committed as revision 14744 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
911e21a306
simd int->float
...
20% faster ac3 if downmixing, 15% if not
Originally committed as revision 14743 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
ac2e556456
simd downmix
...
13% faster ac3 if downmixing
Originally committed as revision 14742 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
862b98d42c
cosmetics in dsp init
...
Originally committed as revision 14704 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Uoti Urpala
f769b746aa
Mark add_png_paeth_prediction_* functions which are only used within this file
...
as static. patch by Uoti Urpala, uoti.urpala pp1.inet fi
Originally committed as revision 14509 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
5eb0f2a425
float_to_int16_interleave: change src to an array of pointers instead of assuming it's contiguous.
...
this has no immediate effect, but will allow it to be used in more codecs.
Originally committed as revision 14252 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
4342a7f30b
10l, float_to_int16_interleave_sse/3dnow wrote the wrong samples
...
Originally committed as revision 14236 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
b9fa32082c
exploit mdct symmetry
...
2% faster vorbis on conroe, k8. 7% on celeron.
Originally committed as revision 14207 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Loren Merritt
f27e1d645e
simplify vorbis windowing
...
Originally committed as revision 14205 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Kostya Shishkov
d7e1fc4254
SSE2 optimizations for Monkey's Audio decoder vector functions
...
Originally committed as revision 14161 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Michael Niedermayer
e98750c373
float_to_int16_sse2()
...
20% faster than sse
Originally committed as revision 14138 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Michael Niedermayer
35ee72b1d7
1 c-asm loop less and 1x unroll of float_to_int16_sse()
...
25% faster
Originally committed as revision 14104 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Michael Niedermayer
560fa9bf51
Fix x86-64
...
Originally committed as revision 14103 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Michael Niedermayer
63b737d4f9
dont use C-asm loops and unroll once float_to_int16_3dnow()
...
30% faster
Originally committed as revision 14102 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago
Reimar Döffinger
00eebe3d6a
Fix add_bytes_mmx and add_bytes_l2_mmx for w < 16
...
Originally committed as revision 13877 to svn://svn.ffmpeg.org/ffmpeg/trunk
17 years ago