Mans Rullgard
3fc4e36c78
x86: h264: remove hardcoded edx in decode_significance[_8x8]_x86()
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
e4b5a204aa
x86: h264: remove hardcoded eax in decode_significance[_8x8]_x86()
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
018c33838e
x86: cabac: remove hardcoded ebx in inline asm
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
6b712acc0e
x86: cabac: remove hardcoded struct offsets from inline asm
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Ronald S. Bultje
ed63f527f2
Fix build if yasm is not available.
14 years ago
Daniel Kang
f188a1e0ca
H.264: Add x86 assembly for 10-bit MC Chroma H.264 functions.
...
Mainly ported from 8-bit H.264 MC Chroma.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
14 years ago
Carl Eugen Hoyos
5fb67d8039
Fix compilation with old yasm.
14 years ago
Jason Garrett-Glaser
c90b94424c
4:4:4 H.264 decoding support
...
Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
14 years ago
Jason Garrett-Glaser
504811baea
Roll back 4:4:4 H.264 for now
...
Needs some ARM/PPC asm modifications.
14 years ago
Jason Garrett-Glaser
c9c493872c
4:4:4 H.264 decoding support
...
Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
14 years ago
Oskar Arvidsson
6c031a3338
h264: Fix 10-bit H.264 x86 chroma v loopfilter asm.
...
The tc variable was not splatted correctly.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
14 years ago
Daniel Kang
4de83b7b6d
H264: x86 predict init cosmetics.
...
Change indentation and whitespace; also move HAVE_YASM blocks.
Signed-off-by: Diego Biurrun <diego@biurrun.de>
14 years ago
Daniel Kang
a8d44f9dd5
Add x86 assembly for some 10-bit H.264 intra predict functions.
...
Parts are inspired by the 8-bit H.264 predict code in Libav.
Other parts ported from x264 with relicensing permission from author.
Signed-off-by: Diego Biurrun <diego@biurrun.de>
14 years ago
Loren Merritt
53be7b23e9
Cosmetic changes to h264_idct_10bit.asm.
...
Removes redundant dword tags and whitespace changes.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
14 years ago
Loren Merritt
994c3550ff
2x faster h264_idct_add8_10.
...
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
14 years ago
Ronald S. Bultje
e6635a9a19
h264: remove CONFIG_GPL from x86 intra prediction code.
...
The authors permitted relicensing to LGPL a long time ago (Holger,
Loren and Jason).
14 years ago
Daniel Kang
f3aa65af3a
h264/10bit: add HAVE_ALIGNED_STACK checks.
...
Fixes regression in 836f47d34b in ICC-10.x,
since ICC<=11.0 doesn't align stack upon function calls.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
14 years ago
Daniel Kang
348493db60
Update 8-bit H.264 IDCT function names to reflect bit-depth.
...
Signed-off-by: Ronald S. Bultje <rbultje@google.com>
14 years ago
Daniel Kang
836f47d34b
Add IDCT functions for 10-bit H.264.
...
Ports the majority of IDCT functions for 10-bit H.264.
Parts are inspired by 8-bit IDCT code in Libav; other parts ported from x264 with relicensing permission from author.
Signed-off-by: Ronald S. Bultje <rbultje@google.com>
14 years ago
Justin Ruggles
70bb747a57
ac3dsp: do not use the ff_* prefix when referencing ff_ac3_bap_bits.
...
This should fix the Windows builds.
Signed-off-by: Martin Storsjö <martin@martin.st>
14 years ago
Justin Ruggles
6ca23db9cc
ac3enc: modify mantissa bit counting to keep bap counts for all values of bap
...
instead of just 0 to 4.
This does all the actual bit counting as a final step.
14 years ago
Diego Biurrun
5e528cffcf
x86: Add appropriate ifdefs around certain AVX functions.
...
nasm versions prior to 2.09 have trouble assembling some of our AVX code.
Protect these sections by preprocessor macros to allow compilation to pass.
14 years ago
Reimar Döffinger
7e637b70ec
Fix compilation with YASM/NASM versions not supporting AVX.
14 years ago
Reimar Döffinger
384d10360b
Fix register types for LOAD_AB arguments, fixes compilation with NASM.
14 years ago
Dave Yeo
a10fb79070
x86 asm: Add SECTION_TEXT to dct32_sse.asm.
...
This fixes the following error on OS/2:
error: segment name `.text align=16' not recognized
Signed-off-by: Diego Biurrun <diego@biurrun.de>
14 years ago
Michael Niedermayer
94ea17075b
dct32: Replacing libav by ffmpeg in the license header with the authors permission.
...
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
14 years ago
Loren Merritt
422b2362fc
dct32_sse: eliminate some spills
...
125->104 cycles on penryn (x86_64 only)
14 years ago
Vitor Sessak
e6c1791b47
Fix compilation with --disable-yasm.
14 years ago
Vitor Sessak
165c7c420d
Fix dct32() compilation with --disable-yasm
...
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
14 years ago
Vitor Sessak
6204feb160
dct32: Add AVX implementation of 32-point DCT
14 years ago
Vitor Sessak
4e653b98c8
dct32: Change pass 6 permutation to allow for AVX implementation
14 years ago
Vitor Sessak
3758eb0eb9
dct32: port SSE 32-point DCT to YASM
14 years ago
Diego Biurrun
153382e1b6
multiple inclusion guard cleanup
...
Add missing multiple inclusion guards; clean up #endif comments;
add missing library prefixes; keep guard names consistent.
14 years ago
Dave Yeo
d69f9a4234
Add support for a.out object format to assembler macros.
...
This format is still used by e.g. OS/2.
Signed-off-by: Diego Biurrun <diego@biurrun.de>
14 years ago
Mans Rullgard
0b5e44ed29
mpegaudiodsp: fix x86 and ppc makefiles
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
c4f5c2d6f4
Move some mpegaudio functions to new mpegaudiodsp subsystem
...
This separation allows these functions to be used in a cleaner
fashion from other codecs (e.g. qdm2) and simplifies creating
optimised versions of them.
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Justin Ruggles
e98a95e779
10l: wrap float_interleave functions in HAVE_YASM.
...
fixes compilation with --disable-yasm
14 years ago
Justin Ruggles
32f8fb8ecf
Add float_interleave() to FmtConvertContext with x86-optimized versions.
...
Partially based on patches by clsid2 in ffdshow-tryout.
ff_float_interleave6() x86 improvements by Loren Merritt.
14 years ago
Daniel Kang
d0005d347d
Modify x86util.asm to ease transitioning to 10-bit H.264 assembly.
...
Arguments for variable size instructions are added to many macros, along
with other various changes. The x86util.asm code was ported from x264.
Signed-off-by: Diego Biurrun <diego@biurrun.de>
14 years ago
Gil Pedersen
257de5fb25
h264dsp_mmx: Add #ifdefs around some mmxext functions on x86_64.
...
This fixes linking errors due to undefined symbols on x86_64 OS X.
Signed-off-by: Diego Biurrun <diego@biurrun.de>
14 years ago
Diego Biurrun
888fa31eca
Fix FSF address copy paste error in some license headers.
14 years ago
Jason Garrett-Glaser
5705b02079
10-bit H.264 x86 chroma v loopfilter asm
...
Also delete some unused deblock asm macros.
14 years ago
Jason Garrett-Glaser
9f3d6ca4f1
Port x86 10-bit H.264 deblock asm from x264
14 years ago
Jason Garrett-Glaser
8ad77b65b5
Update x86 H.264 deblock asm
...
Includes AVX versions from x264.
14 years ago
Ronald S. Bultje
86b29553f8
h264dsp_mmx: place bracket outside #if/#endif block.
...
Should fix compile on systems missing yasm/nasm.
14 years ago
Oskar Arvidsson
19a0729b4c
Adds 8-, 9- and 10-bit versions of some of the functions used by the h264 decoder.
...
This patch lets e.g. dsputil_init choose dsp functions with respect to
the bit depth to decode. The naming scheme of bit depth dependent
functions is <base name>_<bit depth>[_<prefix>] (i.e. the old
clear_blocks_c is now named clear_blocks_8_c).
Note: Some of the functions for high bit depth are not dependent on the
bit depth, but only on the pixel size. This leaves some room for
optimizing binary size.
Preparatory patch for high bit depth h264 decoding support.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
14 years ago
Diego Biurrun
a734fa575f
Remove disabled non-optimized code variants.
14 years ago
Michael Niedermayer
52a81cd0e4
Fix add_paeth_prediction_mmx for rgb48
...
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
14 years ago
Michael Niedermayer
afd2371d5c
merge read and and in add_paeth_prediction
...
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
14 years ago
Baptiste Coudurier
6d4c49a2af
Move png mmx functions into x86/png_mmx.c, remove them from DSPContext.
...
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
14 years ago