FFmpeg

Commit Graph

Author	SHA1	Message	Date
Michael Niedermayer	dc12f7d4ec	x86inc: try to put amdnop under ifdef to prevent compile failure based on similar amdnop usage in ffmpeg Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Loren Merritt	4d4752366f	x86inc: add SPLATB_LOAD, SPLATB_REG, PSHUFLW macros Signed-off-by: Diego Biurrun <diego@biurrun.de>	13 years ago
Loren Merritt	2cd1f5cadc	x86inc: modify ALIGN to not generate long nops on i586 Signed-off-by: Diego Biurrun <diego@biurrun.de>	13 years ago
Mans Rullgard	889c1ec4cc	x86: cpu: clean up check for cpuid instruction support This adds macros for accessing the EFLAGS register and uses these instead of coding the entire check in inline asm. Signed-off-by: Mans Rullgard <mans@mansr.com>	13 years ago
yang	9b72041f80	x86/intmath.h: Fix mull operand constraints Fixes Ticket1466 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Mans Rullgard	963cdf39b4	x86: cpu: whitespace (mostly) cosmetics This adds whitespace around operators, aligns line continuation backslashes, and breaks long lines. Also fixes an ifdef halfway through a statement. The one line of duplication this saved is not worth the ugliness. Signed-off-by: Mans Rullgard <mans@mansr.com>	13 years ago
Ronald S. Bultje	8123e0901f	x86: place some inline asm under #if HAVE_INLINE_ASM Signed-off-by: Mans Rullgard <mans@mansr.com>	13 years ago
Diego Biurrun	65345a5a30	x86: Add CPU flag for the i686 cmov instruction	13 years ago
Michael Niedermayer	97726e86be	x86/intmath: fix type of FASTDIV Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Justin Ruggles	82b2df9790	float_dsp: add x86-optimized functions for vector_fmac_scalar()	13 years ago
Michael Niedermayer	f0313e9022	x86/float_dsp.asm: restore author attribution The attribution was removed by libav while moving the code to libavutil The original code is from commit `eb4825b5d4` Author: Loren Merritt <lorenm@u.washington.edu> Date: Thu Aug 10 19:06:25 2006 +0000 sse and 3dnow implementations of float->int conversion and mdct windowing. 15% faster vorbis. and commit `069720565c` Author: Loren Merritt <lorenm@u.washington.edu> Date: Fri Aug 11 18:19:37 2006 +0000 vorbis simd tweaks Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Justin Ruggles	d5a7229ba4	Add a float DSP framework to libavutil Move vector_fmul() from DSPContext to AVFloatDSPContext.	13 years ago
Vitor Sessak	4a301706fd	x86: Avoid movs on BUTTERFLYPS when in AVX mode Signed-off-by: Janne Grunau <janne-libav@jannau.net>	13 years ago
Justin Ruggles	5cc6d5244d	lavr: replace the SSE version of ff_conv_fltp_to_flt_6ch() with SSE4 and AVX The current SSE version is slower than the MMX version on Athlon64 and Sandy Bridge, but the SSE4 and AVX versions are faster on Sandy Bridge.	13 years ago
Justin Ruggles	c8af852b97	Add libavresample This is a new library for audio sample format, channel layout, and sample rate conversion.	13 years ago
Reimar Döffinger	9b1f776d75	Fix compilation with NASM. Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>	13 years ago
Nico Weber	a4a88fd42c	Remove .rodata alignment kludge for Mach-O if a recent enough yasm is used. Yasm was fixed in its r2161 and yasm 0.8.0 (Apr 2010) contained this fix. Nasm was fixed in 2.06 (Jun 2009): https://groups.google.com/group/alt.lang.asm/browse_thread/thread/fcc85bbc3745d893 I tested with yasm 0.7.99 and yasm 1.2.0.7, where this works fine. I also tested with nasm. The nasm shipping with Xcode is too old to understand ffmpeg's assembly, before and after the patch. Nasm 2.10 fails to compile fft_mmx.asm on trunk with libavcodec/x86/fft_mmx.asm:88: panic: section ".text" has already been specified with alignment 32, conflicts with new alignment of 16 but builds fine if I change the two alignment "16"s in x86inc.asm to "32". With this patch, nasm 2.10 fails with libavcodec/x86/fft_mmx.asm:39: panic: section ".rodata" has already been specified with alignment 32, conflicts with new alignment of 16 instead, but again builds fine with s/16/32/. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Loren Merritt	705f3d4759	x86inc: support AVX abstraction for 2-operand instructions Add cvtdq2ps and cvtps2dq to the AVX instruction list. Signed-off-by: Justin Ruggles <justin.ruggles@gmail.com>	13 years ago
Diego Biurrun	baaab6069a	build: Move all arch OBJS declarations into arch subdirectory Makefiles.	13 years ago
Henrik Gramner	729f90e268	x86inc improvements for 64-bit Add support for all x86-64 registers Prefer caller-saved register over callee-saved on WIN64 Support up to 15 function arguments Also (by Ronald S. Bultje) Fix up our asm to work with new x86inc.asm. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Justin Ruggles <justin.ruggles@gmail.com>	13 years ago
Ronald S. Bultje	98b9da2ac7	x86inc: add *mp named argument support to DEFINE_ARGS.	13 years ago
Loren Merritt	0f53d0cf4b	x86inc: don't "bake" stack_offset in named arguments. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Reimar Döffinger	b223035511	Detect and check for CMOV. Some MMX-only CPUs do not have support for CMOV. All SSE/MMX2 CPUs should be fine, thus no check was added to those functions. See also https://sourceforge.net/tracker/?func=detail&aid=3358347&group_id=205275&atid=992986 Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>	13 years ago
Haruhiko Yamagata	166f399377	x86inc: support yasm -f win64 flag also. This sets __OUTPUT_FORMAT__ to win64 instead of win32, even though both (through -m amd64) produce 64-bit binary code. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Henrik Gramner	9cf7385309	x86inc: allow manual use of WIN64_SPILL_XMM. Functions using INIT_MMX may still access XMM registers through direct means (xmm0-15). Therefore, they still need to be marked for clobber so they can be properly saved/restored. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Ronald S. Bultje	7e4d9d5d45	win64: add a XMM clobber test configure option. This will be useful to test more aggressively for failures to mark XMM registers as clobbered in Win64 builds, and prevent regressions thereof. Based on a patch by Ramiro Polla <ramiro.polla@gmail.com>	13 years ago
Ronald S. Bultje	412b248edb	x86inc.asm: fix typo. Assemblers don't understand ! in %if statements.	13 years ago
Ronald S. Bultje	3b15a6d742	config.asm: change %ifdef directives to %if directives. This allows combining multiple conditionals in a single statement.	13 years ago
Michael Niedermayer	f4a01d7788	x86inc: remove duplicate movlhps Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Vitor Sessak	39df0c434c	mpegaudiodec: optimized iMDCT transform Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Mans Rullgard	5b0d35eaed	x86: bswap: remove test for bswap instruction Firstly, this test never worked as intended, always reporting success. Secondly, bswap is available from 486 onward and can thus be assumed present. Signed-off-by: Mans Rullgard <mans@mansr.com>	13 years ago
Mans Rullgard	f64c2e710f	bswap: make generic implementation more compiler-friendly With these changes, gcc 4.5 and later recognise it as a bswap and use the proper instructions on ARM and x86. On x86, the 16-bit bswap is recognised from gcc 4.1. Signed-off-by: Mans Rullgard <mans@mansr.com>	13 years ago
Sean McGovern	be0675ce67	x86 cpuid: set vendor union members separately Solaris Studio (suncc) has difficulty with filling in members of a union. Instead, let's retrieve and store the cpuid() results separately. This is still a compiler bug, however this fix does not cause a regression on other platforms. Signed-off-by: Janne Grunau <janne-libav@jannau.net>	13 years ago
Vitor Sessak	ca55606a51	x86inc: Flag shufps as an floating-point instruction for the AVX emulation code. Without this, code like "shufps m0, m1, m2, 0xaa" would not work in CPUs not supporting SSE2. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Vitor Sessak	6b6ee58249	x86inc: Flag shufps as an floating-point instruction for the AVX emulation code. Without this, code like "shufps m0, m1, m2, 0xaa" would not work in CPUs not supporting SSE2. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Reimar Döffinger	e8ab4bac0c	Fix compilation with YASM. YASM does not support %ifempty even though it is in the official NASM documentation and supported since NASM 2.02. Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>	13 years ago
Reimar Döffinger	03f1caec61	HACK: fix compilation with NASM. This issue is actually already fixed in NASM, however it will be a bit before it arrives everywhere. This should work as a temporary work-around. Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>	13 years ago
Vitor Sessak	22e25c002e	mpegaudiodec: add SSE-optimized imdct36() Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Justin Ruggles	f2bd8a0786	x86inc: use sse versions of common macros instead of sse2 when applicable	13 years ago
Loren Merritt	2f7f2e4b41	Update x86inc.asm to latest x264 version, and add AVX symmetry. We keep INIT_AVX (for backwards compatibility). 3arg AVX ops with a memory arg can only have it in src2, whereas SSE emulation of 3arg prefers to have it in src1 (i.e. the mov). So, if the op is symmetric and the wrong one is memory, swap them.	13 years ago
Justin Ruggles	4e8e262476	fmtconvert: port int32_to_float_fmul_scalar() x86 inline asm to yasm	13 years ago
Jason Garrett-Glaser	96a59cf37b	x86: XOP/FMA4 CPU detection support	13 years ago
Sean McGovern	5938e02185	cpu detection: avoid a signed overflow 1<<31 overflows because 1 is signed, so force it to unsigned. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Ronald S. Bultje	38e06c2969	Move clipd macros to x86util.asm. This allows sharing them between multiple .asm files.	13 years ago
Ronald S. Bultje	b2c087871d	Move x86util.asm from libavcodec/ to libavutil/. This allows using it in swscale also.	13 years ago
Ronald S. Bultje	3a39195b1d	Move x86inc.asm to libavutil/. This allows using it in libswscale/ also.	13 years ago
Jason Garrett-Glaser	15919ee48f	bswap: use native types for av_bwap16(). This prevents a call to bytestream_get_be16() using a movzwl both before and after the ror instruction, which is obviously inefficient. Arm uses the same trick also. Sintel decoding goes from (avg+SD) 9.856 +/- 0.003 to 9.797 +/- 0.003 sec. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	14 years ago
Justin Ruggles	45ed822550	cosmetics: indentation	14 years ago
Justin Ruggles	eba586b0d9	Add a CPU flag for the Atom processor. The Atom has SSSE3 support, which is useful in many cases, but sometimes the SSSE3 version is slower than the SSE2 equivalent on the Atom, but is generally faster on other processors supporting SSSE3. This flag allows for selectively disabling certain SSSE3 functions on the Atom.	14 years ago
Mans Rullgard	2912e87a6c	Replace FFmpeg with Libav in licence headers Signed-off-by: Mans Rullgard <mans@mansr.com>	14 years ago

... 2 3 4 5 6 ...

331 Commits (4a993335812f7e9b2088145c7d043c01019c5d04)