FFmpeg

Author	SHA1	Message	Date
Andreas Rheinhardt	4618f36a24	avcodec/x86/h264dsp_init: Remove obsolete MMX(EXT) functions x64 always has MMX, MMXEXT, SSE and SSE2 and this means that some functions for MMX, MMXEXT and 3dnow are always overridden by other functions (unless one e.g. explicitly disables SSE2) for x64. So given that the only systems that benefit from these functions are truely ancient 32bit x86s they are removed. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	3 years ago
James Almer	2c844c9828	x86/h264_deblock: fix warning about trailing empty parameter Fixes part of ticket #8771 An error occurred Signed-off-by: James Almer <jamrial@gmail.com>	5 years ago
Janne Grunau	156ea66c91	h264/x86: sign extend int stride in deblock functions Fixes checkasm errors after adding the h264 deblock tests.	6 years ago
James Darnley	33de0fee2c	avcodec/h264: enable sse2 chroma deblock/loop filter functions Between 1.00 and 1.16 times faster on Intel Yorkfield Core 2 Quad. Between 1.11 and 1.39 times faster on Intel Kaby Lake Pentium.	8 years ago
James Darnley	cd893b9307	avcodec/h264: add avx 8-bit 4:2:2 chroma h intra deblock/loop filter ~1.37x faster (147 vs. 108 cycles) compared to mmxext function	8 years ago
James Darnley	0e16b3e2be	avcodec/h264: add avx 8-bit 4:2:0 chroma h intra deblock/loop filter ~1.10x faster (69 vs. 63 cycles) compared to mmxext function	8 years ago
James Darnley	987ffe4b8d	avcodec/h264: add avx 8-bit chroma v intra deblock/loop filter ~1.14x faster (90 vs 78 cycles) compared with mmxext	8 years ago
James Darnley	88307b3eec	avcodec/h264: add avx 8-bit 4:2:2 chroma h deblock/loop filter ~1.21x faster (68 vs. 56 cycles) compared with mmxext function	8 years ago
James Darnley	ac096fc82d	avcodec/h264: add avx 8-bit 4:2:0 chroma h deblock/loop filter ~1.14x faster (93 vs. 81 cycles) compared with mmxext function	8 years ago
James Darnley	5c56758843	avcodec/h264: add avx 8-bit chroma v deblock/loop filter ~1.24x faster (101 vs. 81 cycles) compared with mmxext function	8 years ago
James Darnley	5336887867	avcodec/h264: sse2, avx h luma mbaff deblock/loop filter x86-64 only Yorkfield: - sse2: ~2.17x (434 vs. 200 cycles) Nehalem: - sse2: ~2.94x (409 vs. 139 cycles) Skylake: - sse2: ~3.10x (370 vs. 119 cycles) - avx: ~3.29x (370 vs. 112 cycles)	8 years ago
James Darnley	e18bc2114f	avcodec/h264: add named parameters to x86 function	8 years ago
James Darnley	9d815b7424	avcodec/x86: deduplicate PASS8ROWS macro	8 years ago
James Darnley	815ea8c6cc	avcodec/h264: mmxext 4:2:2 chroma intra deblock/loop filter 2.1 times faster (401 vs. 194 cycles)	8 years ago
Henrik Gramner	aa751573fe	avcodec/h264: Fix segfault in 4:2:2 chroma deblock with 32-bit msvc Using rNm and x86inc's stack allocation with a negative value at the same time isn't supported, and caused the original stack pointer to be clobbered when using a compiler that doesn't support stack alignment.	9 years ago
James Darnley	7042a55c55	avcodec/h264: mmxext 4:2:2 chroma deblock/loop filter 2.6 times faster (366 vs. 142 cycles)	9 years ago
Henrik Gramner	9f1245eb96	x86inc: Support arbitrary stack alignments Change ALLOC_STACK to always align the stack before allocating stack space for consistency. Previously alignment would occur either before or after allocating stack space depending on whether manual alignment was required or not. Signed-off-by: Anton Khirnov <anton@khirnov.net>	10 years ago
Henrik Gramner	826790f596	x86inc: Support arbitrary stack alignments Change ALLOC_STACK to always align the stack before allocating stack space for consistency. Previously alignment would occur either before or after allocating stack space depending on whether manual alignment was required or not.	10 years ago
Diego Biurrun	79793f8337	Update Fiona's name in copyright statements.	11 years ago
Martin Storsjö	570d4b2186	x86: h264: Don't keep data in the redzone across function calls on 64 bit unix We know that the called function (ff_chroma_inter_body_mmxext) doesn't touch the redzone, and thus will be kept intact - thus, this doesn't fix any bug per se. However, valgrind's memcheck tool intentionally assumes that the redzone is clobbered on every function call and function return (see a long comment in valgrind/memcheck/mc_main.c). This avoids false positives in that tool, at the cost of an extra stack pointer adjustment. The other alternative would be a valgrind suppression for this issue, but that's an extra burden for everybody that wants to run libavcodec within valgrind. Signed-off-by: Martin Storsjö <martin@martin.st>	11 years ago
Diego Biurrun	55519926ef	x86: Make function prototype comments in assembly code consistent This helps grepping for functions, among other things.	11 years ago
Henrik Gramner	bbe4a6db44	x86inc: Utilize the shadow space on 64-bit Windows Store XMM6 and XMM7 in the shadow space in functions that clobbers them. This way we don't have to adjust the stack pointer as often, reducing the number of instructions as well as code size. Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>	12 years ago
Ronald S. Bultje	b93b27edb0	dsputil: Make dsputil selectable Signed-off-by: Martin Storsjö <martin@martin.st>	12 years ago
Ronald S. Bultje	6a701306db	dsputil: make selectable. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	12 years ago
Matt Wolenetz	82a4a4e7ca	Fix Win64 AVX h264_deblock by not using redzone on Win64 Thanks-to: "Ronald S. Bultje" <rsbultje@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	12 years ago
Matt Wolenetz	311443f6c7	x86: h264: Don't use redzone in AVX h264_deblock on Win64 This fixes crashes in chromium on win64 on machines with AVX (crashes that apparently aren't triggered by fate). Signed-off-by: Martin Storsjö <martin@martin.st>	12 years ago
Ronald S. Bultje	ce58642ed0	x86inc: support stack mem allocation and re-alignment in PROLOGUE. Use this in VP8/H264-8bit loopfilter functions so they can be used if there is no aligned stack (e.g. MSVC 32bit or ICC 10.x). Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	12 years ago
Ronald S. Bultje	6f40e9f070	x86inc: support stack mem allocation and re-alignment in PROLOGUE Use this in VP8/H264-8bit loopfilter functions so they can be used if there is no aligned stack (e.g. MSVC 32bit or ICC 10.x). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	12 years ago
Diego Biurrun	26301caaa1	x86: mmx2 ---> mmxext in asm constructs	12 years ago
Diego Biurrun	04581c8c77	x86: yasm: Use complete source path for macro helper %includes This is more consistent with the way we handle C #includes and it simplifies the build system.	13 years ago
Diego Biurrun	6860b4081d	x86: include x86inc.asm in x86util.asm This is necessary to allow refactoring some x86util macros with cpuflags.	13 years ago
Carl Eugen Hoyos	52be5428c0	Add some missing _EXTERNAL suffixes to yasm source files.	13 years ago
Ronald S. Bultje	b829b4ce29	h264: convert loop filter strength dsp function to yasm. This completes the conversion of h264dsp to yasm; note that h264 also uses some dsputil functions, most notably qpel. Performance-wise, the yasm-version is ~10 cycles faster (182->172) on x86-64, and ~8 cycles faster (201->193) on x86-32.	13 years ago
Ronald S. Bultje	a5bbb1242c	h264_loopfilter: port x86 simd to cpuflags.	13 years ago
Henrik Gramner	729f90e268	x86inc improvements for 64-bit Add support for all x86-64 registers Prefer caller-saved register over callee-saved on WIN64 Support up to 15 function arguments Also (by Ronald S. Bultje) Fix up our asm to work with new x86inc.asm. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Justin Ruggles <justin.ruggles@gmail.com>	13 years ago
Ronald S. Bultje	8fb26950ed	h264: don't use redzone in loopfilter on win64. Red zone usage is not allowed in the Win64 ABI.	13 years ago
Michael Niedermayer	f9caec0cf9	h264: change deblock_h_chroma_8_mmxext() to prevent valgrind confusion. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Reimar Döffinger	f51a072160	Fix compilation without HAVE_AVX. %ifdef HAVE_AVX must now be %if HAVE_AVX. Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>	13 years ago
Ronald S. Bultje	3b15a6d742	config.asm: change %ifdef directives to %if directives. This allows combining multiple conditionals in a single statement.	13 years ago
Kieran Kunhya	b1766c170c	Move x264asm to libavutil. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	14 years ago
Dave Yeo	cc73511e8e	Fix NASM include directive Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	14 years ago
Ronald S. Bultje	b2c087871d	Move x86util.asm from libavcodec/ to libavutil/. This allows using it in swscale also.	14 years ago
Ronald S. Bultje	3a39195b1d	Move x86inc.asm to libavutil/. This allows using it in libswscale/ also.	14 years ago
Jason Garrett-Glaser	a3bf7b864a	H.264: tweak some other x86 asm for Atom	14 years ago
Carl Eugen Hoyos	5fb67d8039	Fix compilation with old yasm.	14 years ago
Diego Biurrun	888fa31eca	Fix FSF address copy paste error in some license headers.	14 years ago
Jason Garrett-Glaser	5705b02079	10-bit H.264 x86 chroma v loopfilter asm Also delete some unused deblock asm macros.	14 years ago
Jason Garrett-Glaser	9f3d6ca4f1	Port x86 10-bit H.264 deblock asm from x264	14 years ago
Jason Garrett-Glaser	8ad77b65b5	Update x86 H.264 deblock asm Includes AVX versions from x264.	14 years ago
Mans Rullgard	2912e87a6c	Replace FFmpeg with Libav in licence headers Signed-off-by: Mans Rullgard <mans@mansr.com>	14 years ago

1 2

65 Commits (5c55e4e2975dd6e577fb1b4597e5292496ec8cbb)