FFmpeg

Commit Graph

Author	SHA1	Message	Date
Anton Khirnov	12004a9a7f	audiodsp/x86: yasmify vector_clipf_sse	8 years ago
Anton Khirnov	75d98e30af	audiodsp/x86: clear the high bits of the order parameter on 64bit Also change shl to add, since it can be faster on some CPUs. CC: libav-stable@libav.org	8 years ago
Anton Khirnov	1d6c76e11f	audiodsp/x86: fix ff_vector_clip_int32_sse2 This version, which is the only one doing two processing cycles per loop iteration, computes the load/store indices incorrectly for the second cycle. CC: libav-stable@libav.org	8 years ago
Henrik Gramner	ab43beefab	x86inc: Drop SECTION_TEXT macro The .text section is already 16-byte aligned by default on all supported platforms so `SECTION_TEXT` isn't any different from `SECTION .text`. Signed-off-by: Anton Khirnov <anton@khirnov.net>	9 years ago
Diego Biurrun	9a9e2f1c8a	dsputil: Split audio operations off into a separate context	11 years ago
Diego Biurrun	054013a0fc	dsputil: Move APE-specific bits into apedsp	11 years ago
Diego Biurrun	0d439fbede	dsputil: Split off HuffYUV decoding bits into their own context Also shorten HuffYUV context member names to avoid clutter.	11 years ago
Diego Biurrun	57b5b84e20	x86: dsputil: Move ff_apply_window_int16_* bits to ac3dsp, where they belong	11 years ago
Diego Biurrun	55519926ef	x86: Make function prototype comments in assembly code consistent This helps grepping for functions, among other things.	11 years ago
Ronald S. Bultje	610b18e2e3	x86: qpel: Move fullpel and l2 functions to a separate file This way, they can be shared between mpeg4qpel and h264qpel without requiring either one to be compiled unconditionally. Signed-off-by: Martin Storsjö <martin@martin.st>	12 years ago
Janne Grunau	e5c2794a71	x86: consistently use unaligned movs in the unaligned bswap Fixes fate errors in asv1, ffvhuff and huffyuv on x86_32.	12 years ago
Diego Biurrun	e8c52271c4	Revert "Move H264/QPEL specific asm from dsputil.asm to h264_qpel_*.asm." This reverts commit `f90ff772e7`. The code should be put back in h264_qpel_8bit.asm, but unfortunately it is unconditionally used from dsputil_mmx.c since `71155d7`.	12 years ago
Daniel Kang	9acd23d655	x86: dsputil: Fix h263 loop filter link error in some configurations This was caused by unconditionally referencing a conditionally compiled table. Now the code is also compiled conditionally. Signed-off-by: Diego Biurrun <diego@biurrun.de>	12 years ago
Daniel Kang	a1d3673034	dsputil: x86: Fix compile error Accidentally prefixed ff_ with cextern. Signed-off-by: Martin Storsjö <martin@martin.st>	12 years ago
Daniel Kang	659d4ba5af	dsputil: x86: Convert h263 loop filter to yasm Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	12 years ago
Diego Biurrun	52acd79165	x86: hpel: Move {avg,put}_pixels16_sse2 to hpeldsp	12 years ago
Ronald S. Bultje	f90ff772e7	Move H264/QPEL specific asm from dsputil.asm to h264_qpel_*.asm.	12 years ago
Ronald S. Bultje	d56668bd80	floatdsp: move scalarproduct_float from dsputil to avfloatdsp. This makes the aac decoder and all voice codecs independent of dsputil.	12 years ago
Ronald S. Bultje	42d3246948	floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp. Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.	12 years ago
Ronald S. Bultje	55aa03b9f8	floatdsp: move vector_fmul_add from dsputil to avfloatdsp.	12 years ago
Ronald S. Bultje	8a4f26206d	dsputil: remove butterflies_float_interleave. The function is unused.	12 years ago
Ronald S. Bultje	8c53d39e7f	lavc: introduce VideoDSPContext Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch(). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	12 years ago
Daniel Kang	610e00b359	x86: h264: Convert 8-bit QPEL inline assembly to YASM Signed-off-by: Diego Biurrun <diego@biurrun.de>	12 years ago
Diego Biurrun	87af05c575	x86: SPLATD: port to cpuflags	12 years ago
Diego Biurrun	8c3849bc76	x86: dsputil: port to cpuflags	12 years ago
Diego Biurrun	26301caaa1	x86: mmx2 ---> mmxext in asm constructs	12 years ago
Diego Biurrun	2b479bcab0	build: Drop AVX assembly ifdefs An assembler able to cope with AVX instructions is now required.	12 years ago
Diego Biurrun	04581c8c77	x86: yasm: Use complete source path for macro helper %includes This is more consistent with the way we handle C #includes and it simplifies the build system.	12 years ago
Diego Biurrun	6860b4081d	x86: include x86inc.asm in x86util.asm This is necessary to allow refactoring some x86util macros with cpuflags.	12 years ago
Diego Biurrun	17337f54c0	x86: Split inline and external assembly #ifdefs	12 years ago
Diego Biurrun	3b9e832e17	x86: Drop silly "_yasm" suffixes from filenames	12 years ago
Mans Rullgard	a3df4781f4	x86: add colons after labels nasm prints a warning if the colon is missing. Signed-off-by: Mans Rullgard <mans@mansr.com>	12 years ago
Ronald S. Bultje	da6505ad2f	dsputil: make add_hfyu_left_prediction_sse4() support unaligned src. This makes add_hfyu_left_prediction_sse4() handle sources that are not 16-byte aligned in its own function rather than by proxying the call to add_hfyu_left_prediction_ssse3(). This fixes a crash on Win64, since the sse4 version clobberes xmm6, but the ssse3 version (which uses MMX regs) does not restore it, thus leading to XMM clobbering and RSP being off. Fixes bug 342.	12 years ago
Ronald S. Bultje	30b45d9c38	x86inc: automatically insert vzeroupper for YMM functions.	12 years ago
Jason Garrett-Glaser	85a3c19ed1	dsputil: x86: add SHUFFLE_MASK_W macro Simplifies pshufb masks that operate on words.	13 years ago
Justin Ruggles	d5a7229ba4	Add a float DSP framework to libavutil Move vector_fmul() from DSPContext to AVFloatDSPContext.	13 years ago
Justin Ruggles	713548cbad	x86: lavc: use %if HAVE_AVX guards around AVX functions in yasm code. This is needed for older versions of yasm/nasm that do not support AVX. Signed-off-by: Diego Biurrun <diego@biurrun.de>	13 years ago
Kieran Kunhya	5ff01259a8	Convert vector_fmul range of functions to YASM and add AVX versions Signed-off-by: Justin Ruggles <justin.ruggles@gmail.com>	13 years ago
Ronald S. Bultje	b089ca871a	dsputil: fix optimized emu_edge function on Win64. Recent register allocation changes (x86inc.asm update) changed the register order and thus opcodes for the inner loops. One of them became >128bytes, which confuses other parts of this function where it jumps to fixed-offset positions to extend the edge by fixed amounts. A simple register change fixes this.	13 years ago
Henrik Gramner	729f90e268	x86inc improvements for 64-bit Add support for all x86-64 registers Prefer caller-saved register over callee-saved on WIN64 Support up to 15 function arguments Also (by Ronald S. Bultje) Fix up our asm to work with new x86inc.asm. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Justin Ruggles <justin.ruggles@gmail.com>	13 years ago
Christophe GISQUET	6b81da2fd0	dsputil x86: use SSE float instruction instead of SSE2 integer equivalent All the more required since the users are pure SSE functions. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Christophe GISQUET	7e1ce6a6ac	dsputil: remove shift parameter from scalarproduct_int16 There is only one caller, which does not need the shifting. Other use cases are situations where different roundings would be needed. The x86 and neon versions are modified accordingly. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Justin Ruggles	236a550c3f	Fix a typo in the x86 asm version of ff_vector_clip_int32() Specifies the correct number of xmm registers used so that they can be saved and restored on Win64 if necessary.	13 years ago
Christophe Gisquet	6b03900382	x86 dsputil: provide SSE2/SSSE3 versions of bswap_buf While pshufb allows emulating bswap on XMM registers for SSSE3, more shuffling is needed for SSE2. Alignment is critical, so specific codepaths are provided for this case. For the huffyuv sequence "angels_480-huffyuvcompress.avi": C (using bswap instruction): ~ 55k cycles SSE2: ~ 40k cycles SSSE3 using unaligned loads: ~ 35k cycles SSSE3 using aligned loads: ~ 30k cycles Signed-off-by: Diego Biurrun <diego@biurrun.de>	13 years ago
Ronald S. Bultje	3b15a6d742	config.asm: change %ifdef directives to %if directives. This allows combining multiple conditionals in a single statement.	13 years ago
Justin Ruggles	0e8fdd41c2	dsputil: use cpuflags in x86 emu_edge_core avoids passing around the extra argument among all the macros it uses	13 years ago
Justin Ruggles	395f2e70dd	dsputil: use movups instead of movdqu in ff_emu_edge_core_sse() This allows emulated_edge_mc_sse() and gmc_sse() to be used under AV_CPU_FLAG_SSE.	13 years ago
Justin Ruggles	9d06037d48	twinvq: add SSE/AVX optimized sum/difference stereo interleaving	13 years ago
Justin Ruggles	b8f02f5b4e	dsputil: use cpuflags in x86 versions of vector_clip_int32()	13 years ago
Justin Ruggles	4e8e262476	fmtconvert: port int32_to_float_fmul_scalar() x86 inline asm to yasm	13 years ago

5 Commits (5801f9ed245ca5ebb57b0b5183de7a24aaece133)