FFmpeg

Commit Graph

Author	SHA1	Message	Date
David Conrad	19530266a5	Enable SSE2 (put\|avg)_pixels_16_sse2 SVQ1 chroma has been special-cased aligned to 16-bytes since at least r15466 Other architectures also assume 16-byte alignment here too but set STRIDE_ALIGN to 16. Originally committed as revision 21736 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Reimar Döffinger	3d05c1fbec	Make the jump-table section-relative for x86_64 with PIC enabled. This allows to get rid of the macho64 specific hack that moves them to rodata (with worse cache behaviour) and avoids textrels which e.g. Gentoo does not allow for x86_64 libraries. Originally committed as revision 21551 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	900479bb74	optimize h264_loop_filter_strength_mmx2 244->160 cycles on core2 Originally committed as revision 21462 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Alex Converse	3deb53849e	Implement an sse version of scalarproduct_float(). Originally committed as revision 21386 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	c67278098d	Move array specifiers outside DECLARE_ALIGNED() invocations Originally committed as revision 21377 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
David Conrad	1f630b9717	Use two separate memory arguments since 8+() is invalid gas syntax Originally committed as revision 21360 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Michael Niedermayer	b4c2ada528	Attempt to fix asm compilation failure. Only tested on gcc 4 & x86_64. Originally committed as revision 21355 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	5e7dfb7de1	Move COPY3_IF_LT to lavc/mathops.h This obscure macro is only used in motion_est.c so having it in lavc makes more sense. See discussion here: http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/2008-November/056561.html Originally committed as revision 21346 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
David Conrad	c4f2b6dce3	Use constant offsets for memory operands since gcc is unable to This fixes gcc failing to fit 6 memory locations into 7 registers on x86-32 Originally committed as revision 21337 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Michael Niedermayer	9ac4548ff7	Fix h264_loop_filter_strength_mmx2() so it works with b frames. Originally committed as revision 21327 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Michael Niedermayer	ebddd2e253	Remove -2 -> -1 remapping, its not needed anymore as we must remap all references per LUT anyway. Originally committed as revision 21323 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Gwenole Beauchesne	5716aec3f9	Fix XvMC. XvMCCreateBlocks() may not allocate 16-byte aligned blocks, so we can't use SSE-optimized routines. Originally committed as revision 21011 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Reimar Döffinger	4a1289450a	Reduce number of ASM constraints for ff_lpc_compute_autocorr_sse2 since it causes no significant speed difference and can avoid compilation issues with --enable-pic. Originally committed as revision 21003 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Diego Biurrun	4052cbf161	Get rid of pointless CONFIG_ANY_H263 preprocessor definition. Originally committed as revision 20975 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	758c7455f1	fix a crash in ape decoding on x86_32 sse2 Originally committed as revision 20777 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	a4605efdf5	slightly faster scalarproduct_and_madd_int16_ssse3 on penryn, no change on conroe Originally committed as revision 20743 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	91e644ff77	r20739 broke compilation on systems without yasm Originally committed as revision 20742 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	b1159ad928	refactor and optimize scalarproduct 29-105% faster apply_filter, 6-90% faster ape decoding on core2 (Any x86 other than core2 probably gets much less, since this is mostly due to ssse3 cachesplit avoidance and I haven't written the full gamut of other cachesplit modes.) 9-123% faster ape decoding on G4. Originally committed as revision 20739 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	b10fa1bb8b	port ape dsp functions from sse2 to mmx now requires yasm Originally committed as revision 20722 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	4521308363	s/movdqa/movaps/ in sse1 fft. (regression in r20293) Originally committed as revision 20371 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	b07781b6e4	fix linking on systems with a function name prefix (10l in r20287) Originally committed as revision 20294 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	29e4edbbe7	sync yasm macros to x264 Originally committed as revision 20293 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	e17ccf60fe	huffyuv: add some const qualifiers Originally committed as revision 20290 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Loren Merritt	2f77923d72	simd add_hfyu_left_prediction 2.2x faster than C on conroe, 3.6x on penryn. 4-6% faster huffyuv decoding if using left or plane mode and yuv Originally committed as revision 20287 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Justin Ruggles	f4d608e344	add CONFIG_LPC to the build system for lpc dsputil functions. fixes build problems when lpc.c is not compiled. Originally committed as revision 20285 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Justin Ruggles	fde82ca7e4	Move autocorrelation function from flacenc.c to lpc.c. Also rename the corresponding dsputil functions and remove their dependency on the FLAC encoder. Fixes Issue1486. Originally committed as revision 20266 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Reimar Döffinger	ec65675504	Use MANGLE in cavsdsp, the current version using "m" constraints will not compile on e.g. OpenBSD due to running out of registers. Originally committed as revision 20123 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Reimar Döffinger	003121091e	Replace several #ifdef PIC with the more obvious and correct #if !HAVE_EBX_AVAILABLE, since all it does is avoid using ebx. Originally committed as revision 20094 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	35de5d2412	cosmetics: fix indentation after previous commit Originally committed as revision 20062 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	952e872198	Drop unused args from vector_fmul_add_add, simpify code, and rename The src3 and step arguments to vector_fmul_add_add() are always zero and one, respectively. This removes these arguments from the function, simplifies the code accordingly, and renames the function to better match the new operation. Originally committed as revision 20061 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	01b2214758	Merge FFTContext and MDCTContext Originally committed as revision 19931 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	f486321395	Move per-arch fft init bits into the corresponding subdirs Originally committed as revision 19864 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	4e36a5b46f	Move declarations of some mmx functions to dsputil_mmx.h Originally committed as revision 19739 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Vitor Sessak	9263a05aab	Mark "i" parameter of vector_clipf_sse() as early-clobber Originally committed as revision 19731 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Vitor Sessak	50e23ae9d3	Mark parameter src of vector_clipf() as const Originally committed as revision 19729 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Vitor Sessak	0a68cd876e	SSE optimized vector_clipf(). 10% faster TwinVQ decoding. Originally committed as revision 19728 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
John Adcock	3f87f39cb8	Update x264 asm code to latest to add support for 64-bit Windows. Use the new x86inc features to support 64-bit Windows on all non-x264 nasm assembly code as well. Patch by John Adcock, dscaler.johnad AT googlemail DOT com. Win64 changes originally by Anton Mitrofanov. x86util changes mostly by Holger Lubitz. Originally committed as revision 19580 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Diego Biurrun	9be6f0d2f8	Do not check for both CONFIG_VC1_DECODER and CONFIG_WMV3_DECODER, the former depends upon the latter. Originally committed as revision 19533 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Diego Biurrun	99e5a9d1ea	Do not redundantly check for both CONFIG_THEORA_DECODER and CONFIG_VP3_DECODER. The Theora decoder depends on the VP3 decoder. Originally committed as revision 19492 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Carl Eugen Hoyos	36904c4c9f	Icc 11.1 still does not align the stack pointer, disable some x264 functions. Originally committed as revision 19454 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Jason Garrett-Glaser	73b02e2460	SSE version of clear_blocks Originally committed as revision 19206 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Jason Garrett-Glaser	4f717c69ed	idct_dc for VC-1/WMV3 decoder; ~11% faster decoding overall. Includes mmx2 asm for the various functions. Note that the actual idct still does not have an x86 SIMD implemtation. For wmv3 files using regular idct, the decoder just falls back to simple_idct, since simple_idct_dc doesn't exist (yet). Originally committed as revision 19204 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Ramiro Polla	74a841af8b	Replace more uses of __attribute__((aligned)) by DECLARE_ALIGNED. Originally committed as revision 19089 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Ramiro Polla	989b7181ac	Use fewer macros in x86-optimized mlpdsp. Fixes compilation on 32-bit llvm which didn't allow a cast in an m operand. Originally committed as revision 19086 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Alexander Strange	2b9969a945	H264: Fix out of bounds reads in SSSE3 MC Reading above src[-2] isn't safe, so move loads and palignr ahead 3 pixels to load starting at the first pixel actually used. Fixes issue941. Originally committed as revision 18999 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Ramiro Polla	7c4c60e520	mlp: Use LABEL_MANGLE() to export label symbols from inside asm block. Originally committed as revision 18935 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Ramiro Polla	5624766d18	MLP DSP functions x86-optimized. 12.59% overall speedup in x86_32 9.98% overall speedup in x86_64 compared to gcc 4.3.3 Originally committed as revision 18903 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
David Conrad	c21c835b8d	avg_ pixel functions need to use (dst+pix+1)>>1 to average with existing pixels, not (dst+pix)>>1. This makes the mmx functions bitexact with the C functions. Originally committed as revision 18527 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
David Conrad	9bf0fdf378	VC1: extend MMX qpel MC to include MMX2 avg qpel Originally committed as revision 18519 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
David Conrad	8013da7364	VC1: add and use avg_no_rnd chroma MC functions Originally committed as revision 18518 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago

1 2 3 4

186 Commits (d93fdcbf5c0e70ca03aaad2581fd328f277cd7cb)