FFmpeg

Author	SHA1	Message	Date
sunyuechi	0b9d009b4a	lavc/vc1dsp: R-V V inv_trans C908: vc1dsp.vc1_inv_trans_4x4_dc_c: 125.7 vc1dsp.vc1_inv_trans_4x4_dc_rvv_i32: 53.5 vc1dsp.vc1_inv_trans_4x8_dc_c: 230.7 vc1dsp.vc1_inv_trans_4x8_dc_rvv_i32: 65.5 vc1dsp.vc1_inv_trans_8x4_dc_c: 228.7 vc1dsp.vc1_inv_trans_8x4_dc_rvv_i64: 64.5 vc1dsp.vc1_inv_trans_8x8_dc_c: 476.5 vc1dsp.vc1_inv_trans_8x8_dc_rvv_i64: 80.2 Signed-off-by: Rémi Denis-Courmont <remi@remlab.net>	2 years ago
Andreas Rheinhardt	333b32af8e	avcodec/h264chroma: Constify src in h264_chroma_mc_func Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	3 years ago
Andreas Rheinhardt	40e6575aa3	all: Replace if (ARCH_FOO) checks by #if ARCH_FOO This is more spec-compliant because it does not rely on dead-code elimination by the compiler. Especially MSVC has problems with this, as can be seen in https://ffmpeg.org/pipermail/ffmpeg-devel/2022-May/296373.html or https://ffmpeg.org/pipermail/ffmpeg-devel/2022-May/297022.html This commit does not eliminate every instance where we rely on dead code elimination: It only tackles branching to the initialization of arch-specific dsp code, not e.g. all uses of CONFIG_ and HAVE_ checks. But maybe it is already enough to compile FFmpeg with MSVC with whole-program-optimizations enabled (if one does not disable too many components). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	3 years ago
Ben Avison	2e26847780	avcodec/vc1: Introduce fast path for unescaping bitstream buffer Includes a checkasm test. Signed-off-by: Ben Avison <bavison@riscosopen.org> Signed-off-by: Martin Storsjö <martin@martin.st>	3 years ago
Martin Storsjö	db54426975	vc1dsp: Change remaining stride parameters to ptrdiff_t The existing x86 assembly for loop filters uses the stride as a full register without clearing/sign extending the upper half of the registers on x86_64. This avoids crashes if the caller would have passed nonzero bits in the previously undefined upper 32 bits of the parameters. Signed-off-by: Martin Storsjö <martin@martin.st>	3 years ago
Martin Storsjö	a78f136f3f	configure: Use a separate config_components.h header for $ALL_COMPONENTS This avoids unnecessary rebuilds of most source files if only the list of enabled components has changed, but not the other properties of the build, set in config.h. Signed-off-by: Martin Storsjö <martin@martin.st>	3 years ago
Hao Chen	60ead5cd68	avcodec: [loongarch] Optimize vc1dsp with LASX. ./ffmpeg -i 11_wmv3_720p_24fps_7Mbps.wmv -f rawvideo -y /dev/null -an before: 131fps after: 229fps Reviewed-by: Shiyou Yin <yinshiyou-hf@loongson.cn> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	3 years ago
Michael Niedermayer	507ca66ee4	avcodec/vc1dsp: Avoid undefined shifts in vc1_v_s_overlap_c / vc1_h_s_overlap_c Fixes: left shift of negative value -13 Fixes: 15260/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_VC1_fuzzer-5702076048343040 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	6 years ago
Jerome Borsboom	975a1a81b2	avcodec/vc1: fix overlap filter for frame interlaced pictures The overlap filter is not correct for vertical edges in frame interlaced I and P pictures. When filtering macroblocks with different FIELDTX values, we have to match the lines at both sides of the vertical border. In addition, we have to use the correct rounding values, depending on the line we are filtering. Signed-off-by: Jerome Borsboom <jerome.borsboom@carpalis.nl>	7 years ago
Zhou Xiaoyong	5b74ebe937	avcodec/mips: version 1 of vc1dsp optimizations for loongson mmi Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
Diego Biurrun	e4a94d8b36	h264chroma: Change type of stride parameters to ptrdiff_t This avoids SIMD-optimized functions having to sign-extend their stride argument manually to be able to do pointer arithmetic.	9 years ago
Diego Biurrun	2ec9fa5ec6	idct: Change type of array stride parameters to ptrdiff_t ptrdiff_t is the correct type for array strides and similar.	9 years ago
Diego Biurrun	29c2d06d67	cosmetics: Drop empty comment lines	9 years ago
Michael Niedermayer	e3f7142306	avcodec/vc1dsp: add () to protect the arguments of the op* macros Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Ben Avison	adf8227cf4	vc-1: Add platform-specific start code search routine to VC1DSPContext. Initialise VC1DSPContext for parser as well as for decoder. Note, the VC-1 code doesn't actually use the function pointer yet. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	11 years ago
Diego Biurrun	58e65e44f4	vc1dsp: Add wrappers for {avg|put}_vc1_mspel_mc00_c This avoids invoking the wrapped functions with too many arguments.	11 years ago
Diego Biurrun	368f50359e	dsputil: Split off quarterpel bits into their own context	11 years ago
Ben Avison	9d8ecdd8ca	vc-1: Add platform-specific start code search routine to VC1DSPContext. Initialise VC1DSPContext for parser as well as for decoder. Note, the VC-1 code doesn't actually use the function pointer yet. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	11 years ago
Christophe Gisquet	319235c67c	vc1dsp: introduce cases for 8x8 and 16x16 This allows further unrolling the DSP implementation where possible. x86 and ARM DSP modified by simply moving the multiple calls from vc1dec to the DSP code. Decoding improvements should only occur because the compiler is actually able to unroll more. Decoding time: ~8.80s -> 8.64s (i.e. around 2%) Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	11 years ago
Diego Biurrun	3dc6272bed	Remove a number of unnecessary dsputil.h #includes	11 years ago
Janne Grunau	71617884a2	aarch64: h264 chroma motion compensation NEON optimizations RV40 and VC-1 use almost the same algorithm, so optimizations for those two decoders are easy to do and are included.	12 years ago
Michael Niedermayer	6d98959c8a	vc1: Add avg_no_rnd_vc1_chroma_mc4_c() Needed for proper interlaced support. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	12 years ago
Luca Barbato	c798a6fedc	vc1: Factorize out chroma MC	12 years ago
Luca Barbato	a1f5164814	vc1dsp: K&R formatting cosmetics Signed-off-by: Diego Biurrun <diego@biurrun.de>	12 years ago
Mason Carter	832e190632	vc1: arm: Add NEON assembly For: ff_vc1_inv_trans_{8,4}x{8,4}_{dc_,}neon ff_put_pixels8x8_neon ff_put_vc1_mspel_mc{0,1,2,3}{0,1,2,3}_neon (except for 00) Based on ARM assembly code in libavcodec/arm by Rob Clark and Mans Rullgard. Signed-off-by: Martin Storsjö <martin@martin.st>	12 years ago
Diego Biurrun	67e6a9f558	cosmetics: Place arch initialization calls in alphabetical order	12 years ago
Diego Biurrun	38282149b6	ppc: More consistent arch initialization	12 years ago
Michael Niedermayer	dd6e291e40	vc1dsp: add avg_no_rnd_vc1_chroma_mc4_c() Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	12 years ago
Michael Niedermayer	019b378d90	vc1: fix int/ptrdiff_t mismatches Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	12 years ago
Luca Barbato	a8b6015823	dsputil: convert remaining functions to use ptrdiff_t strides Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	12 years ago
Diego Biurrun	79dad2a932	dsputil: Separate h264chroma	13 years ago
Diego Biurrun	88bd7fdc82	Drop DCTELEM typedef It does not help as an abstraction and adds dsputil dependencies. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	13 years ago
Michael Niedermayer	075eaf8d6a	vc1dsp: fix the warning fix, make it work with --disable-asm Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Michael Niedermayer	fceeac9847	vc1dsp: fix pointer type warnings Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Janne Grunau	7e522859fc	x86: vc1: call ff_vc1dsp_init_x86() under if (ARCH_X86)	13 years ago
Martin Storsjö	1d9c2dc89a	Don't include common.h from avutil.h Signed-off-by: Martin Storsjö <martin@martin.st>	13 years ago
Michael Niedermayer	2278a3e5f7	vc1dsp: use av_assert2 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Mans Rullgard	bc92214e27	vc1dsp: mark put/avg_vc1_mspel_mc() always_inline This ensures that these functions are inlined into the per-position entry points, allowing constant propagation as needed for proper optimisation. 18% faster VC1 decoding on Cortex-A9. Signed-off-by: Mans Rullgard <mans@mansr.com>	13 years ago
Ronald S. Bultje	c23acbaed4	Don't use ff_cropTbl[] for IDCT. Results of IDCT can by far outreach the range of ff_cropTbl[], leading to overreads and potentially crashes. Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind CC: libav-stable@libav.org	13 years ago
Michael Niedermayer	32f0c65828	vc1: fix out of array reads in vc1_inv_trans_4x4_c() Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Michael Niedermayer	80c702efeb	vc1: fix out of array reads in vc1_inv_trans_4x8_c() Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Michael Niedermayer	af796ba4b8	vc1: fix out of array reads in vc1_inv_trans_8x4_c() Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	13 years ago
Mans Rullgard	373211d828	Remove extraneous semicolons These semicolons cause invalid empty top-level declarations. Signed-off-by: Mans Rullgard <mans@mansr.com>	14 years ago
Mashiat Sarker Shakkhar	cad16562c8	vc1dec: interlaced stream decoding support 3/3 Cosmetics: break some lines and reformat TODOs Signed-off-by: Anton Khirnov <anton@khirnov.net>	14 years ago
Alberto Delmás	45ecda8554	Windows Media Image decoder (WMVP/WVP2) Signed-off-by: Anton Khirnov <anton@khirnov.net>	14 years ago
Ronald S. Bultje	7d2e03afc8	vc1: make overlap filter for I-frames bit-exact.	14 years ago
Ronald S. Bultje	18b6a69ce9	Revert "VC1: merge idct8x8, coeff adjustments and put_pixels." This reverts commit `f8bed30d8b`. The reason for this is that the overlap filter, which runs after IDCT, should run on unclamped values, and thus IDCT and put_pixels() cannot be merged if we want to attempt to be bitexact.	14 years ago
Mans Rullgard	2912e87a6c	Replace FFmpeg with Libav in licence headers Signed-off-by: Mans Rullgard <mans@mansr.com>	14 years ago
Ronald S. Bultje	6a786b15c3	VC1: merge idct8x8, coeff adjustments and put_pixels. Merging these functions allows merging some loops, which makes the results (particularly after SIMD optimizations) much faster. (cherry picked from commit `f8bed30d8b`)	14 years ago
Ronald S. Bultje	f8bed30d8b	VC1: merge idct8x8, coeff adjustments and put_pixels. Merging these functions allows merging some loops, which makes the results (particularly after SIMD optimizations) much faster.	14 years ago

109 Commits (89de2f0de1a41349fe827c00c8f52ca3c12594ad)