FFmpeg

Commit Graph

Author	SHA1	Message	Date
Anton Khirnov	71f1ad37d8	lavc: do not compile fmtconvert unconditionally Only ac3dec and dcadec use it.	10 years ago
Anton Khirnov	d74a8cb7e4	fmtconvert: drop unused functions	10 years ago
Seppo Tomperi	63ca0fe828	avcodec/hevcdsp: ARM NEON optimized qpel functions uses comma as macro parameter separator Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Michael Niedermayer	390c57781f	avcodec/arm/hevcdsp_idct_neon: drop ".code 32" gas-preprocessor and armasm fail otherwise Tested-by: Timotius Margo Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Seppo Tomperi	e40e446efd	hevcdsp: HEVC deblocking ARM NEON register clobber fix Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Peter Meerwald	702458538d	g722: Add ARM NEON implementation for g722_apply_qmf() Signed-off-by: Peter Meerwald <pmeerw@pmeerw.net> Signed-off-by: Martin Storsjö <martin@martin.st>	10 years ago
Michael Niedermayer	cab6302534	avcodec/arm/videodsp_armv5te: Fix linking failure with "g++ -shared -D__STDC_CONSTANT_MACROS -o test.so ... libavcodec.a" Tested-by: Andreas Haupt Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Seppo Tomperi	03cecf45c1	hevcdsp: ARM NEON optimized transforms cherry picked from commit b153f55935969c794de4640f8d34e01c58e027ae Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Seppo Tomperi	0c494114cc	hevcdsp: ARM NEON optimized deblocking filter cherry picked from commit 1b9ee47d2f43b0a029a9468233626102eb1473b8 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Carl Eugen Hoyos	f9f9ae1b77	lavc/arm: Use the neon vertical chroma loop filter also for H.264 4:2:2.	10 years ago
Janne Grunau	4c81613df4	arm: mlpdsp: handle pic offset calculation in a macro Makes the code easier to read since it hides different offset calculations for arm and thumb mode.	10 years ago
Janne Grunau	581c7f0e12	arm: make ff_mlp_filter_channel_arm and ff_mlp_rematrix_channel_arm position independent No significant difference in used cpu cycles on a cortex-a9.	10 years ago
Martin Storsjö	f963f80399	arm: Use .data.rel.ro for const data with relocations Signed-off-by: Martin Storsjö <martin@martin.st>	10 years ago
Martin Storsjö	b280c6202b	arm: fft_vfp: Unify the behaviour in ff_fft_calc_vfp between arm/thumb Don't include the function pointer table in the code segment in arm mode. This shouldn't have any significant performance effect. It does end up as a few more instructions than before, for ARM, but only at the entry to this function, not within the fft functions themselves. Signed-off-by: Martin Storsjö <martin@martin.st>	10 years ago
Martin Storsjö	ae81576414	arm: fft_vfp: Add a missing "endconst" when building in thumb mode Signed-off-by: Martin Storsjö <martin@martin.st>	10 years ago
Vittorio Giovara	9c12c6ff95	motion_est: convert stride to ptrdiff_t CC: libav-stable@libav.org Bug-Id: CID 700556 / CID 700557 / CID 700558	10 years ago
James Almer	3cec54b7d7	x86/flacdsp: add SSE2 and AVX decorrelate functions Two to four times faster depending on instruction set, block size and channel count.	10 years ago
James Almer	c99a882814	avcodec/idctdsp: change {put,add}_pixels_clamped to ptrdiff_t line_size Reviewed-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: James Almer <jamrial@gmail.com>	10 years ago
Bernd Kuhls	6b733be755	Fix compile error on arm4/arm5 platform Since these commits http://git.videolan.org/?p=ffmpeg.git;a=commitdiff;h=adf8227cf4e7b4fccb2ad88e1e09b6dc00dd00ed http://git.videolan.org/?p=ffmpeg.git;a=commitdiff;h=db7f1c7c5a1d37e7f4da64a79a97bea1c4b6e9f8 compilation on arm4/arm5 fails: libavcodec/libavcodec.so: undefined reference to `ff_startcode_find_candidate_armv6' Because libavcodec/arm/Makefile contains ARMV6-OBJS-$(CONFIG_STARTCODE) += arm/startcode_armv6.o function ff_startcode_find_candidate_armv6 is not included for older ARM archs. The bug was found during automatic buildroot builds: http://autobuild.buildroot.net/results/ec7/ec71e4f16ee9106747dff5f15999cbd17903e76f//build-end.log Quote from configure summary: ARCH arm (armv4t) big-endian no runtime cpu detection yes ARMv5TE enabled no ARMv6 enabled no ARMv6T2 enabled no http://autobuild.buildroot.net/results/be7/be72eb182eaccf0064a32c9dfc2ac1c0d6555506/build-end.log ARCH arm (armv5te) big-endian no runtime cpu detection yes ARMv5TE enabled yes ARMv6 enabled no ARMv6T2 enabled no This patch provides the necessary #if clauses as discussed with Michael: https://ffmpeg.org/pipermail/ffmpeg-devel/2014-September/163329.html Signed-off-by: Bernd Kuhls <bernd.kuhls@t-online.de> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Diego Biurrun	95c0cec03a	idctdsp: Add global function pointers for {add|put}_pixels_clamped functions These function pointers already existed in the ARM code. Adding them globally allows calls to the function pointers to access arch-optimized versions of the functions transparently.	10 years ago
Diego Biurrun	efd26bedec	build: Add explanatory comments to (optimization) blocks in the Makefiles	10 years ago
Diego Biurrun	835f798c7d	mpegvideo: cosmetics: Lowercase ugly uppercase MPV_ function name prefixes	10 years ago
James Almer	a8592db9bb	avcodec/idctdsp: make add/put_pixels_clamped_c internal functions This reduces code duplication and differences with the fork. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Michael Niedermayer	305f72aee7	avcodec: Change get_pixels() to ptrdiff_t linesize Found-by: ubitux Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Ben Avison	adf8227cf4	vc-1: Add platform-specific start code search routine to VC1DSPContext. Initialise VC1DSPContext for parser as well as for decoder. Note, the VC-1 code doesn't actually use the function pointer yet. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	10 years ago
Ben Avison	db7f1c7c5a	h264: Move start code search functions into separate source files. This permits re-use with parsers for codecs which use similar start codes. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	10 years ago
Michael Niedermayer	b051a1bbb9	avcodec/arm/idctdsp_init_arm*: Only select non bitexact IDCTs by default when bitexact is not set Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Diego Biurrun	7fb993d338	qpeldsp: Mark source pointer in qpel_mc_func function pointer const	10 years ago
Ben Avison	6869612f5c	arm: Macroize the test for 'setend' CPU instruction support Signed-off-by: Diego Biurrun <diego@biurrun.de>	10 years ago
Diego Biurrun	81b9bf3192	dct-test: Move arch-specific bits into arch-specific subdirectories	10 years ago
Diego Biurrun	4de8b60684	idct: Move arm-specific declarations to a header in the arm directory	10 years ago
Diego Biurrun	8b0dd4942a	idctdsp: prettyprinting cosmetics	10 years ago
Diego Biurrun	b4987f7219	idct: Convert IDCT permutation #defines to an enum Also rename the enum values to be consistent with other DCT permutations.	10 years ago
Martin Storsjö	7e18a727d2	arm: cosmetics: Consistently use lowercase for shift operators Signed-off-by: Martin Storsjö <martin@martin.st>	10 years ago
Martin Storsjö	fe67f3fbb5	arm: cosmetics: Fix a misaligned asm operand Signed-off-by: Martin Storsjö <martin@martin.st>	10 years ago
Ben Avison	87552d54d3	armv6: Accelerate ff_fft_calc for general case (nbits != 4) The previous implementation targeted DTS Coherent Acoustics, which only requires nbits == 4 (fft16()). This case was (and still is) linked directly rather than being indirected through ff_fft_calc_vfp(), but now the full range from radix-4 up to radix-65536 is available. This benefits other codecs such as AAC and AC3. The implementation is based upon the C version, with each routine larger than radix-16 calling a hierarchy of smaller FFT functions, then performing a post-processing pass. This pass benefits a lot from loop unrolling to counter the long pipelines in the VFP. A relaxed calling standard also reduces the overhead of the call hierarchy, and avoiding the excessive inlining performed by GCC probably helps with I-cache utilisation too. I benchmarked the result by measuring the number of gperftools samples that hit anywhere in the AAC decoder (starting from aac_decode_frame()) or specifically in the FFT routines (fft4() to fft512() and pass()) for the same sample AAC stream. Results (mean / StdDev): Audio decode: before 2245.5 / 53.1, after 1599.6 / 43.8, confidence 100.0%, change +40.4%. FFT routines: before 940.6 / 22.0, after 348.1 / 20.8, confidence 100.0%, change +170.2%. Signed-off-by: Martin Storsjö <martin@martin.st>	10 years ago
Ben Avison	5c22e8e4ad	armv6: Accelerate ff_imdct_half for general case (mdct_bits != 6) The previous implementation targeted DTS Coherent Acoustics, which only requires mdct_bits == 6. This relatively small size lent itself to unrolling the loops a small number of times, and encoding offsets calculated at assembly time within the load/store instructions of each iteration. In the more general case (codecs such as AAC and AC3) much larger arrays are used - mdct_bits == [8, 9, 11]. The old method does not scale for these cases, so more integer registers are used with non-unrolled versions of the loops (and with some stack spillage). The postrotation filter loop is still unrolled by a factor of 2 to permit the double-buffering of some VFP registers to facilitate overlap of neighbouring iterations. I benchmarked the result by measuring the number of gperftools samples that hit anywhere in the AAC decoder (starting from aac_decode_frame()) or specifically in ff_imdct_half_c / ff_imdct_half_vfp, for the same example AAC stream. Results (mean / StdDev): aac_decode_frame: before 2368.1 / 35.8, after 2117.2 / 35.3, confidence 100.0%, change +11.8%. ff_imdct_half_*: before 457.5 / 22.4, after 251.2 / 16.2, confidence 100.0%, change +82.1%. Signed-off-by: Martin Storsjö <martin@martin.st>	10 years ago
Diego Biurrun	2d60444331	dsputil: Split motion estimation compare bits off into their own context	10 years ago
Diego Biurrun	adff0a8166	arm: dsputil: Coalesce all init files	10 years ago
Ben Avison	42c1cc35b7	armv6: Accelerate ff_imdct_half for general case (mdct_bits != 6) The previous implementation targeted DTS Coherent Acoustics, which only requires mdct_bits == 6. This relatively small size lent itself to unrolling the loops a small number of times, and encoding offsets calculated at assembly time within the load/store instructions of each iteration. In the more general case (codecs such as AAC and AC3) much larger arrays are used - mdct_bits == [8, 9, 11]. The old method does not scale for these cases, so more integer registers are used with non-unrolled versions of the loops (and with some stack spillage). The postrotation filter loop is still unrolled by a factor of 2 to permit the double-buffering of some VFP registers to facilitate overlap of neighbouring iterations. I benchmarked the result by measuring the number of gperftools samples that hit anywhere in the AAC decoder (starting from aac_decode_frame()) or specifically in ff_imdct_half_c / ff_imdct_half_vfp, for the same example AAC stream. Results (mean / StdDev): aac_decode_frame: before 2368.1 / 35.8, after 2117.2 / 35.3, confidence 100.0%, change +11.8%. ff_imdct_half_*: before 457.5 / 22.4, after 251.2 / 16.2, confidence 100.0%, change +82.1%. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Diego Biurrun	1173320249	dsputil: Drop unused bit_depth parameter from all init functions	11 years ago
Diego Biurrun	f46bb608d9	dsputil: Split off pixel block routines into their own context	11 years ago
Martin Storsjö	79fce1ec8a	arm: Avoid using the 'setend' instruction on ARMv7 and newer This instruction is deprecated on ARMv8, and it is serializing on some ARMv7 cores as well [1]. [1] http://article.gmane.org/gmane.linux.ports.arm.kernel/339293 CC: libav-stable@libav.org Signed-off-by: Martin Storsjö <martin@martin.st>	11 years ago
Diego Biurrun	c166148409	dsputil: Move pix_sum, pix_norm1, shrink function pointers to mpegvideoenc	11 years ago
Diego Biurrun	e3fcb14347	dsputil: Split off IDCT bits into their own context	11 years ago
Janne Grunau	f23d26a686	h264: avoid using uninitialized memory in NEON chroma mc Adapt commit `982b596ea6` for the arm and aarch64 NEON asm. 5-10% faster on Cortex-A9.	11 years ago
Diego Biurrun	9a9e2f1c8a	dsputil: Split audio operations off into a separate context	11 years ago
Michael Niedermayer	08c5859f17	avcodec: add simpleauto idct This will pick the "best" simple idct compatible idct Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	11 years ago
Diego Biurrun	e74433a8e6	dsputil: Split clear_block/fill_block off into a separate context	11 years ago
Christophe Gisquet	ccff45a0d3	apedsp: move to llauddsp APE is not the sole codec using scalarproduct_and_madd_int16. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	11 years ago

731 Commits (22af79a9c88f8bfaa8c4130c8f58c5bff20e1a1f)