FFmpeg/aarch64 at 7e42d5f0ab2aeac811fd01e122627c9198b13f01 - FFmpeg - Gitea: Git with a cup of tea

Mirror of https://git.ffmpeg.org/ffmpeg.git https://ffmpeg.org/

History

Martin Storsjö 7e42d5f0ab aarch64: vp8: Optimize vp8_idct_add_neon for aarch64 The previous version was a pretty exact translation of the arm version. This version does do some unnecessary arithemetic (it does more operations on vectors that are only half filled; it does 4 uaddw and 4 sqxtun instead of 2 of each), but it reduces the overhead of packing data together (which could be done for free in the arm version). This gives a decent speedup on Cortex A53, a minor speedup on A72 and a very minor slowdown on Cortex A73. Before: Cortex A53 A72 A73 vp8_idct_add_neon: 79.7 67.5 65.0 After: vp8_idct_add_neon: 67.7 64.8 66.7 Signed-off-by: Martin Storsjö <martin@martin.st>		6 years ago
..
Makefile	aarch64: vp8: Move the vp8dsp makefile entries to the right places	6 years ago
asm-offsets.h	arm64: port synth_filter_float_neon from arm	9 years ago
cabac.h	…
dcadsp_init.c	dca: remove unused decode_hf function and quant_d tables	9 years ago
dcadsp_neon.S	dca: remove unused decode_hf function and quant_d tables	9 years ago
fft_init_aarch64.c	fft: Split MDCT bits off from FFT	9 years ago
fft_neon.S	…
fmtconvert_init.c	arm64: int32_to_float_fmul neon asm	9 years ago
fmtconvert_neon.S	arm64: int32_to_float_fmul neon asm	9 years ago
h264chroma_init_aarch64.c	h264chroma: Change type of stride parameters to ptrdiff_t	8 years ago
h264cmc_neon.S	h264chroma: Change type of stride parameters to ptrdiff_t	8 years ago
h264dsp_init_aarch64.c	h264/aarch64: add intra loop filter neon asm	6 years ago
h264dsp_neon.S	h264/aarch64: add intra loop filter neon asm	6 years ago
h264idct_neon.S	aarch64: h264idct: Use the offset parameter to movrel	8 years ago
h264pred_init.c	h264: aarch64: intra prediction optimisations	9 years ago
h264pred_neon.S	h264: aarch64: intra prediction optimisations	9 years ago
h264qpel_init_aarch64.c	…
h264qpel_neon.S	…
hpeldsp_init_aarch64.c	…
hpeldsp_neon.S	…
imdct15_init.c	…
imdct15_neon.S	…
mdct_init.c	fft: Split MDCT bits off from FFT	9 years ago
mdct_neon.S	…
mpegaudiodsp_init.c	mpegaudiodsp: aarch64: Adjust function prototype after `2caa93b813`	8 years ago
mpegaudiodsp_neon.S	aarch64: Remove a dot from a label	7 years ago
neon.S	aarch64: Make transpose_4x4H do a regular transpose	9 years ago
neontest.c	lavc: add clobber tests for the new encoding/decoding API	8 years ago
rv40dsp_init_aarch64.c	h264chroma: Change type of stride parameters to ptrdiff_t	8 years ago
synth_filter_neon.S	arm64: replace 'bic' with immediate with 'and' with inverted immediate	8 years ago
vc1dsp_init_aarch64.c	h264chroma: Change type of stride parameters to ptrdiff_t	8 years ago
videodsp.S	…
videodsp_init.c	…
vorbisdsp_init.c	…
vorbisdsp_neon.S	…
vp8dsp.h	aarch64: vp8: Port bilin functions from arm version	6 years ago
vp8dsp_init_aarch64.c	aarch64: vp8: Port bilin functions from arm version	6 years ago
vp8dsp_neon.S	aarch64: vp8: Optimize vp8_idct_add_neon for aarch64	6 years ago
vp9dsp_init_aarch64.c	aarch64: vp9dsp: Fix vertical alignment in the init file	8 years ago
vp9itxfm_neon.S	aarch64: vp9: Fix assembling with Xcode 6.2 and older	8 years ago
vp9lpf_neon.S	aarch64: vp9lpf: Use dup+rev16+uzp1 instead of dup+lsr+dup+trn1	8 years ago
vp9mc_neon.S	aarch64: vp9: Fix assembling with Xcode 6.2 and older	8 years ago