Janne Grunau
|
d3f5b94762
|
aarch64: opus NEON iMDCT and FFT
Opus celt decoding 11% faster and the iMDCT over 2.5 times faster on
Apple's A7.
|
11 years ago |
Janne Grunau
|
3956a5e0ea
|
aarch64: NEON vorbis_inverse_coupling
From the ARMv7 NEON version. 16 times faster as the C version, overall
more than 12% faster vorbis decoding on Apple's A7.
|
11 years ago |
Janne Grunau
|
8f9fe6ae34
|
aarch64: NEON fixed/floating point MPADSP apply_window
30%/25% (fixed/float) faster mp3 decoding on Apple's A7. The floating
point decoder is approximately 7% faster.
|
11 years ago |
Janne Grunau
|
ee2bc5974f
|
aarch64: NEON float (i)MDCT
Approximately as fast as the ARM NEON version on Apple's A7.
|
11 years ago |
Janne Grunau
|
650c4300d9
|
aarch64: NEON float FFT
Approximately as fast as the ARM NEON version on Apple's A7.
|
11 years ago |
Janne Grunau
|
d3789eeeed
|
aarch64: implement videodsp.prefetch
8% faster h264 decoding on Apple A7.
|
11 years ago |
Diego Biurrun
|
0e083d7e43
|
build: Group general components separate from de/encoders in arch Makefiles
This is in line with how the top-level libavcodec Makefile is structured.
|
11 years ago |
Janne Grunau
|
fe96769bed
|
aarch64: port neon clobber test from arm
|
11 years ago |
Janne Grunau
|
36e3b1f2fd
|
aarch64: h264 loop filter NEON optimizations
Ported from ARMv7 NEON.
|
11 years ago |
Janne Grunau
|
c65d67ef50
|
aarch64: hpeldsp NEON optimizations
Ported from ARMv7 NEON.
|
11 years ago |
Janne Grunau
|
d5dd8c7bf0
|
aarch64: h264 qpel NEON optimizations
Ported from ARMv7 NEON.
|
11 years ago |
Janne Grunau
|
8438b3f09f
|
aarch64: h264 idct NEON assembler optimizations
Ported from ARMv7 NEON.
|
11 years ago |
Janne Grunau
|
71617884a2
|
aarch64: h264 chroma motion compensation NEON optimizations
Since RV40 and VC-1 use almost the same algorithm so optimizations for
those two decoders are easy to do and included.
|
11 years ago |