Ronald S. Bultje
62844c3fd6
h264: Integrate clear_blocks calls with IDCT
The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700
to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb
(in the decode_slice loop) goes from 1759 to 1733 cycles on the clip
tested (cathedral), i.e. 26 to 28 cycles per macroblock faster.
Signed-off-by: Martin Storsjö <martin@martin.st>
12 years ago
Ronald S. Bultje
1acd7d594c
h264: integrate clear_blocks calls with IDCT.
The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700
to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb
(in the decode_slice loop) goes from 1759 to 1733 cycles on the clip
tested (cathedral), i.e. 26 to 28 cycles per macroblock faster.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
12 years ago
Mans Rullgard
5e826fd65e
ARM: set Tag_ABI_align_preserved in all asm files
All our ARM asm preserves alignment so setting this attribute
in a common location is simpler. This removes numerous warnings
when linking with armcc.
Signed-off-by: Mans Rullgard <mans@mansr.com>
12 years ago
Justin Ruggles
94d2b0d2fd
ARM: Move asm.S from libavcodec to libavutil
This will allow for easier implementation of ARM-optimized functions in
libraries other than libavcodec.
13 years ago
Mans Rullgard
3a0b72dee0
ARM: remove needless .text/.align directives
The 'function' macro already includes the appropriate
directives.
Signed-off-by: Mans Rullgard <mans@mansr.com>
13 years ago
Mans Rullgard
11043d80f6
ARM: use const macro to define constant data in asm
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
8986fddc2b
ARM: allow building in Thumb2 mode
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
88ff180ad6
ARM: update ff_h264_idct8_add4_neon for 4:4:4 changes
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
2912e87a6c
Replace FFmpeg with Libav in licence headers
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Måns Rullgård
4a6cc8fa25
ARM: fix NEON h264_idct_add8
Originally committed as revision 25121 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
fa2d5d54b9
ARM: NEON H264 8x8 IDCT
Parts by David Conrad.
Originally committed as revision 24706 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
a7e7d40c2e
ARM: set size of asm functions in object files
Originally committed as revision 22404 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
5dad039bf7
ARM: small tweak of NEON H264 IDCT
Originally committed as revision 20697 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
b44c6d8edb
ARM: remove unnecessary .fpu neon directives
Originally committed as revision 20151 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
172a39d13c
ARM: work around linker bug with movw/movt relocations in shared libs
Originally committed as revision 16395 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
760badc1df
ARM: add new h264 idct functions
Originally committed as revision 16312 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
a2fc0f6a6d
ARM: replace "armv4l" with "arm"
Originally committed as revision 16179 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
1bf98d19d5
ARM: NEON optimised h264_idct_dc_add
Originally committed as revision 16151 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
c598cf25f4
ARM: NEON optimised h264_idct_add
Originally committed as revision 16150 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago