Jason Garrett-Glaser
40c7d0ae3c
Add automatic prefix handling to yasm functions. Does nothing now, but will
...
be useful for porting x264 asm in the future.
Originally committed as revision 16234 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
a5b807a6c1
Replace /2 by >>1 in decode_cabac_mb_dqp()
...
3 cpu cycles speed up on pentium dual.
Originally committed as revision 16233 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
1aea5d35e5
Simplify ctx update in decode_cabac_mb_dqp().
...
no speed change
Originally committed as revision 16232 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
7cfca0dfd8
Simplify ctx calculation in decode_cabac_mb_dqp()
...
no speed change
Originally committed as revision 16231 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Alexander Strange
d43696309a
Clear FF_INPUT_BUFFER_PADDING_SIZE bytes at the end of NALs in rbsp_buffer.
...
Fixes valgrind uninitialized value warnings at the end of decoding H.264
frames.
Originally committed as revision 16230 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Jason Garrett-Glaser
712ca84c21
Move filter_luma_intra into dsputil for later addition of asm.
...
Originally committed as revision 16228 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Jason Garrett-Glaser
b9fe706305
Simplify chroma AC in CABAC residual decoding.
...
Originally committed as revision 16227 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
8955b66950
Optimize ctx calculation in decode_cabac_mb_mvd(), code by dark shikari.
...
The case for 16x16 blocks becomes 10 cpu cycles faster on pentium dual,
i could not find a speed difference in the case of subblocks though.
Originally committed as revision 16226 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
17779f39b6
Remove unacceptable NULL pointer hack from mc code.
...
Originally committed as revision 16225 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
04618b98e3
Check ref values in CABAC H.264 for validity.
...
Originally committed as revision 16224 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
c25ac15a07
Move idct_(dc)add closer to where it is needed.
...
Originally committed as revision 16223 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
aebb5d6d96
indent
...
Originally committed as revision 16222 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
96465b90a1
Reorder ifs in chroma hl_decode_mb to avoid a duplicate transform_bypass
...
check.
14 cpu cycles speedup on Pentium Dual
Originally committed as revision 16221 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
6456d6d87c
s/h->cbp_table[mb_xy]/h->cbp/
...
Originally committed as revision 16220 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
04824298a9
Faster CAVLC decoding of trailing_ones. Based on a patch by dark shikari.
...
decode_residual is about 3.3% faster.
Originally committed as revision 16219 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Jason Garrett-Glaser
93445d1617
Replace i by trailing_ones, part of a patch by dark shikari.
...
No speed change meassureable with START/STOP_TIMER, but this is needed
for future optimizations.
Originally committed as revision 16218 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
c375d87f6f
Remove if() surrounding decode_cabac_mb_type() that can never be true.
...
Originally committed as revision 16217 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
c325b5054f
Remove unreachable else clause, found by dark shikari.
...
Originally committed as revision 16216 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
dae006d7d7
Remove useless IS_8x8DCT check i forgot, spotted by dark shikari.
...
Originally committed as revision 16215 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
2a5a9c2815
10l, svq3 didnt set h->cbp, this broke decoding a little.
...
Originally committed as revision 16214 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Kostya Shishkov
efeb298ea0
cosmetics: remove one unneeded space
...
Originally committed as revision 16211 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
1eb960352b
Do not calculate idct_dc_add/idct_add when the variables are unused.
...
Originally committed as revision 16210 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
62bc966f8f
Remove redundant nnz variable.
...
Originally committed as revision 16209 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
0a8ca22f4e
indent
...
Originally committed as revision 16208 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
2fd1f0e026
Use the new idct functions (except chroma as it was slower in benchmarks)
...
cathedral +0.5% speed
aladin +0.6% speed [note aladin has been cat-ed 10 times to reduce the influence
of init time]
Speedup also verified via START/STOP_TIMER (difference was very significant
for the changed parts)
Originally committed as revision 16207 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
ac22385931
H.264 idct functions that include the chroma, inter luma and intra16 luma loops
...
thus avoiding the calling overhead.
New functions are not yet used.
Originally committed as revision 16206 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
49c084a745
Skip non intra luma code when there is no coded luma.
...
0.7% speedup for the cathedral sample.
Originally committed as revision 16203 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Loren Merritt
00794ed0d9
1.6x faster pred*_vertical_add_c
...
Originally committed as revision 16202 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
621561cdf3
Skip chroma handling when there is no coded chroma.
...
0.5% overall speedup for the cathedral sample.
Originally committed as revision 16201 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Kostya Shishkov
17b17c53f4
add const qualifier to some pointers for input data
...
Originally committed as revision 16195 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Kostya Shishkov
bb39171beb
use precalculated mb_pos where available
...
Originally committed as revision 16194 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Kostya Shishkov
6c3fca6479
Merge deblocking pattern with CBP for RV30/40 loop filtering
...
Originally committed as revision 16192 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Kostya Shishkov
26bc319c1d
Generate correct deblock pattern for RV30
...
Originally committed as revision 16191 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Kostya Shishkov
f610fc81c1
Unfortunately RV30 luma thirdpel MC functions cannot be substituted with
...
applying two separate filtering passes, so implement them explicitly.
Originally committed as revision 16190 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Kostya Shishkov
6a4583e925
4l: TIFF stores short strings inside tag, do not interpret it is as an offset
...
Fixes issue 753
Originally committed as revision 16189 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Guillaume Poirier
5f51afd602
add AltiVec implementation of int32_to_float_fmul_scalar
...
Originally committed as revision 16186 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
54dab66174
Fix svq3 decoding, is_complex was not initialized.
...
Originally committed as revision 16184 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
4080e67c8e
Replace != 0 || check by |
...
3 cpu cycles faster
Originally committed as revision 16183 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
ad9ca7e720
Split filter_mb_dir() out of filter_mb().
...
1% overall decoding speed up for cathedral-beta2-400extra-crop-avc.mp4
no speed change for Aladin.mpg
Benchmarks done on Pentium dual
Originally committed as revision 16182 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
6c414bb6d4
AC3: fix strict aliasing violation in parser
...
Originally committed as revision 16181 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
cb56b44086
AAC: fix strict aliasing violation in parser
...
Originally committed as revision 16180 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
a2fc0f6a6d
ARM: replace "armv4l" with "arm"
...
Originally committed as revision 16179 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
2600f8c86d
Remove obsolete and misleading comments.
...
Originally committed as revision 16178 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
ac0623b23c
Fix indention, also do a little vertical alignment of changed lines.
...
Originally committed as revision 16176 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Stefano Sabatini
7b1454c439
Implement av_set_string3().
...
Originally committed as revision 16175 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
6120a343aa
Factorize 3 multiplications out, code becomes 3 cpu cycles faster.
...
(not significant as thats just per MB)
Originally committed as revision 16174 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
1dd488e955
Move ENABLE_SMALL back to the per MB check, as otherwise gcc wont remove
...
the code.
Originally committed as revision 16173 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
89db0bae26
Do not recalculate constant part of is_complex for every MB.
...
Originally committed as revision 16172 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Michael Niedermayer
41e4055b63
Support the new lossless mode.
...
Originally committed as revision 16171 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
479044cea5
dct-test: add ARM IDCTs
...
Originally committed as revision 16170 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago