Ronald S. Bultje
4773d90421
vp8: frame-multithreading.
...
Tested on a Mac Pro, 2 CPUs, 2 cores each, OSX 10.6.6:
time ./ffmpeg -v 0 -vsync 0 -threads [1234] -i \
~/Downloads/sintel_trailer_1080p_vp8_vorbis.webm \
-f null -vcodec rawvideo -an -
1: 0m14.630s (89.9 fps)
2: 0m8.056s (163.2 fps)
3: 0m5.882s (223.6 fps)
4: 0m4.952s (265.6 fps)
time ./ffmpeg -v 0 -vsync 0 -threads [1234] -i \
~/Downloads/Elephants_Dream-720p-Stereo.webm \
-f null -vcodec rawvideo -an -
1: 1m12.962s (215.1 fps)
2: 0m44.682s (351.2 fps)
3: 0m31.183s (503.2 fps)
4: 0m25.284s (620.6 fps)
Signed-off-by: Anton Khirnov <anton@khirnov.net>
14 years ago
Stefano Sabatini
ce5e49b0c2
replace deprecated FF_*_TYPE symbols with AV_PICTURE_TYPE_*
14 years ago
Stefano Sabatini
975a1447f7
Replace deprecated FF_*_TYPE symbols with AV_PICTURE_TYPE_*.
...
Signed-off-by: Diego Biurrun <diego@biurrun.de>
14 years ago
Oskar Arvidsson
8dbe585641
Adds 8-, 9- and 10-bit versions of some of the functions used by the h264 decoder.
...
This patch lets e.g. dsputil_init chose dsp functions with respect to
the bit depth to decode. The naming scheme of bit depth dependent
functions is <base name>_<bit depth>[_<prefix>] (i.e. the old
clear_blocks_c is now named clear_blocks_8_c).
Note: Some of the functions for high bit depth is not dependent on the
bit depth, but only on the pixel size. This leaves some room for
optimizing binary size.
Preparatory patch for high bit depth h264 decoding support.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
14 years ago
Alexander Strange
66f608a6aa
vp8.c: rename EDGE_* to VP8_EDGE_*.
14 years ago
Michael Niedermayer
d375c10400
Fake-Merge remote-tracking branch 'ffmpeg-mt/master'
14 years ago
Mans Rullgard
2912e87a6c
Replace FFmpeg with Libav in licence headers
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Jason Garrett-Glaser
81a131312d
VP8: fix other function declaration
...
Was missed in 3efbe137
.
14 years ago
Jason Garrett-Glaser
1eeca88691
VP8: optimize VP8Context struct ordering
...
Shaves at least 3KB off code size on x86, should improve cache utilization.
This would probably be useful to do for other decoders/encoders as well.
14 years ago
Jason Garrett-Glaser
3efbe13739
VP8: fix function declaration
14 years ago
Jason Garrett-Glaser
628b48db85
VP8: use a goto to break out of two loops
...
A break statement was supposed to break out of two loops, but only broke out of one.
Didn't affect output, just could have been marginally slower.
14 years ago
Jason Garrett-Glaser
1d37e908cf
VP8: init one less near_mv
...
This one didn't actually need to be initialized.
(cherry picked from commit 891b1f15a7
)
14 years ago
Jason Garrett-Glaser
8e624c1cee
VP8: split out declarations to new header
...
(cherry picked from commit bcf4568f18
)
14 years ago
Jason Garrett-Glaser
625e9309da
VP8: faster MV clipping
...
(cherry picked from commit 7634771e70
)
14 years ago
Jason Garrett-Glaser
891b1f15a7
VP8: init one less near_mv
...
This one didn't actually need to be initialized.
14 years ago
Jason Garrett-Glaser
bcf4568f18
VP8: split out declarations to new header
14 years ago
Jason Garrett-Glaser
7634771e70
VP8: faster MV clipping
14 years ago
Reinhard Tartler
7ffe76e540
Merge libavcore into libavutil
...
Done to keep ABI compatible. Otherwise this is just silly
14 years ago
Reinhard Tartler
737eb5976f
Merge libavcore into libavutil
...
It is pretty hopeless that other considerable projects will adopt
libavutil alone in other projects. Projects that need small footprint
are better off with more specialized libraries such as gnulib or rather
just copy the necessary parts that they need. With this in mind, nobody
is helped by having libavutil and libavcore split. In order to ease
maintenance inside and around FFmpeg and to reduce confusion where to
put common code, avcore's functionality is merged (back) to avutil.
Signed-off-by: Reinhard Tartler <siretart@tauware.de>
14 years ago
Mans Rullgard
4ae3ee4ae9
VP8: ARM optimised decode_block_coeffs_internal
...
Approximately 5% faster on Cortex-A8.
Signed-off-by: Mans Rullgard <mans@mansr.com>
(cherry picked from commit a7878c9f73
)
14 years ago
Mans Rullgard
a7878c9f73
VP8: ARM optimised decode_block_coeffs_internal
...
Approximately 5% faster on Cortex-A8.
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Jason Garrett-Glaser
ccba4f4c25
VP8: optimized mv prediction and decoding
...
Merge find_near_mvs and mv bitstream decoding: don't do prediction steps
until absolutely necessary.
(cherry picked from commit f3d09d44b7
)
14 years ago
Jason Garrett-Glaser
f3d09d44b7
VP8: optimized mv prediction and decoding
...
Merge find_near_mvs and mv bitstream decoding: don't do prediction steps
until absolutely necessary.
14 years ago
Jason Garrett-Glaser
a1b0a3c8bd
VP8: idct_mb optimizations
...
Currently uses AV_RL32 instead of AV_RL32A, as the latter doesn't exist yet.
(cherry picked from commit 62457f9052
)
14 years ago
Jason Garrett-Glaser
62457f9052
VP8: idct_mb optimizations
...
Currently uses AV_RL32 instead of AV_RL32A, as the latter doesn't exist yet.
14 years ago
Jason Garrett-Glaser
e9266a2be0
VP8: slightly faster loopfilter sharpness logic
...
(cherry picked from commit 8a2c99b486
)
14 years ago
Jason Garrett-Glaser
9efa368f19
VP8: faster deblock strength calculation
...
Convert hev_thresh logic to a LUT, simplify mbedge_lim calculation.
(cherry picked from commit 79dec1541b
)
14 years ago
Jason Garrett-Glaser
c54ac7a8f2
VP8: faster filter_level clip
...
(cherry picked from commit a1b227bb53
)
14 years ago
Jason Garrett-Glaser
8cde1b7997
VP8: simplify lf_delta mb mode logic
...
(cherry picked from commit dd18c9a050
)
14 years ago
Jason Garrett-Glaser
8a2c99b486
VP8: slightly faster loopfilter sharpness logic
14 years ago
Jason Garrett-Glaser
79dec1541b
VP8: faster deblock strength calculation
...
Convert hev_thresh logic to a LUT, simplify mbedge_lim calculation.
14 years ago
Jason Garrett-Glaser
a1b227bb53
VP8: faster filter_level clip
14 years ago
Jason Garrett-Glaser
dd18c9a050
VP8: simplify lf_delta mb mode logic
14 years ago
Jason Garrett-Glaser
5ad4335c22
VP8: merge chroma MC calls
...
Adds some duplicated code, but avoids duplicate edge checks and similar.
~0.5% faster overall on Parkjoy test sample.
(cherry picked from commit 64233e702a
)
14 years ago
Jason Garrett-Glaser
64233e702a
VP8: merge chroma MC calls
...
Adds some duplicated code, but avoids duplicate edge checks and similar.
~0.5% faster overall on Parkjoy test sample.
14 years ago
Jason Garrett-Glaser
a4257d74e0
Slightly simplify VP8 inter_predict
...
Merge an if and a switch.
(cherry picked from commit 73be29b0c4
)
14 years ago
Jason Garrett-Glaser
73be29b0c4
Slightly simplify VP8 inter_predict
...
Merge an if and a switch.
14 years ago
Ronald S. Bultje
d23e3e5fea
Move ff_emulated_edge_mc() into DSPContext.
...
(cherry picked from commit 2e27959879
)
14 years ago
Ronald S. Bultje
6642a17935
Fix VP8 aliasing problems.
...
Replace * (uint32_t *) buf accesses with AV_WN32A/AV_COPY32.
(cherry picked from commit 9d4bdcb714
)
14 years ago
Ronald S. Bultje
2e27959879
Move ff_emulated_edge_mc() into DSPContext.
14 years ago
Ronald S. Bultje
9d4bdcb714
Fix VP8 aliasing problems.
...
Replace * (uint32_t *) buf accesses with AV_WN32A/AV_COPY32.
14 years ago
Diego Elio Pettenò
e7e2df27f8
Add ff_ prefix to data symbols of encoders, decoders, hwaccel, parsers, bsf.
...
None of these symbols should be accessed directly, so declare them as
hidden.
Signed-off-by: Mans Rullgard <mans@mansr.com>
(cherry picked from commit d36beb3f69
)
14 years ago
Diego Elio Pettenò
d36beb3f69
Add ff_ prefix to data symbols of encoders, decoders, hwaccel, parsers, bsf.
...
None of these symbols should be accessed directly, so declare them as
hidden.
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Ronald S. Bultje
ac3c04e4d6
Don't do edge emulation unless the edge pixels will be used in MC.
...
Do not emulate larger edges than we will actually use for this round of
MC. Decoding goes from avg+SE 29.972+/-0.023sec to 29.856+/-0.023, i.e.
0.12sec or ~0.4% faster.
(cherry picked from commit 44002d8323
)
14 years ago
Ronald S. Bultje
44002d8323
Don't do edge emulation unless the edge pixels will be used in MC.
...
Do not emulate larger edges than we will actually use for this round of
MC. Decoding goes from avg+SE 29.972+/-0.023sec to 29.856+/-0.023, i.e.
0.12sec or ~0.4% faster.
14 years ago
Ronald S. Bultje
7148da489e
Fix valgrind invalid read on top MB rows with CODEC_FLAG_EMU_EDGE set.
...
Originally committed as revision 26168 to svn://svn.ffmpeg.org/ffmpeg/trunk
14 years ago
Ronald S. Bultje
ee555de7dd
Support CODEC_FLAG_EMU_EDGE in VP8 decoder.
...
Originally committed as revision 26117 to svn://svn.ffmpeg.org/ffmpeg/trunk
14 years ago
Stefano Sabatini
e16f217ceb
Use new imgutils.h API names, fix deprecation warnings.
...
Originally committed as revision 25058 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Jason Garrett-Glaser
2b476e02e1
Remove some stray +s in VP8
...
Originally committed as revision 24791 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Pascal Massimino
aa93c52c21
remove b4_stride/mb_stride.
...
correct mb_xy to use mb_width.
tighten allocations.
reduce the amount of zeroing.
Originally committed as revision 24760 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago