Christophe GISQUET
7e1ce6a6ac
dsputil: remove shift parameter from scalarproduct_int16
...
There is only one caller, which does not need the shifting. Other use cases
are situations where different roundings would be needed.
The x86 and neon versions are modified accordingly.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
13 years ago
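A minimal sketch of what a shift-free 16-bit scalar product looks like in plain C, for readers following the change above. The function names here are hypothetical and the body is an illustrative reference, not the code from the commit. The second helper shows one way a caller could scale the total itself and pick its own rounding, which is the situation the commit message alludes to.

```c
#include <stdint.h>

/* Hypothetical name: reference dot product of two int16 vectors,
 * with no per-term shift applied inside the loop. */
static int32_t scalarproduct_int16_sketch(const int16_t *v1,
                                          const int16_t *v2, int order)
{
    int32_t res = 0;

    while (order--)
        res += *v1++ * *v2++;
    return res;
}

/* Hypothetical helper: a caller that still wants scaling can apply it
 * once to the total, here with round-to-nearest. Assumes sum >= 0,
 * since right-shifting a negative value is implementation-defined in C. */
static int32_t scaled_round_sketch(int32_t sum, int shift)
{
    return shift > 0 ? (sum + (1 << (shift - 1))) >> shift : sum;
}
```

Keeping the rounding decision in the caller means the optimised x86 and NEON versions only have to agree on the unscaled sum.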
Mans Rullgard
3adba2de3d
ARM: fix indentation in ff_dsputil_init_neon()
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
13 years ago
Mans Rullgard
716f1705e9
ARM: add remaining NEON avg_pixels8/16 functions
13 years ago
Mans Rullgard
baf6b738f2
ARM: NEON optimised vector_fmac_scalar()
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
13 years ago
Mans Rullgard
cbd58a872d
dsputil: remove some unused functions
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
a617c6aaa3
dsputil: update per-arch init funcs for non-h264 high bit depth
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
e7a972e113
simple_idct: add 10-bit version
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
8aa63f0b31
ARM: NEON optimised vector_clip_int32()
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Oskar Arvidsson
19a0729b4c
Adds 8-, 9- and 10-bit versions of some of the functions used by the h264 decoder.
...
This patch lets e.g. dsputil_init choose dsp functions with respect to
the bit depth to decode. The naming scheme of bit depth dependent
functions is <base name>_<bit depth>[_<prefix>] (e.g. the old
clear_blocks_c is now named clear_blocks_8_c).
Note: Some of the functions for high bit depth are not dependent on the
bit depth, but only on the pixel size. This leaves some room for
optimizing binary size.
Preparatory patch for high bit depth h264 decoding support.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
14 years ago
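The naming scheme described above can be illustrated with a small preprocessor sketch. Everything here is an assumption made for illustration: the macros, the clip_pixel function family, and the selector are hypothetical and are not taken from the patch. The sketch only shows how a <base name>_<bit depth>_c name can be generated per bit depth and then chosen at init time.

```c
/* Two-level paste so that macro arguments are expanded before joining. */
#define PASTE3(base, depth, suffix) base ## _ ## depth ## suffix
#define FUNC_NAME(base, depth, suffix) PASTE3(base, depth, suffix)

/* Hypothetical template: defines clip_pixel_<depth>_c, which clamps a
 * value to the valid sample range [0, 2^depth - 1]. */
#define DEFINE_CLIP_PIXEL(depth)                                  \
static int FUNC_NAME(clip_pixel, depth, _c)(int v)                \
{                                                                 \
    const int max = (1 << (depth)) - 1;                           \
    return v < 0 ? 0 : v > max ? max : v;                         \
}

DEFINE_CLIP_PIXEL(8)   /* clip_pixel_8_c  */
DEFINE_CLIP_PIXEL(9)   /* clip_pixel_9_c  */
DEFINE_CLIP_PIXEL(10)  /* clip_pixel_10_c */

typedef int (*clip_pixel_fn)(int v);

/* Hypothetical selector: picks the variant matching the bit depth,
 * falling back to the 8-bit version for any other value. */
static clip_pixel_fn select_clip_pixel(int bit_depth)
{
    switch (bit_depth) {
    case 9:  return clip_pixel_9_c;
    case 10: return clip_pixel_10_c;
    default: return clip_pixel_8_c;
    }
}
```

A function that depends only on the pixel size, as the note in the message points out, could share one body between the 9- and 10-bit entries instead of being instantiated twice.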
Oskar Arvidsson
8dbe585641
Adds 8-, 9- and 10-bit versions of some of the functions used by the h264 decoder.
...
This patch lets e.g. dsputil_init choose dsp functions with respect to
the bit depth to decode. The naming scheme of bit depth dependent
functions is <base name>_<bit depth>[_<prefix>] (e.g. the old
clear_blocks_c is now named clear_blocks_8_c).
Note: Some of the functions for high bit depth are not dependent on the
bit depth, but only on the pixel size. This leaves some room for
optimizing binary size.
Preparatory patch for high bit depth h264 decoding support.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
14 years ago
Mans Rullgard
d743065e18
ARM: fix ff_apply_window_int16_neon() prototype
...
The length argument should be unsigned. No change in code.
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
2d3b21ffb9
ARM: NEON optimised apply_window_int16()
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Mans Rullgard
2912e87a6c
Replace FFmpeg with Libav in licence headers
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Justin Ruggles
fe2ff6d247
Separate format conversion DSP functions from DSPContext.
...
This will be beneficial for use with the audio conversion API without
requiring it to depend on all of dsputil.
Signed-off-by: Mans Rullgard <mans@mansr.com>
(cherry picked from commit c73d99e672)
14 years ago
Justin Ruggles
c73d99e672
Separate format conversion DSP functions from DSPContext.
...
This will be beneficial for use with the audio conversion API without
requiring it to depend on all of dsputil.
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Justin Ruggles
a8ae4e0e7b
Remove unneeded add bias from 3 functions.
...
DSPContext.vector_fmul_window()
DCADSPContext.lfe_fir()
SynthFilterContext.synth_filter_float()
Signed-off-by: Mans Rullgard <mans@mansr.com>
(cherry picked from commit 80ba1ddb58)
14 years ago
Justin Ruggles
80ba1ddb58
Remove unneeded add bias from 3 functions.
...
DSPContext.vector_fmul_window()
DCADSPContext.lfe_fir()
SynthFilterContext.synth_filter_float()
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
Justin Ruggles
015f9f1ad3
Change DSPContext.vector_fmul() from dst=dst*src to dst=src0*src1.
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
(cherry picked from commit 6eabb0d3ad)
14 years ago
Justin Ruggles
6eabb0d3ad
Change DSPContext.vector_fmul() from dst=dst*src to dst=src0*src1.
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
14 years ago
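As a rough illustration of the interface change above, the following sketch shows an element-wise float multiply in the three-operand form. The function name and exact signature are assumptions for illustration, not the DSPContext declaration itself.

```c
/* Hypothetical name: element-wise product written to a separate
 * destination, i.e. dst[i] = src0[i] * src1[i]. */
static void vector_fmul_sketch(float *dst, const float *src0,
                               const float *src1, int len)
{
    for (int i = 0; i < len; i++)
        dst[i] = src0[i] * src1[i];
}
```

In this sketch the old in-place behaviour (dst = dst * src) is still reachable by passing the same buffer as dst and src0, since each element is read before it is written. Whether an optimised implementation permits such aliasing depends on its own contract.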
Eli Friedman
b3858964d6
Add const to some pointer parameters.
...
Patch by Eli Friedman, eli D friedman A gmail
Originally committed as revision 23826 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
73404a44c1
ARM: NEON clear_block[s]
...
Originally committed as revision 23412 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
David Conrad
eb6a6cd788
vp3: DC-only IDCT
...
2-4% faster overall decode
Originally committed as revision 22896 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
4693b031a3
Move H264 dsputil functions into their own struct
...
This moves the H264-specific functions from DSPContext to the new
H264DSPContext. The code is made conditional on CONFIG_H264DSP
which is set by the codecs requiring it.
The qpel and chroma MC functions are not moved as these are used by
non-h264 code.
Originally committed as revision 22565 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Kostya Shishkov
9b3c455c50
ARM: NEON scalarproduct_int16 and scalarproduct_and_madd_int16
...
Patch by Kostya, minor fixes by me.
Originally committed as revision 21958 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
1025d19dd7
ARM: NEON 2xN chroma MC
...
Originally committed as revision 20696 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
04e7f6d2d0
ARM: NEON 16x16 and 8x8 avg qpel MC
...
Originally committed as revision 20695 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
bef966e341
ARM: NEON avg_pixels8 and avg_h264_qpel8_mc00
...
Originally committed as revision 20190 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
9abcc9a6f4
ARM: cosmetics
...
Originally committed as revision 20166 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
701c618f7d
ARM: clean up file/function naming conventions
...
Originally committed as revision 20164 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
84d430f85a
ARM: clean up dsputil initialisation
...
- Move v5 and v6 initialisation to separate files.
- Move NEON IDCT selection to ff_dsputil_init_neon()
Originally committed as revision 20163 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
ec71a8e00b
ARM: NEON optimised vector_fmul_add
...
Originally committed as revision 20063 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
f331cec47d
ARM: NEON optimised vector_clipf
...
Originally committed as revision 20031 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
b9b1ad9cd8
ARM: NEON optimised vector_fmul_reverse
...
Originally committed as revision 20029 to svn://svn.ffmpeg.org/ffmpeg/trunk
15 years ago
Måns Rullgård
9bda7f3063
ARM: NEON optimised int32_to_float_fmul_scalar
...
Originally committed as revision 20000 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
275cfd1501
ARM: NEON optimised scalarproduct_float
...
Originally committed as revision 19971 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
1dee3e97c6
ARM: NEON optimisations for some dsputil functions
...
NEON versions of the following functions are added:
vector_fmul_scalar
vector_fmul_sv_scalar
sv_fmul_scalar
butterflies_float
Originally committed as revision 19957 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
e814015d69
ARM: NEON optimised vorbis_inverse_coupling
...
12% faster Vorbis decoding on Cortex-A8.
Originally committed as revision 19637 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
7d90972c84
cosmetics: reindent/prettyprint after last commit
...
Originally committed as revision 19495 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
31cf61b447
Only compile in NEON optimizations for H.264 when the H.264 decoder is enabled.
...
Originally committed as revision 19494 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Diego Biurrun
99e5a9d1ea
Do not redundantly check for both CONFIG_THEORA_DECODER and CONFIG_VP3_DECODER.
...
The Theora decoder depends on the VP3 decoder.
Originally committed as revision 19492 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
6db8697a39
ARM: NEON VP3 Loop Filter
...
Originally committed as revision 18916 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
428bf2ac88
ARM: NEON put_pixels_clamped
...
Originally committed as revision 18712 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
David Conrad
cc2e555420
ARM: NEON optimized put_signed_pixels_clamped
...
Originally committed as revision 18333 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
08e12b2240
ARM: NEON optimised add_pixels_clamped
...
Based on patch by David Conrad.
Originally committed as revision 18332 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
e1f7cb7fa0
ARM: NEON optimised vector_fmul_window
...
Originally committed as revision 16868 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
dd927e2e62
ARM: NEON optimised vector_fmul
...
Originally committed as revision 16867 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
bd53b426b7
ARM: NEON optimised H.264 weighted prediction
...
Originally committed as revision 16771 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
5a29589b81
ARM: NEON optimised H.264 biweighted prediction
...
Originally committed as revision 16770 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
f23740d96f
ARM: NEON optimised float_to_int16
...
Originally committed as revision 16352 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago
Måns Rullgård
760badc1df
ARM: add new h264 idct functions
...
Originally committed as revision 16312 to svn://svn.ffmpeg.org/ffmpeg/trunk
16 years ago