FFmpeg

Commit Graph

Author	SHA1	Message	Date
Ronald S. Bultje	f8c019944d	vp9: re-split the decoder/format/dsp interface header files. The advantage here is that the internal software decoder interface is not exposed to the DSP functions or the hardware accelerations.	8 years ago
Clément Bœsch	1c9f4b5078	lavc/vp9: split into vp9{block,data,mvs} This is following Libav layout to ease merges.	8 years ago
Martin Storsjö	638eceed47	aarch64: Add NEON optimizations for 10 and 12 bit vp9 MC This work is sponsored by, and copyright, Google. This has mostly got the same differences to the 8 bit version as in the arm version. For the horizontal filters, we do 16 pixels in parallel as well. For the 8 pixel wide vertical filters, we can accumulate 4 rows before storing, just as in the 8 bit version. Examples of runtimes vs the 32 bit version, on a Cortex A53: ARM AArch64 vp9_avg4_10bpp_neon: 35.7 30.7 vp9_avg8_10bpp_neon: 93.5 84.7 vp9_avg16_10bpp_neon: 324.4 296.6 vp9_avg32_10bpp_neon: 1236.5 1148.2 vp9_avg64_10bpp_neon: 4639.6 4571.1 vp9_avg_8tap_smooth_4h_10bpp_neon: 130.0 128.0 vp9_avg_8tap_smooth_4hv_10bpp_neon: 440.0 440.5 vp9_avg_8tap_smooth_4v_10bpp_neon: 114.0 105.5 vp9_avg_8tap_smooth_8h_10bpp_neon: 327.0 314.0 vp9_avg_8tap_smooth_8hv_10bpp_neon: 918.7 865.4 vp9_avg_8tap_smooth_8v_10bpp_neon: 330.0 300.2 vp9_avg_8tap_smooth_16h_10bpp_neon: 1187.5 1155.5 vp9_avg_8tap_smooth_16hv_10bpp_neon: 2663.1 2591.0 vp9_avg_8tap_smooth_16v_10bpp_neon: 1107.4 1078.3 vp9_avg_8tap_smooth_64h_10bpp_neon: 17754.6 17454.7 vp9_avg_8tap_smooth_64hv_10bpp_neon: 33285.2 33001.5 vp9_avg_8tap_smooth_64v_10bpp_neon: 16066.9 16048.6 vp9_put4_10bpp_neon: 25.5 21.7 vp9_put8_10bpp_neon: 56.0 52.0 vp9_put16_10bpp_neon/armv8: 183.0 163.1 vp9_put32_10bpp_neon/armv8: 678.6 563.1 vp9_put64_10bpp_neon/armv8: 2679.9 2195.8 vp9_put_8tap_smooth_4h_10bpp_neon: 120.0 118.0 vp9_put_8tap_smooth_4hv_10bpp_neon: 435.2 435.0 vp9_put_8tap_smooth_4v_10bpp_neon: 107.0 98.2 vp9_put_8tap_smooth_8h_10bpp_neon: 303.0 290.0 vp9_put_8tap_smooth_8hv_10bpp_neon: 893.7 828.7 vp9_put_8tap_smooth_8v_10bpp_neon: 305.5 263.5 vp9_put_8tap_smooth_16h_10bpp_neon: 1089.1 1059.2 vp9_put_8tap_smooth_16hv_10bpp_neon: 2578.8 2452.4 vp9_put_8tap_smooth_16v_10bpp_neon: 1009.5 933.5 vp9_put_8tap_smooth_64h_10bpp_neon: 16223.4 15918.6 vp9_put_8tap_smooth_64hv_10bpp_neon: 32153.0 31016.2 vp9_put_8tap_smooth_64v_10bpp_neon: 14516.5 13748.1 These are generally about as fast as the corresponding ARM routines on the same CPU (at least on the A53), in most cases marginally faster. The speedup vs C code is around 4-9x. Signed-off-by: Martin Storsjö <martin@martin.st>	8 years ago
Martin Storsjö	a4d4bad75c	arm: Add NEON optimizations for 10 and 12 bit vp9 MC This work is sponsored by, and copyright, Google. The plain pixel put/copy functions are used from the 8 bit version, for the double size (e.g. put16 uses ff_vp9_copy32_neon), and a new copy128 is added. Compared with the 8 bit version, the filters can no longer use the trick to accumulate in 16 bit with only saturation at the end, but now the accumulators need to be 32 bit. This avoids the need to keep track of which filter index is the largest though, reducing the size of the executable code for these filters. For the horizontal filters, we only do 4 or 8 pixels wide in parallel (while doing two rows at a time), since we don't have enough register space to filter 16 pixels wide. For the vertical filters, we still do 4 and 8 pixels in parallel just as in the 8 bit case, but we need to store the output after every 2 rows instead of after every 4 rows. Examples of relative speedup compared to the C version, from checkasm: Cortex A7 A8 A9 A53 vp9_avg4_10bpp_neon: 2.25 2.44 3.05 2.16 vp9_avg8_10bpp_neon: 3.66 8.48 3.86 3.50 vp9_avg16_10bpp_neon: 3.39 8.26 3.37 2.72 vp9_avg32_10bpp_neon: 4.03 10.20 4.07 3.42 vp9_avg64_10bpp_neon: 4.15 10.01 4.13 3.70 vp9_avg_8tap_smooth_4h_10bpp_neon: 3.38 6.22 3.41 4.75 vp9_avg_8tap_smooth_4hv_10bpp_neon: 3.89 6.39 4.30 5.32 vp9_avg_8tap_smooth_4v_10bpp_neon: 5.32 9.73 6.34 7.31 vp9_avg_8tap_smooth_8h_10bpp_neon: 4.45 9.40 4.68 6.87 vp9_avg_8tap_smooth_8hv_10bpp_neon: 4.64 8.91 5.44 6.47 vp9_avg_8tap_smooth_8v_10bpp_neon: 6.44 13.42 8.68 8.79 vp9_avg_8tap_smooth_64h_10bpp_neon: 4.66 9.02 4.84 7.71 vp9_avg_8tap_smooth_64hv_10bpp_neon: 4.61 9.14 4.92 7.10 vp9_avg_8tap_smooth_64v_10bpp_neon: 6.90 14.13 9.57 10.41 vp9_put4_10bpp_neon: 1.33 1.46 2.09 1.33 vp9_put8_10bpp_neon: 1.57 3.42 1.83 1.84 vp9_put16_10bpp_neon: 1.55 4.78 2.17 1.89 vp9_put32_10bpp_neon: 2.06 5.35 2.14 2.30 vp9_put64_10bpp_neon: 3.00 2.41 1.95 1.66 vp9_put_8tap_smooth_4h_10bpp_neon: 3.19 5.81 3.31 4.63 vp9_put_8tap_smooth_4hv_10bpp_neon: 3.86 6.22 4.32 5.21 vp9_put_8tap_smooth_4v_10bpp_neon: 5.40 9.77 6.08 7.21 vp9_put_8tap_smooth_8h_10bpp_neon: 4.22 8.41 4.46 6.63 vp9_put_8tap_smooth_8hv_10bpp_neon: 4.56 8.51 5.39 6.25 vp9_put_8tap_smooth_8v_10bpp_neon: 6.60 12.43 8.17 8.89 vp9_put_8tap_smooth_64h_10bpp_neon: 4.41 8.59 4.54 7.49 vp9_put_8tap_smooth_64hv_10bpp_neon: 4.43 8.58 5.34 6.63 vp9_put_8tap_smooth_64v_10bpp_neon: 7.26 13.92 9.27 10.92 For the larger 8tap filters, the speedup vs C code is around 4-14x. Signed-off-by: Martin Storsjö <martin@martin.st>	8 years ago
Carl Eugen Hoyos	a07ac1f788	Fix type of shared flac table ff_flac_blocksize_table[]. Fixes ticket #2533.	12 years ago
Mans Rullgard	2912e87a6c	Replace FFmpeg with Libav in licence headers Signed-off-by: Mans Rullgard <mans@mansr.com>	14 years ago
Justin Ruggles	d4df4e5088	share sample rate and blocksize tables between the FLAC encoder and FLAC decoder Originally committed as revision 18089 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Justin Ruggles	2578326f13	Share the function to write a raw FLAC header and use it in the Matroska muxer. Originally committed as revision 17606 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Aurelien Jacobs	7379d5bc0b	use new metadata API in rm (de)muxer Originally committed as revision 17396 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Stefano Sabatini	987903826b	Globally rename the header inclusion guard names. Consistently apply this rule: the guard name is obtained from the filename by stripping the leading "lib", converting '/' and '.' to '_' and uppercasing the resulting name. Guard names in the root directory have to be prefixed by "FFMPEG_". Originally committed as revision 15120 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Vladimir Voroshilov	6bf8b3ef03	Remove unnecessary header inclusion from g729.h Originally committed as revision 14916 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Vladimir Voroshilov	fe3a80d6fa	Move from g729.h all definitions which are used only in g729dec.c Originally committed as revision 14915 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Vladimir Voroshilov	5209846850	G.729 decoder main code (just skeleton, contains only parts, explicitly ok'ed by Michael) Originally committed as revision 14800 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Luca Abeni	e76e2bbc09	Mark the source buffer as "const" Originally committed as revision 10877 to svn://svn.ffmpeg.org/ffmpeg/trunk	17 years ago
Diego Biurrun	5b21bdabe4	Add FFMPEG_ prefix to all multiple inclusion guards. Originally committed as revision 10765 to svn://svn.ffmpeg.org/ffmpeg/trunk	17 years ago
Guillaume Poirier	efb775777f	add a comment to indicate which #endif belong to which #define Originally committed as revision 9356 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Måns Rullgård	699b3f99d0	add multiple inclusion guards to headers Originally committed as revision 9345 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Måns Rullgård	99545457bf	include all prerequisites in header files Originally committed as revision 9344 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Luca Barbato	bd03c380ce	expose av_base64_decode and av_base64_encode Originally committed as revision 8448 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Luca Barbato	558b86a5d0	Reverting stray commit part II, r8156 had the base64 export patch mixed with the nutdec patch Originally committed as revision 8158 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Diego Biurrun	b78e7197a8	Change license headers to say 'FFmpeg' instead of 'this program/this library' and fix GPL/LGPL version mismatches. Originally committed as revision 6577 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Diego Biurrun	5509bffa88	Update licensing information: The FSF changed postal address. Originally committed as revision 4842 to svn://svn.ffmpeg.org/ffmpeg/trunk	19 years ago
Diego Biurrun	115329f160	COSMETICS: Remove all trailing whitespace. Originally committed as revision 4749 to svn://svn.ffmpeg.org/ffmpeg/trunk	19 years ago
Roman Shaposhnik	48b1f80012	* adding integer/floating point AAN implementations for DCT 2-4-8 Originally committed as revision 2430 to svn://svn.ffmpeg.org/ffmpeg/trunk	21 years ago
Michael Niedermayer	b4c3816cfa	optionally merge postscale into quantization table for the float aan dct Originally committed as revision 2420 to svn://svn.ffmpeg.org/ffmpeg/trunk	21 years ago
Michael Niedermayer	65e4c8c919	floating point AAN DCT Originally committed as revision 2415 to svn://svn.ffmpeg.org/ffmpeg/trunk	21 years ago
Michael Niedermayer	b0368839ac	MpegEncContext.(i)dct_* -> DspContext.(i)dct_* bitexact cleanup Originally committed as revision 1617 to svn://svn.ffmpeg.org/ffmpeg/trunk	22 years ago
Zdenek Kabelac	0c1a9edad4	* UINTX -> uintx_t INTX -> intx_t Originally committed as revision 1578 to svn://svn.ffmpeg.org/ffmpeg/trunk	22 years ago
Zdenek Kabelac	bb28568364	* cut&paste fix Originally committed as revision 1249 to svn://svn.ffmpeg.org/ffmpeg/trunk	22 years ago
Zdenek Kabelac	5940262772	* oops fixed bad initialization of ff vals. - put FF_LIBMPEG2_IDCT_PERM into CVS - so it will work for now Originally committed as revision 1227 to svn://svn.ffmpeg.org/ffmpeg/trunk	22 years ago
Zdenek Kabelac	83f238cbf0	* compilation fix (ARM users please check) Originally committed as revision 1225 to svn://svn.ffmpeg.org/ffmpeg/trunk	22 years ago
Michael Niedermayer	50eb9cbc44	idct_permutation_type variable, so the permutation type can quickly be identified Originally committed as revision 1071 to svn://svn.ffmpeg.org/ffmpeg/trunk	22 years ago
Michael Niedermayer	676e200cff	trying to fix the non-x86 IDCTs (untested) Originally committed as revision 1006 to svn://svn.ffmpeg.org/ffmpeg/trunk	22 years ago
Fabrice Bellard	ff4ec49e64	license/copyright change Originally committed as revision 599 to svn://svn.ffmpeg.org/ffmpeg/trunk	23 years ago
Fabrice Bellard	92651f67a0	arm specific code Originally committed as revision 79 to svn://svn.ffmpeg.org/ffmpeg/trunk	24 years ago

3 Commits (782ea8b2e5a96bb18a45210f94fb427009e996b1)