FFmpeg

Commit Graph

Author	SHA1	Message	Date
Kaustubh Raste	6796a1dd8c	avcodec/mips: Improve avc put mc 20, 01 and 03 msa functions Remove loops and unroll as block sizes are known. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	b8854e2439	avcodec/mips: Improve avc chroma vert mc msa functions Replace generic with block size specific function. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	10ab5534e0	avcodec/mips: Improve avc weighted mc msa functions Replace generic with block size specific function. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	ed1586b921	avcodec/mips: Removed generic function call in avc intra msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	deeaaba1ab	avcodec/mips: preload data in hevc sao edge 45 degree filter msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	16adbfe60c	avcodec/mips: Improve avc chroma horiz mc msa functions Replace generic with block size specific function. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	d6737539e7	avcodec/mips: Unrolled loops avc intra msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	7f8417f226	avcodec/mips: Improve hevc uni-w copy mc msa functions Load the specific destination bytes instead of MSA load and pack. Pack the data to half word before clipping. Use immediate unsigned saturation for clip to max saving one vector register. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	f160a63bad	avcodec/mips: Remove generic func use in hevc non-uni copy mc msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	2b15626997	avcodec/mips: preload data in hevc sao edge 90 degree filter msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	bba9c1c6bb	avcodec/mips: Reduced conditional cases in avc inter lpf msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	b5da07d434	avcodec/mips: Unrolled loops and expanded functions in avc put mc 10 & 30 msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	e428e5ded6	avcodec/mips: preload data in hevc sao edge 0 degree filter msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	f4ba85dc82	avcodec/mips: Fixed rnd_val variable to 6 in hevc uni mc msa functions Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	1a85fb7e1e	avcodec/mips: Improve hevc sao band filter msa functions Preload data in band filter 0-8 for better pipeline parallelization. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	0105ed551c	avcodec/mips: Improve avc mc copy msa functions Remove loops and unroll as block sizes are known. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	e5a650e141	avcodec/mips: Improve avc lpf msa functions Optimize luma intra case by reducing conditional cases. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	c6314cd750	avcodec/mips: Improve hevc idct msa functions Align the buffers. Remove reduandant constant array. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	f692e55aab	avcodec/mips: Improve hevc lpf msa functions Seperate the filter processing in all strong, all weak and strong + weak cases. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	9b2c3c406f	avcodec/mips: Improve vp9 mc msa functions Load the specific destination bytes instead of MSA load and pack. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	c75b23cbea	avcodec/mips: Improve vp9 idct msa functions Removed memset calls. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	2e79813a8e	avcodec/mips: Improve vp9 lpf msa functions Updated VP9_LPF_FILTER4_4W macro to process on 8 bit data. Replaced VP9_LPF_FILTER4_8W with VP9_LPF_FILTER4_4W. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	fa805df060	libavcodec/mips: Improve avc idct8 msa function Replace memset call with msa stores. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	7 years ago
Kaustubh Raste	36ea41de37	libavcodec/mips: Improve avc dequant-idct luma dc msa function Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Kaustubh Raste	a776cb2074	libavcodec/mips: Optimize avc idct 4x4 for msa Removed memset call and improved performance. Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Kaustubh Raste	df806605f7	avcodec: Add prefetch for mips Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com> Reviewed-by: Manojkumar Bhosale <Manojkumar.Bhosale@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Carl Eugen Hoyos	f4c133c708	lavc/mips/iirfilter_mips: Include config.h. Fixes the following warning: libavcodec/mips/iirfilter_mips.c:57:5: warning: "HAVE_INLINE_ASM" is not defined	8 years ago
Carl Eugen Hoyos	a88b0b0ba7	lavc/mips/hevc_idct_msa: Add missing const qualifier. Fixes many warnings: initialization discards 'const' qualifier from pointer target type	8 years ago
Shivraj Patil	2a512f86c1	build fix for mips Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	8 years ago
Michael Niedermayer	c217027c11	avcodec/mips: fix build Found-by: Shivraj Patil <shivraj.patil@imgtec.com> Suggested-by: "Ronald S. Bultje" <rsbultje@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Ronald S. Bultje	f8c019944d	vp9: re-split the decoder/format/dsp interface header files. The advantage here is that the internal software decoder interface is not exposed to the DSP functions or the hardware accelerations.	8 years ago
Clément Bœsch	1c9f4b5078	lavc/vp9: split into vp9{block,data,mvs} This is following Libav layout to ease merges.	8 years ago
Clément Bœsch	9dc57688c8	lavc/mips: temporally disable ac3 downmix	8 years ago
Jacek Manko	c104556448	avcodec/mips/Makefile: corrected conditional build of version 1 of vc1dsp optimizations for loongson mmi Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Zhou Xiaoyong	5b74ebe937	avcodec/mips: version 1 of vc1dsp optimizations for loongson mmi Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Zhou Xiaoyong	d84e635d06	avcodec/mips: version 1 of wmv2dsp optimizations for loongson mmi Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Zhou Xiaoyong	c5c6e30781	avcodec/mips: version 1 of vp8dsp optimizations for loongson mmi Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Zhou Xiaoyong	89ec4adad6	avcodec/mips: loongson optimize mmi load and store operators 1.MMI_ load/store macros are defined in libavutil/mips/mmiutils.h 2.Replace some unnecessary unaligned access with aligned operator 3.The MMI_ load/store is compatible with cpu loongson2e/2f which not support instructions start with gs Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Vicente Olivert Riera	04b0792e4a	libavcodec/mips/h264dsp_msa.c: fix type in some function parameters This fixes a build problem for MIPS architecture that looks like this: libavcodec/mips/h264dsp_msa.c:2498:6: error: conflicting types for ‘ff_weight_h264_pixels16_8_msa’ void ff_weight_h264_pixels16_8_msa(uint8_t *src, int stride, This bug was introduced by commit bc26fe89275c267d169b468356c82ee59874407d: avcodec/h264: Use ptrdiff_t for (bi)weight functions That commit changed the data type of some function parameters in some function definitions. However, the implementation of those functions in libavcodec/mips/h264dsp_msa.c wasn't changed accordingly. Signed-off-by: Vicente Olivert Riera <Vincent.Riera@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
Michael Niedermayer	bc26fe8927	avcodec/h264: Use ptrdiff_t for (bi)weight functions Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 years ago
ZhouXiaoyong	2c7fd0e36b	avcodec/mips/h264qpel_mmi.c: Version 2 of the optimizations for loongson mmi 1. no longer use the register names directly and optimized code format 2. to be compatible with O32, specify type of address variable with mips_reg and handle the address variable with PTR_ operator 3. use uld and mtc1 to workaround cpu 3A2000 gslwlc1 bug (gslwlc1 instruction extension bug in O32 ABI) 4. h264qpel use hepldsp optimizations Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
ZhouXiaoyong	8392794c92	avcodec/mips/idctdsp_mmi: Version 2 of the optimizations for loongson mmi 1. no longer use the register names directly and optimized code format 2. to be compatible with O32, specify type of address variable with mips_reg and handle the address variable with PTR_ operator Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
ZhouXiaoyong	377e5db3db	avcodec/mips/pixblockdsp_mmi: Version 2 of the optimizations for loongson mmi 1. no longer use the register names directly and optimized code format 2. to be compatible with O32, specify type of address variable with mips_reg and handle the address variable with PTR_ operator Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
ZhouXiaoyong	05a546181f	avcodec/mips/blockdsp_mmi: Version 2 of the optimizations for loongson mmi 1. no longer use the register names directly and optimized code format 2. to be compatible with O32, specify type of address variable with mips_reg and handle the address variable with PTR_ operator Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
Zhou Xiaoyong	c749be9eb3	avcodec/mips: loongson optimize h264pred with mmi v3 1. no longer use the register names directly and optimized code format 2. to be compatible with O32, specify type of address variable with mips_reg and handle the address variable with PTR_ operator 3. ff_pred16x16_plane_ functions only support N64 ABI now Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
Zhou Xiaoyong	4a963ee698	avcodec/mips: loongson optimize hpeldsp with mmi v1 1.the codes are compatible with O32 ABI 2.use uld and mtc1 to workaround cpu 3A2000 gslwlc1 bug (gslwlc1 instruction extension bug in O32 ABI) Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
Zhou Xiaoyong	a20646bb24	avcodec/mips/mpegvideo_mmi: Version 2 of the optimizations for loongson mmi 1. no longer use the register names directly and optimized code format 2. to be compatible with O32, specify type of address variable with mips_reg and handle the address variable with PTR_ operator Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
ZhouXiaoyong	a3eb5a3cdd	avcodec/mips/h264chroma_mmi: Version 2 of the optimizations for loongson mmi 1. no longer use the register names directly and optimized code format 2. to be compatible with O32, specify type of address variable with mips_reg and handle the address variable with PTR_ operator 3. use uld and mtc1 to workaround cpu 3A2000 gslwlc1 bug (gslwlc1 instruction extension bug in O32 ABI) Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
ZhouXiaoyong	af3e944e7e	avcodec/mips/h264dsp_mmi: Version 2 of the optimizations for loongson mmi 1. no longer use the register names directly and optimized code format 2. to be compatible with O32, specify type of address variable with mips_reg and handle the address variable with PTR_ operator 3. optimize some unaligned loads and stores 4. use uld and mtc1 to workaround cpu 3A2000 gslwlc1 bug (gslwlc1 instruction extension bug in O32 ABI) Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
Jovan Zelincevic	b73c27151e	avcodec/mips: Optimization synced to the newest code base. FFT expanded to 2^17. Signed-off-by: Jovan Zelincevic <jovan.zelincevic@imgtec.com> Reviewed-by: Nedeljko Babic <Nedeljko.Babic@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago


203 Commits (e91f0c4f8b3e81bc63838cc67370a7b13c8d9e78)