FFmpeg

Commit Graph

Author	SHA1	Message	Date
Andreas Rheinhardt	a0ff31e740	avcodec/vvc/inter: Don't return void Returning a void is not allowed by the spec. Just return instead. Reviewed-by: Nuo Mi <nuomi2021@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	7 months ago
James Almer	94f2274a8b	x86/aacencdsp: fix ff_aac_quantize_bands_avx on unix64 ABI Signed-off-by: James Almer <jamrial@gmail.com>	7 months ago
James Almer	91b9af0058	x86/aacencdsp: add AVX version of quantize_bands quant_bands_signed_c: 1928.0 quant_bands_signed_sse2: 406.0 quant_bands_signed_avx: 207.0 quant_bands_unsigned_c: 1702.0 quant_bands_unsigned_sse2: 404.0 quant_bands_unsigned_avx: 209.0 Signed-off-by: James Almer <jamrial@gmail.com>	7 months ago
Andreas Rheinhardt	3af6136669	avcodec/dnxhdenc: Simplify padding It is unnecessary to first pad to 32bits; the memset later will pad everything will with zeroes anyway. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	7 months ago
Andreas Rheinhardt	b0e0b3c58a	avcodec/dnxhdenc: Move PutBitContext from ctx to stack Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	7 months ago
Andreas Rheinhardt	542abee213	avcodec/cbs_h266_syntax_template: Use correct format specifier H266RawSliceHeader.num_entry_points is an uint32_t. Fixes -Wformat warnings: https://fate.ffmpeg.org/log.cgi?slot=aarch64-osx-clang-1200.0.32.29&time=20240604151047&log=compile Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	7 months ago
Andreas Rheinhardt	dd8fb0aaae	avcodec/hevc/Makefile: Move rules for lavc/* files to lavc/Makefile If any of these files (say A) would be changed in such a way that A acquires a new dependency on another file B, building B would need to be added to all the rules that lead to A being built. Yet currently the rules for several files are spread over the lavc Makefile and the Makefile of the lavc/hevc subdir, making it more likely to be forgotten. So move the rules for these files to the lavc/Makefile. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	7 months ago
Rémi Denis-Courmont	daac101e61	lavc/aacencdsp: fix rounding in R-V V quantize_bands We need to round toward zero here.	7 months ago
Rémi Denis-Courmont	658439934b	lavc/vp8dsp: R-V V vp8_idct_add T-Head C908 (cycles): vp8_idct_add_c: 312.2 vp8_idct_add_rvv_i32: 117.0	7 months ago
Nuo Mi	f68f40736f	avcodec/vvcdec: support mv wraparound A 360 video specific tool see https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9503377 passed files: DMVR_A_Huawei_3.bit WRAP_D_InterDigital_4.bit WRAP_A_InterDigital_4.bit WRAP_B_InterDigital_4.bit WRAP_C_InterDigital_4.bit ERP_A_MediaTek_3.bit	7 months ago
Nuo Mi	685174069f	avcodec/vvcdec: misc, reindent inter.c	7 months ago
Nuo Mi	a4013e748a	avcodec/vvcdec: refact out emulated_edge_no_wrap prepare for refrence wraparound	7 months ago
Nuo Mi	8abdf0a28e	avcodec/vvcdec: misc, move src offset inside emulated_edge	7 months ago
Nuo Mi	2d98786fee	avcodec/vvcdec: refact, remove emulated_edge_dmvr and emulated_edge_bilinear to simplify code	7 months ago
Lynne	714596bcbf	aacdec_usac: zero out alpha values for the current frame	8 months ago
Lynne	c2d459cb51	aacdec_usac: fix stereo alpha values for transients Typo. Also added comments and fixed the branch underneath.	8 months ago
Lynne	7223523335	aacdec_usac: use correct TNS values The standard slightly modified the maximum TNS bands allowed.	8 months ago
Lynne	9b41cc0430	aacdec_usac: do not round noise amplitude values Use floating point division instead of integer division.	8 months ago
Lynne	a18d0659f4	aacdec_usac: skip coeff decoding if the number to be decoded is 0 Yet another thing not mentioned in the spec.	8 months ago
Lynne	1ad9a4008b	aacdec_usac: decouple TNS active from TNS data present flag The issue was that in case of common TNS parameters, TNS was entirely skipped, as tns.present was set to 0.	8 months ago
Lynne	c0fdb0cdfd	aacdec_usac: do not continue parsing bitstream on core_mode == 1 Although LPD is not functional yet, the bitstream ends at that point.	8 months ago
Lynne	8ecaa64b9b	aacdec_usac: respect tns_on_lr flag This was left out, and due to av_unused, forgotten about.	8 months ago
Lynne	25b848a0bd	aacdec_usac: correctly set and use the layout map	8 months ago
Lynne	ae495b56ff	aacdec_usac: remove fallback for custom maps with invalid position Not needed as every possible index is mapped.	8 months ago
Lynne	91ab17e2fe	aacdec_usac: tag LFE channels as such in the channel map Missed.	8 months ago
Lynne	62cd6d9e59	aacdec_usac: clean up nb_elems on error Require that there is a valid layout with a valid number of channels before accepting nb_elems. The value is required when flushing. Thanks to kasper93 for figuring it out.	8 months ago
Lynne	5c328e6c1e	aacdec: increase MAX_ELEM_ID to 64 In USAC, we set the max to 64.	8 months ago
Lynne	91fd6ca000	lavc: bump minor and add APIchanges entry for new USAC profile	8 months ago
Lynne	1c066867df	aac: define a new profile for USAC This allows users to determine whether a stream is USAC or not.	8 months ago
Lynne	ee419804da	mpeg4audio: explicitly define each AOT This makes it far easier to figure out which AOT belongs to which profile. Also, explicitly highlight the holes.	8 months ago
Lynne	8a2fe8a5b9	mpeg4audio: rename AOT_USAC_NOSBR to AOT_USAC The issue is that AOT 45 isn't defined anywhere, and looking at the git blame, it seems to have sprung up through a reordering of the enum, and adding a hole. The spec does not define an explicit AOT for SBR and no SBR, and only uses AOT 42 (previously AOT_USAC_NOSBR), so just rename AOT_USAC to it and replace its use everywhere.	8 months ago
Michael Niedermayer	dce69ba89e	avcodec/libx264: Check init_get_bits8() return code Fixes: CID1594529 Unchecked return value Sponsored-by: Sovereign Tech Fund Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 months ago
Michael Niedermayer	8a64a003b5	avcodec/ilbcdec: Remove dead code Yes the same dead code is in "iLBC Speech Coder ANSI-C Source Code" Fixes: CID1509370 Logically dead code Sponsored-by: Sovereign Tech Fund Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 months ago
Michael Niedermayer	9b76e49061	avcodec/vp8: Check cond init Fixes: CID1598563 Unchecked return value Sponsored-by: Sovereign Tech Fund Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 months ago
Michael Niedermayer	4ac7405aaf	avcodec/vp8: Check mutex init Fixes: CID1598556 Unchecked return value Sponsored-by: Sovereign Tech Fund Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	8 months ago
Rémi Denis-Courmont	3152c684cb	lavc/vc1dsp: R-V V vc1_inv_trans_4x4 T-Head C908 (cycles): vc1dsp.vc1_inv_trans_4x4_c: 310.7 vc1dsp.vc1_inv_trans_4x4_rvv_i32: 120.0 We could use 1 `vlseg4e64.v` instead of 4 `vle16.v`, but that seems to be about 7% slower.	8 months ago
Rémi Denis-Courmont	6ffa639c8a	lavc/vc1dsp: R-V V vc1_inv_trans_4x8 T-Head C908 (cycles): vc1dsp.vc1_inv_trans_4x8_c: 653.2 vc1dsp.vc1_inv_trans_4x8_rvv_i32: 234.0	8 months ago
Rémi Denis-Courmont	a169f3bca5	lavc/vc1dsp: R-V V vc1_inv_trans_8x4 T-Head C908 (cycles): vc1dsp.vc1_inv_trans_8x4_c: 626.2 vc1dsp.vc1_inv_trans_8x4_rvv_i32: 215.2	8 months ago
Rémi Denis-Courmont	04397a29de	lavc/vc1dsp: R-V V vc1_inv_trans_8x8 T-Head C908 (cycles): vc1dsp.vc1_inv_trans_8x8_c: 871.7 vc1dsp.vc1_inv_trans_8x8_rvv_i32: 286.7	8 months ago
Rémi Denis-Courmont	c3dbbb316e	lavc/flacdsp: fix sign extension in R-V V wasted33 We need to use either VWCVT.X.X.V or VSEXT.VF2. The later is preferable to avoid changing VTYPE.	8 months ago
Zhao Zhili	7d46ab9e12	avcodec/mediacodecenc: workaround the alignment requirement for H.265 Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	8 months ago
Zhao Zhili	2a68b2d643	avcodec/mediacodecenc: workaround the alignment requirement only for H.264 There is no bsf for other codecs to modify crop info except H.265. For H.265, the assumption that FFALIGN(width, 16)xFFALIGN(height, 16) is the video resolution can be wrong, since the encoder can use CTU larger than 16x16. In that case, use FFALIGN(width, 16) - width as crop_right is incorrect. So disable the workaround for H.265 now. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	8 months ago
Zhao Zhili	680b3cee1f	avcodec/h265_metadata: Add options to set width/height after crop It's a common usecase to request a video size after crop. Before this patch, user must know the video size before crop, then set crop_right/crop_bottom accordingly. Since HEVC can have different CTU size, it's not easy to get/deduce the video size before crop. With the new width/height options, there is no such requirement. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	8 months ago
Ramiro Polla	2d24a80e5e	avcodec/mpegvideo_enc: give magic number a name	8 months ago
Ramiro Polla	01b1f4c9a5	libavcodec/libxvid: code cleanup (replace magic numbers)	8 months ago
Rémi Denis-Courmont	0415bb74c8	lavc/vp8dsp: remove no longer used macros	8 months ago
Rémi Denis-Courmont	121fb846b9	lavc/vp7dsp: add R-V V vp7_idct_dc_add4uv This is almost the same story as vp7_idct_add4y. We just have to use strided loads of 2 64-bit elements to account for the different data layout in memory. T-Head C908: vp7_idct_dc_add4uv_c: 7.5 vp7_idct_dc_add4uv_rvv_i64: 2.0 vp8_idct_dc_add4uv_c: 6.2 vp8_idct_dc_add4uv_rvv_i32: 2.2 (before) vp8_idct_dc_add4uv_rvv_i64: 2.0 SpacemiT X60: vp7_idct_dc_add4uv_c: 6.7 vp7_idct_dc_add4uv_rvv_i64: 2.2 vp8_idct_dc_add4uv_c: 5.7 vp8_idct_dc_add4uv_rvv_i32: 2.5 (before) vp8_idct_dc_add4uv_rvv_i64: 2.0	8 months ago
Rémi Denis-Courmont	225de53c9d	lavc/vp8dsp: rework R-V V idct_dc_add4y DCT-related FFmpeg functions often add an unsigned 8-bit sample to a signed 16-bit coefficient, then clip the result back to an unsigned 8-bit value. RISC-V has no signed 16-bit to unsigned 8-bit clip, so instead our most common sequence is: VWADDU.WV set SEW to 16 bits VMAX.VV zero # clip negative values to 0 set SEW to 8 bits VNCLIPU.WI # clip values over 255 to 255 and narrow Here we use a different sequence which does not require toggling the vector type. This assumes that the wide addend vector is biased by -128: VWADDU.WV VNCLIP.WI # clip values to signed 8-bit and narrow VXOR.VX 0x80 # flip sign bit (convert signed to unsigned) Also the VMAX is effectively replaced by a VXOR of half-width. In this function, this comes for free as we anyway add a constant to the wide vector in the prologue. On C908, this has no observable effects. On X60, this improves microbenchmarks by about 20%.	8 months ago
Rémi Denis-Courmont	4e120fbbbd	lavc/vp8dsp: add R-V V vp7_idct_dc_add4y As with idct_dc_add, most of the code is shared with, and replaces, the previous VP8 function. To improve performance, we break down the 16x4 matrix into 4 rows, rather than 4 squares. Thus strided loads and stores are avoided, and the 4 DC calculations are vectored. Unfortunately this requires a vector gather to splat the DC values, but overall this is still a win for performance: T-Head C908: vp7_idct_dc_add4y_c: 7.2 vp7_idct_dc_add4y_rvv_i32: 2.2 vp8_idct_dc_add4y_c: 6.2 vp8_idct_dc_add4y_rvv_i32: 2.2 (before) vp8_idct_dc_add4y_rvv_i32: 1.7 SpacemiT X60: vp7_idct_dc_add4y_c: 6.2 vp7_idct_dc_add4y_rvv_i32: 2.0 vp8_idct_dc_add4y_c: 5.5 vp8_idct_dc_add4y_rvv_i32: 2.5 (before) vp8_idct_dc_add4y_rvv_i32: 1.7 I also tried to provision the DC values using indexed loads. It ends up slower overall, especially for VP7, as we then have to compute 16 DC's instead of just 4.	8 months ago
Rémi Denis-Courmont	30797e4ff6	lavc/vp8dsp: add R-V V vp7_idct_dc_add This just computes the direct coefficient and hands over to code shared with VP8. Accordingly the bulk of changes are just rewriting the VP8 code to share. Nothing to write home about: vp7_idct_dc_add_c: 1.7 vp7_idct_dc_add_rvv_i32: 1.2	8 months ago

1 2 3 4 5 ...

50296 Commits (a0ff31e740ec05e947ee0759c9f805a8894586ff)