FFmpeg

Author	SHA1	Message	Date
Martin Storsjö	dd2e524ffa	riscv: Use the correct path for including asm.S Signed-off-by: Martin Storsjö <martin@martin.st>	2 years ago
Rémi Denis-Courmont	c03f9654c9	lavc/aacpsdsp: RISC-V V stereo_interpolate[0]	2 years ago
Rémi Denis-Courmont	a15edb0bc0	lavc/aacpsdsp: RISC-V V hybrid_synthesis_deint	2 years ago
Rémi Denis-Courmont	09f907999f	lavc/aacpsdsp: RISC-V V hybrid_analysis_ileave	2 years ago
Rémi Denis-Courmont	15c3a0bd6e	lavc/aacpsdsp: RISC-V V hybrid_analysis This starts with one-time initialisation of the 26 constant factors like `08edacc248`. That is done with the scalar instruction set. While the formula can readily be vectored, the gains would (probably) be more than lost in transfering the results back to FP registers (or suitably reshuffling them into vector registers). Note that the main loop could likely be scheduled sligthly better by expanding the filter macro and interleaving loads with arithmetic. It is not clear yet if that would be relevant for vector processing (as opposed to traditional SIMD). We could also use fewer vectors, but there is not much point in sparing them (they are all callee-clobbered).	2 years ago
Rémi Denis-Courmont	e180326a0b	lavc/aacpsdsp: RISC-V V mul_pair_single	2 years ago
Rémi Denis-Courmont	b0cacf4c3f	lavc/aacpsdsp: RISC-V V add_squares	2 years ago
Rémi Denis-Courmont	453aba71e6	lavc/vorbisdsp: RISC-V V inverse_coupling This uses the following vectorisation: for (i = 0; i < blocksize; i++) { ang[i] = mag[i] - copysignf(fmaxf(ang[i], 0.f), mag[i]); mag[i] = mag[i] - copysignf(fminf(ang[i], 0.f), mag[i]); }	2 years ago
Rémi Denis-Courmont	220dfd0945	lavc/fmtconvert: RISC-V V int32_to_float_fmul_array8	2 years ago
Rémi Denis-Courmont	47a10b9a99	lavc/fmtconvert: RISC-V V int32_to_float_fmul_scalar	2 years ago
Rémi Denis-Courmont	f41ae62f39	lavc/audiodsp: RISC-V V scalarproduct_int16	2 years ago
Rémi Denis-Courmont	f127a5d29d	lavc/audiodsp: RISC-V V vector_clipf	2 years ago
Rémi Denis-Courmont	27da9514c3	lavc/audiodsp: RISC-V V vector_clip_int32	2 years ago
Rémi Denis-Courmont	1edac8eb46	lavc/pixblockdsp: RISC-V I get_pixels Benchmarks on SiFive U74-MC (courtesy of Shanghai StarFive Tech): get_pixels_c: 180.0 get_pixels_rvi: 136.7	2 years ago
Rémi Denis-Courmont	04d092e7d5	lavc/audiodsp: RISC-V F vector_clipf RV64G supports MIN & MAX instructions natively only on floating point registers, not general purpose ones. The later would require the Zbb extension. Due to that, it is actually faster to perform the clipping "properly" in FPU. Benchmarks on SiFive U74-MC (courtesy of Shanghai StarFive Tech): audiodsp.vector_clipf_c: 29551.5 audiodsp.vector_clipf_rvf: 17871.0 Also tried unrolling with 2 or 8 elements but it gets worse either way.	2 years ago

... 2 3 4 5 6

265 Commits (c4122406f6d2726aea833480a2a8e345833dd881)