mirror of https://github.com/FFmpeg/FFmpeg.git
Segmented loads may be slower than not. So this advantageously uses a unit-strided load and narrowing shifts instead. Before: ps_add_squares_c: 60757.7 ps_add_squares_rvv_f32: 22242.5 After: ps_add_squares_c: 60516.0 ps_add_squares_rvv_i64: 17067.7release/7.0
parent
ab78d22553
commit
eb508702a8
2 changed files with 8 additions and 4 deletions
Loading…
Reference in new issue