mirror of https://github.com/FFmpeg/FFmpeg.git
When the input of a pass has 15 or 16 bits of precision (in particular the column pass), adding the bias to W4 may overflow the input to pmaddwd. The bias must therefore be added after the first butterfly instead. To do so, this change exploits the fact that m15 is zeroed but otherwise unused. When the pass is safe, an address can be used directly and the number of xmm registers can be reduced. Otherwise, the 32-bit bias is loaded into m15.

Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
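The overflow described in the commit message can be illustrated with a simplified scalar model in C. This is only a sketch under stated assumptions, not the assembly touched by the commit: the constants `W4_COEFF` and `BIAS_UNITS` and the helper names are hypothetical, and the model covers a single 32-bit lane of pmaddwd. It shows that folding a bias into a 16-bit operand before the multiply can wrap when the input uses the full 16-bit range, while adding the equivalent 32-bit bias after the multiply does not.

```c
#include <stdint.h>

/* Hypothetical constants for illustration only; not taken from the commit. */
#define W4_COEFF   16384  /* assumed coefficient value */
#define BIAS_UNITS 1      /* bias in input units: bias = BIAS_UNITS * W4_COEFF */

/* Wrap a value to signed 16 bits, as a packed 16-bit add would. */
static int16_t wrap16(int32_t v)
{
    v &= 0xFFFF;
    return (int16_t)(v >= 0x8000 ? v - 0x10000 : v);
}

/* One 32-bit lane of pmaddwd: a0*b0 + a1*b1 on signed 16-bit inputs. */
static int32_t pmaddwd_lane(int16_t a0, int16_t b0, int16_t a1, int16_t b1)
{
    return (int32_t)a0 * b0 + (int32_t)a1 * b1;
}

/* Bias folded into the 16-bit operand before the multiply (can wrap). */
static int32_t bias_before(int16_t x)
{
    return pmaddwd_lane(wrap16((int32_t)x + BIAS_UNITS), W4_COEFF, 0, 0);
}

/* Bias added as a 32-bit value after the multiply (no 16-bit wrap). */
static int32_t bias_after(int16_t x)
{
    return pmaddwd_lane(x, W4_COEFF, 0, 0) + BIAS_UNITS * W4_COEFF;
}
```

In this model, both helpers agree for a small input such as 1000, but for an input of 32767 `bias_before` wraps the operand to -32768 and returns a result with the wrong sign, whereas `bias_after` stays within the 32-bit range.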
parent 3b336ec2fb
commit e652f69b35
2 changed files with 16 additions and 5 deletions