Merge pull request #15136 from ChipKerchner:dotProd_unroll
* Unroll multiply and add instructions in dotProd_32f - 35% faster.
* Eliminate unnecessary v_reduce_sum instructions.
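The two changes above can be illustrated with a minimal scalar sketch. This is not the code from the pull request: `dot_unrolled` is a hypothetical name, the unroll factor of four is an assumption, and plain `float` accumulators stand in for the SIMD registers that the real dotProd_32f would use. It shows only the general idea: several independent multiply-add chains inside the loop, and one reduction after the loop instead of one per iteration.

```cpp
#include <cstddef>

// Hypothetical stand-in for an unrolled dot product; not the OpenCV source.
// Each accX plays the role of one SIMD accumulator register.
double dot_unrolled(const float* a, const float* b, std::size_t n)
{
    // Four independent accumulators, so consecutive multiply-adds
    // do not have to wait on each other's results.
    float acc0 = 0.f, acc1 = 0.f, acc2 = 0.f, acc3 = 0.f;

    std::size_t i = 0;
    for (; i + 4 <= n; i += 4)
    {
        acc0 += a[i]     * b[i];
        acc1 += a[i + 1] * b[i + 1];
        acc2 += a[i + 2] * b[i + 2];
        acc3 += a[i + 3] * b[i + 3];
    }

    // Single reduction after the loop (the analogue of calling a
    // horizontal sum once instead of once per iteration).
    double sum = static_cast<double>(acc0) + acc1 + acc2 + acc3;

    // Scalar tail for the remaining 0..3 elements.
    for (; i < n; ++i)
        sum += static_cast<double>(a[i]) * b[i];

    return sum;
}
```

Whether a factor of four is the right choice depends on the target's multiply-add latency and register count; the 35% figure quoted above belongs to the pull request's own implementation, not to this sketch.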
parent ac425f67e4
commit 0db4fb1835
1 changed file with 22 additions and 1 deletion