mirror of https://github.com/opencv/opencv.git
Merge pull request #16556 from ChipKerchner:vectorizeIntegralSumPixels
* Vectorize calculating integral for line for single and multiple channels * Single vector processing for 4-channels - 25-30% faster * Single vector processing for 4-channels - 25-30% faster * Fixed AVX512 code for 4 channels * Disable 3 channel 8UC1 to 32S for SSE2 and SSE3 (slower). Use new version of 8UC1 to 64F for AVX512.pull/16698/head^2
parent
7ffab23a53
commit
8c24af66bd
1 changed files with 950 additions and 91 deletions
File diff suppressed because it is too large
Load Diff
Loading…
Reference in new issue