opencv

Merge pull request #16556 from ChipKerchner:vectorizeIntegralSumPixels

* Vectorize calculating integral for line for single and multiple channels

* Single vector processing for 4-channels - 25-30% faster

* Single vector processing for 4-channels - 25-30% faster

* Fixed AVX512 code for 4 channels

* Disable 3 channel 8UC1 to 32S for SSE2 and SSE3 (slower).  Use new version of 8UC1 to 64F for AVX512.

pull/16698/head^2

Chip Kerchner 5 years ago committed by

GitHub

parent 7ffab23a53

commit 8c24af66bd

No known key found for this signature in database

GPG Key ID: 4AEE18F83AFDEB23

1 changed files with 950 additions and 91 deletions

Split View

1041

modules/imgproc/src/sumpixels.simd.hpp

View File

File diff suppressed because it is too large Load Diff

Merge pull request #16556 from ChipKerchner:vectorizeIntegralSumPixels

1041 modules/imgproc/src/sumpixels.simd.hpp View File

1041

modules/imgproc/src/sumpixels.simd.hpp

View File