Chiebot-Mirror/opencv: Open Source Computer Vision Library - opencv - Gitea: Git with a cup of tea

Open Source Computer Vision Library https://opencv.org/

casualwinds 7b399c4248 Merge pull request #24280 from casualwind:parallel_opt Optimization for parallelization when large core number #24280 Problem description： When the number of cores is large, OpenCV’s thread library may reduce performance when processing parallel jobs. The reason for this problem: When the number of cores (the thread pool initialized the threads, whose number is as same as the number of cores) is large, the main thread will spend too much time on waking up unnecessary threads. When a parallel job needs to be executed, the main thread will wake up all threads in sequence, and then wait for the signal for the job completion after waking up all threads. When the number of threads is larger than the parallel number of a job slices, there will be a situation where the main thread wakes up the threads in sequence and the awakened threads have completed the job, but the main thread is still waking up the other threads. The threads woken up by the main thread after this have nothing to do, and the broadcasts made by the waking threads take a lot of time, which reduce the performance. Solution： Reduce the time for the process of main thread waking up the worker threads through the following two methods: • The number of threads awakened by the main thread should be adjusted according to the parallel number of a job slices. If the number of threads is greater than the number of the parallel number of job slices, the total number of threads awakened should be reduced. • In the process of waking up threads in sequence, if the main thread finds that all parallel job slices have been allocated, it will jump out of the loop in time and wait for the signal for the job completion. Performance Test: The tests were run in the manner described by https://github.com/opencv/opencv/wiki/HowToUsePerfTests. At core number = 160, There are big performance gain in some cases. Take the following cases in the video module as examples: OpticalFlowPyrLK_self::Path_Idx_Cn_NPoints_WSize_Deriv::("cv/optflow/frames/VGA_%02d.png", 2, 1, (9, 9), 11, true) Performance improves 191%:0.185405ms ->0.0636496ms perf::DenseOpticalFlow_VariationalRefinement::(320x240, 10, 10) Performance improves 112%:23.88938ms -> 11.2562ms Among all the modules, the performance improvement is greatest on module video, and there are also certain improvements on other modules. At core number = 160, the times labeled below are the geometric mean of the average time of all cases for one module. The optimization is available on each module. overall \| time(ms) \| \| \| \| \| \| \| -- \| -- \| -- \| -- \| -- \| -- \| -- \| -- \| -- module name \| gapi \| dnn \| features2d \| objdetect \| core \| imgproc \| stitching \| video original \| 0.185 \| 1.586 \| 9.998 \| 11.846 \| 0.205 \| 0.215 \| 164.409 \| 0.803 optimized \| 0.174 \| 1.353 \| 9.535 \| 11.105 \| 0.199 \| 0.185 \| 153.972 \| 0.489 Performance improves \| 6% \| 17% \| 5% \| 7% \| 3% \| 16% \| 7% \| 64% Meanwhile, It is found that adjusting the order of test cases will have an impact on some test cases. For example, we used option --gtest-shuffle to run opencv_perf_gapi, the performance of TestPerformance::CmpWithScalarPerfTestFluid/CmpWithScalarPerfTest::(compare_f, CMP_GE, 1920x1080, 32FC1, { gapi.kernel_package }) case had 30% changes compared to the case without shuffle. I would like to ask if you have also encountered such a situation and could you share your experience? ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake		1 year ago
.github	Added CI with OpenVINO for DNN and G-API.	1 year ago
3rdparty	Merge pull request #24274 from vrabaud:webp_1.3.2	1 year ago
apps	Add missing <sstream> includes	2 years ago
cmake	Merge pull request #24286 from ashadrina:intel_icx_compiler_support	1 year ago
data	Merge pull request #22727 from su77ungr:patch-1	2 years ago
doc	Merge pull request #24179 from Kumataro:fix24145	2 years ago
include	exclude opencv_contrib modules	5 years ago
modules	Merge pull request #24280 from casualwind:parallel_opt	1 year ago
platforms	Merge pull request #24305 from hanliutong:toolchain	1 year ago
samples	Merge pull request #24201 from lpylpy0514:4.x	1 year ago
.editorconfig	add .editorconfig	6 years ago
.gitattributes	cmake: generate and install ffmpeg-download.ps1	7 years ago
.gitignore	Merge pull request #17165 from komakai:objc-binding	5 years ago
CMakeLists.txt	cmake: revise OPENCV_DNN_BACKEND_DEFAULT integration	2 years ago
CONTRIBUTING.md	migration: github.com/opencv/opencv	9 years ago
COPYRIGHT	copyright: 2023 (update)	2 years ago
LICENSE	Merge pull request #18073 from vpisarev:apache2_license	5 years ago
README.md	fix 4.x links	3 years ago
SECURITY.md	Updated PGP key for security reports	2 years ago

README.md

OpenCV: Open Source Computer Vision Library

Resources

Homepage: https://opencv.org
- Courses: https://opencv.org/courses
Docs: https://docs.opencv.org/4.x/
Q&A forum: https://forum.opencv.org
- previous forum (read only): http://answers.opencv.org
Issue tracking: https://github.com/opencv/opencv/issues
Additional OpenCV functionality: https://github.com/opencv/opencv_contrib

Contributing

Please read the contribution guidelines before starting work on a pull request.

Summary of the guidelines:

One pull request per issue;
Choose the right base branch;
Include tests and documentation;
Clean up "oops" commits before submitting;
Follow the coding style guide.