opencv

Commit Graph

Author	SHA1	Message	Date
Smirnov Egor	e608adea60	add ArgMax and ArgMin layers	3 years ago
HAN Liutong	4935b14539	Merge pull request #21012 from hanliutong:rvv_clang Update RVV backend for using Clang. * Update cmake file of clang. * Modify the RVV optimization on DNN to adapt to clang. * Modify intrin_rvv: Disable some existing types. * Modify intrin_rvv: Reinterpret instead of load&cast. * Modify intrin_rvv: Update load&store without cast. * Modify intrin_rvv: Rename vfredsum to fredosum. * Modify intrin_rvv: Rewrite Check all/any by using vpopc. * Modify intrin_rvv: Use reinterpret instead of c-style casting. * Remove all macros which is not used in v_reinterpret * Rename vpopc to vcpop according to spec.	3 years ago
Alexander Alekhin	0835611d3a	dnn(test): re-enable tests which works with OpenVINO 2021.4.x	3 years ago
rogday	1613d30544	Merge pull request #21159 from rogday:ceil_mode fix ceil_mode for Average/MaxPooling * fix ceil_mode * add a comment	3 years ago
Alexander Alekhin	bd396e1fd5	dnn(test): re-enable tests which works with OpenVINO 2021.4.x (3.4)	3 years ago
Alexander Alekhin	f55c9ed1ba	dnn(test): drop non OCV/CPU cases for Int8 - zero code coverage and up to x3-x8 tests slowdown - implementation executes OCV/CPU in all cases - wrong skip conditions	3 years ago
Smirnov Egor	33e97e994d	add sum of 1 input	3 years ago
Smirnov Egor	11e6848bb9	add default order to transpose	3 years ago
Smirnov Egor	829410729c	add new (Log)SoftMax simplification passes	3 years ago
Smirnov Egor	0e2a3686c0	add alpha parameter to ELU layer	3 years ago
Alexander Alekhin	66b2140892	build: eliminate C4309 warning from protobuf files with MSVS2017	3 years ago
Andrew Ryrie	ea7d4be3f8	Merge pull request #20658 from smbz:lstm_optimisation * dnn: LSTM optimisation This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm. fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications: - Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned. - Allow for weight matrices where the number of columns is not a multiple of 8. I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on. * Fix warning about initialisation order * Remove C++11 syntax * Fix build when AVX(2) is not available In this case the CV_TRY_X macros are defined to 0, rather than being undefined. * Minor changes as requested: - Don't check hardware support for AVX(2) when dispatch is disabled for these - Add braces * Fix out-of-bounds access in fully connected layer The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway). This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems. * Improve tail mask handling - Use static array for generating tail masks (as requested) - Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs * Revert whitespace change * Improve readability of conditions for using AVX * dnn(lstm): minor coding style changes, replaced left aligned load	3 years ago
Smirnov Egor	05db8784ae	fix Clip, LeakyReLU, LRN, Split defaults	3 years ago
Supernovae	b594ed99b8	Merge pull request #20933 from shubham-shahh:master Improved overall readability of the code * grid_nms.cu: minor fix-ups * Update grid_stride_range.hpp * Update tf_importer.cpp	3 years ago
Alexander Alekhin	58b06222ff	dnn(DataLayer): fix CPU/OpenCL code paths for FP16 handling	3 years ago
Alexander Alekhin	58dc397930	dnn(test): add two_inputs test with FP32/U8 data types - remove similar test from IE scope under HAVE_INF_ENGINE	3 years ago
yuki takehara	a6277370ca	Merge pull request #21107 from take1014:remove_assert_21038 resolves #21038 * remove C assert * revert C header * fix several points in review * fix test_ds.cpp	3 years ago
Alexander Alekhin	31b2d6be75	dnn(test): update InferenceEngine tests (4.x)	3 years ago
Alexander Alekhin	985aa0423d	dnn(test): update InferenceEngine tests	3 years ago
Hanxi Guo	1fcf7ba5bc	Merge pull request #20406 from MarkGHX:gsoc_2021_webnn [GSoC] OpenCV.js: Accelerate OpenCV.js DNN via WebNN * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Fix the build issue * Update concat_layer.cpp Still have some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Delete bib19450.aux * Add WebNN backend for OpenCV DNN Module Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp Add WebNN head files into OpenCV 3rd partiy files Create webnn.hpp update cmake Complete README and add OpenCVDetectWebNN.cmake file add webnn.cpp Modify webnn.cpp Can successfully compile the codes for creating a MLContext Update webnn.cpp Update README.md Update README.md Update README.md Update README.md Update cmake files and update README.md Update OpenCVDetectWebNN.cmake and README.md Update OpenCVDetectWebNN.cmake Fix OpenCVDetectWebNN.cmake and update README.md Add source webnn_cpp.cpp and libary libwebnn_proc.so Update dnn.cpp Update dnn.cpp Update dnn.cpp Update dnn.cpp update dnn.cpp update op_webnn update op_webnn Update op_webnn.hpp update op_webnn.cpp & hpp Update op_webnn.hpp Update op_webnn update the skeleton Update op_webnn.cpp Update op_webnn Update op_webnn.cpp Update op_webnn.cpp Update op_webnn.hpp update op_webnn update op_webnn Solved the problems of released variables. Fixed the bugs in op_webnn.cpp Implement op_webnn Implement Relu by WebNN API Update dnn.cpp for better test Update elementwise_layers.cpp Implement ReLU6 Update elementwise_layers.cpp Implement SoftMax using WebNN API Implement Reshape by WebNN API Implement PermuteLayer by WebNN API Implement PoolingLayer using WebNN API Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Update pooling_layer.cpp Implement poolingLayer by WebNN API and add more detailed logs Update dnn.cpp Update dnn.cpp Remove redundant codes and add more logs for poolingLayer Add more logs in the pooling layer implementation Fix the indent issue and resolve the compiling issue Fix the build problems Fix the build issue FIx the build issue Update dnn.cpp Update dnn.cpp * Fix the build issue * Implement BatchNorm Layer by WebNN API * Update convolution_layer.cpp This is a temporary file for Conv2d layer implementation * Integrate some general functions into op_webnn.cpp&hpp * Update const_layer.cpp * Update convolution_layer.cpp Still have some bugs that should be fixed. * Update conv2d layer and fc layer still have some problems to be fixed. * update constLayer, conv layer, fc layer There are still some bugs to be fixed. * Update conv2d layer, fully connected layer and const layer * Update convolution_layer.cpp * Add OpenCV.js DNN module WebNN Backend (both using webnn-polyfill and electron) * Update dnn.cpp * Fix Error in dnn.cpp * Resolve duplication in conditions in convolution_layer.cpp * Fixed the issues in the comments * Fix building issue * Update tutorial * Fixed comments * Address the comments * Update CMakeLists.txt * Offer more accurate perf test on native * Add better perf tests for both native and web * Modify per tests for better results * Use more latest version of Electron * Support latest WebNN Clamp op * Add definition of HAVE_WEBNN macro * Support group convolution * Implement Scale_layer using WebNN * Add Softmax option for native classification example * Fix comments * Fix comments	3 years ago
Alexander Alekhin	8041ab8a61	Merge pull request #21025 from alalek:issue_21004 * dnn(ocl4dnn): fix LRN layer accuracy problems - FP16 intermediate computation is not accurate and may provide NaN values * dnn(test): update tolerance for FP16	3 years ago
Alexander Alekhin	d934bb15b0	Merge pull request #20998 from alalek:update_protobuf_3.19.1 3rdparty(protobuf): upgrade 3.5.2 => 3.19.1 * 3rdparty(protobuf): upgrade 3.5.2 => 3.19.1 * dnn: update protobuf files (3.19.1) * 3rdparty(protobuf): re-apply OpenCV patch for custom fields (3.19.1) * protobuf: suppress new build warnings * protobuf: remove unused files	3 years ago
ZaKiiiiiiiii	98b6ce353c	Merge pull request #20904 from Crayon-new:fix_bug_in_maxLayer fix bug: wrong output dimension when "keep_dims" is false in pooling layer. * fix bug in max layer * code align * delete permute layer and add test case * add name assert * check other cases * remove c++11 features * style:add "const" remove assert * style:sanitize file names	3 years ago
Alexander Alekhin	562f2375c5	dnn(test): skip tests with high memory usage - 32-bit configuration may fail due to memory fragmentation	3 years ago
Alexander Alekhin	c1d61c88e9	dnn(cmake): don't hijack OpenCL options with Tengine	3 years ago
Alexander Alekhin	d484939c02	Merge pull request #20999 from alalek:dnn_replace_deprecated_calls dnn(protobuf): replace deprecated calls * dnn: replace deprecated ByteSize() => ByteSizeLong() * dnn: replace deprecated calls, use GetRepeatedFieldRef	3 years ago
rogday	b3f966e2ca	Merge pull request #20883 from rogday:eltwise_refactoring * backport elementwise_layers refactor * keep NULL	3 years ago
Alexander Alekhin	1926e919be	dnn(int8): fix using of incorrect UMat constructor	3 years ago
Smirnov Egor	1feb3838b5	add Ceil, Floor, Log, Round, Sqrt, Not, Equal, Less, Greater	3 years ago
Smirnov Egor	238dbffb48	change asserts for Sum	3 years ago
Smirnov Egor	a9d7b6eab7	fix const - input and remove unimplemented function	3 years ago
Smirnov Egor	9c84749e2c	backport YOLOv4x-mish new_coords CUDA implementation	3 years ago
Alexander Alekhin	8c2dd5fb9a	dnn(ocl4dnn): cleanup dead code, improve logging	3 years ago
Alexander Alekhin	724e04e979	dnn(ocl4dnn): add extra checks to convolution layer - prevent running code over unsupported/non-tested configurations - prevent integer div by zero	3 years ago
Alexander Alekhin	94e92cd6c0	dnn(ocl): skip int8 tests due to memory access issues	3 years ago
Smirnov Egor	2221dcc9f2	add SoftNMS implementation	3 years ago
Oliver Kuckertz	a3d7811f24	Merge pull request #20725 from mologie:fix-dnn-tf-on-arm * dnn: fix unaligned memory access crash on armv7 The getTensorContent function would return a Mat pointing to some member of a Protobuf-encoded message. Protobuf does not make any alignment guarantees, which results in a crash on armv7 when loading models while bit 2 is set in /proc/cpu/alignment (or the relevant kernel feature for alignment compatibility is disabled). Any read attempt from the previously unaligned data member would send SIGBUS. As workaround, this commit makes an aligned copy via existing clone functionality in getTensorContent. The unsafe copy=false option is removed. Unfortunately, a rather crude hack in PReLUSubgraph in fact writes(!) to the Protobuf message. We limit ourselves to fixing the alignment issues in this commit, and add getTensorContentRefUnaligned to cover the write case with a safe memcpy. A FIXME marks the issue. * dnn: reduce amount of .clone() calls * dnn: update FIXME comment Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	3 years ago
Alexander Alekhin	646924fce8	dnn(pytest/test_input_3d): reload model between switching targets	3 years ago
HAN Liutong	e5fb50476c	Merge pull request #20521 from hanliutong:dev-rvv-multiVLEN Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics. * Update fastGEMM for multi VLEN. * Update fastGEMM1T for multi VLEN. * Update fastDepthwiseConv for multi VLEN. * Update fastConv for multi VLEN. * Replace malloc with cv::AutoBuffer.	3 years ago
Alexander Alekhin	3e6f27522b	pre: OpenCV 4.5.4 (version++)	3 years ago
Alexander Alekhin	ebef84e9ea	pre: OpenCV 3.4.16 (version++)	3 years ago
Jebastin Nadar	cce78cc5e2	Merge pull request #20535 from SamFC10:onnx-q dnn : int8 quantized layers support in onnx importer * added quantized layers support in onnx importer * added more cases in eltwise node, some more checks * added tests for quantized nodes * relax thresholds for failed tests, address review comments * refactoring based on review comments * added support for unsupported cases and pre-quantized resnet50 test * relax thresholds due to int8 resize layer	3 years ago
Zihao Mu	9085b933d8	Merge pull request #20702 from zihaomu:tf_expand_dim_layer Add ExpandDims layer of tf_importer.cpp * Add ExpandDims to tf_importer. * add -1 expand test case. * Support different dimensions of input. * Compatible with 5-dimensional NDHWC data * Code align * support 3-dim input. * 3-dim bug fixed. * fixing error of code format.	3 years ago
YashasSamaga	505dde09de	support broadcasting in eltwise ops	3 years ago
SamFC10	87ebf2e50b	fix illegal memory access in int8 convolution	3 years ago
Alexander Alekhin	f977d10a19	dnn(ocl): fix conv DWCONV workgroup	3 years ago
Alexander Alekhin	846317ef37	dnn(ocl): fix conv BASIC workgroup	3 years ago
rogday	38b9ec7a18	Merge pull request #20682 from rogday:min * Add Min layer to CPU, OpenCL, Halide, Inference Engine, NGraph and CUDA * fix indentation * add min to fusion and halide tests; fix doc	3 years ago
SamFC10	9c5d7716e2	fix for unsqueeze opset version 13	3 years ago
rogday	c410d7a97d	Merge pull request #20671 from rogday:yolov4x-mish Add support for YOLOv4x-mish * backport to 3.4 for supporting yolov4x-mish * add YOLOv4x-mish test * address review comments Co-authored-by: Guo Xu <guoxu@1school.com.cn>	3 years ago

1 2 3 4 5 ...

1771 Commits (41d108ead6a1f8701092f6b1bee2569f68c16bfd)