opencv

Commit Graph

Author	SHA1	Message	Date
Smirnov Egor	fec2c7e715	fix Flatten layer	3 years ago
Maksim Shabunin	792b7e0629	(3.4) Fixed several issues found by static analysis original commit: `a079c2eb7c`	3 years ago
Smirnov Egor	e97c7e042b	fix max_unpool missing attributes, add default value of keepdims in reducemean/max/sum, add support for keepdims=true in full reduction branch, add new padding type to Pad	3 years ago
rogday	4827fe86bb	Merge pull request #21088 from rogday:onnx_tests Onnx conformance tests * Add ONNX conformance tests * dnn(test): add filters for ONNX conformance tests * add filter lists for OCV backend * address review comments * move test_clip_inbounds to all_denylist * address clip issue * avoid empty lists Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	3 years ago
rogday	1613d30544	Merge pull request #21159 from rogday:ceil_mode fix ceil_mode for Average/MaxPooling * fix ceil_mode * add a comment	3 years ago
Smirnov Egor	33e97e994d	add sum of 1 input	3 years ago
Smirnov Egor	11e6848bb9	add default order to transpose	3 years ago
Smirnov Egor	829410729c	add new (Log)SoftMax simplification passes	3 years ago
Smirnov Egor	0e2a3686c0	add alpha parameter to ELU layer	3 years ago
Andrew Ryrie	ea7d4be3f8	Merge pull request #20658 from smbz:lstm_optimisation * dnn: LSTM optimisation This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm. fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications: - Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned. - Allow for weight matrices where the number of columns is not a multiple of 8. I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on. * Fix warning about initialisation order * Remove C++11 syntax * Fix build when AVX(2) is not available In this case the CV_TRY_X macros are defined to 0, rather than being undefined. * Minor changes as requested: - Don't check hardware support for AVX(2) when dispatch is disabled for these - Add braces * Fix out-of-bounds access in fully connected layer The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway). This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems. * Improve tail mask handling - Use static array for generating tail masks (as requested) - Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs * Revert whitespace change * Improve readability of conditions for using AVX * dnn(lstm): minor coding style changes, replaced left aligned load	3 years ago
Smirnov Egor	05db8784ae	fix Clip, LeakyReLU, LRN, Split defaults	3 years ago
Alexander Alekhin	58b06222ff	dnn(DataLayer): fix CPU/OpenCL code paths for FP16 handling	3 years ago
yuki takehara	a6277370ca	Merge pull request #21107 from take1014:remove_assert_21038 resolves #21038 * remove C assert * revert C header * fix several points in review * fix test_ds.cpp	3 years ago
Alexander Alekhin	8041ab8a61	Merge pull request #21025 from alalek:issue_21004 * dnn(ocl4dnn): fix LRN layer accuracy problems - FP16 intermediate computation is not accurate and may provide NaN values * dnn(test): update tolerance for FP16	3 years ago
ZaKiiiiiiiii	98b6ce353c	Merge pull request #20904 from Crayon-new:fix_bug_in_maxLayer fix bug: wrong output dimension when "keep_dims" is false in pooling layer. * fix bug in max layer * code align * delete permute layer and add test case * add name assert * check other cases * remove c++11 features * style:add "const" remove assert * style:sanitize file names	3 years ago
Alexander Alekhin	d484939c02	Merge pull request #20999 from alalek:dnn_replace_deprecated_calls dnn(protobuf): replace deprecated calls * dnn: replace deprecated ByteSize() => ByteSizeLong() * dnn: replace deprecated calls, use GetRepeatedFieldRef	3 years ago
rogday	b3f966e2ca	Merge pull request #20883 from rogday:eltwise_refactoring * backport elementwise_layers refactor * keep NULL	3 years ago
Smirnov Egor	238dbffb48	change asserts for Sum	3 years ago
Smirnov Egor	a9d7b6eab7	fix const - input and remove unimplemented function	3 years ago
Alexander Alekhin	8c2dd5fb9a	dnn(ocl4dnn): cleanup dead code, improve logging	3 years ago
Alexander Alekhin	724e04e979	dnn(ocl4dnn): add extra checks to convolution layer - prevent running code over unsupported/non-tested configurations - prevent integer div by zero	3 years ago
Oliver Kuckertz	a3d7811f24	Merge pull request #20725 from mologie:fix-dnn-tf-on-arm * dnn: fix unaligned memory access crash on armv7 The getTensorContent function would return a Mat pointing to some member of a Protobuf-encoded message. Protobuf does not make any alignment guarantees, which results in a crash on armv7 when loading models while bit 2 is set in /proc/cpu/alignment (or the relevant kernel feature for alignment compatibility is disabled). Any read attempt from the previously unaligned data member would send SIGBUS. As workaround, this commit makes an aligned copy via existing clone functionality in getTensorContent. The unsafe copy=false option is removed. Unfortunately, a rather crude hack in PReLUSubgraph in fact writes(!) to the Protobuf message. We limit ourselves to fixing the alignment issues in this commit, and add getTensorContentRefUnaligned to cover the write case with a safe memcpy. A FIXME marks the issue. * dnn: reduce amount of .clone() calls * dnn: update FIXME comment Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	3 years ago
Alexander Alekhin	f977d10a19	dnn(ocl): fix conv DWCONV workgroup	3 years ago
Alexander Alekhin	846317ef37	dnn(ocl): fix conv BASIC workgroup	3 years ago
SamFC10	9c5d7716e2	fix for unsqueeze opset version 13	3 years ago
rogday	c410d7a97d	Merge pull request #20671 from rogday:yolov4x-mish Add support for YOLOv4x-mish * backport to 3.4 for supporting yolov4x-mish * add YOLOv4x-mish test * address review comments Co-authored-by: Guo Xu <guoxu@1school.com.cn>	4 years ago
Alexander Alekhin	6e66a9222a	dnn(onnx): fix format specifier	4 years ago
Zihao Mu	51b03b87e6	BiasAdd could load Const from second place.	4 years ago
rogday	d31b93b513	Merge pull request #20674 from rogday:prelu_slope Fix PReLU negative slope access pattern * fix prelu negative slope access pattern * change begin() to ptr()	4 years ago
rogday	4807cd8a6e	Merge pull request #20605 from rogday:split_slice_shenanigans Add Normalize subgraph, fix Slice, Mul and Expand * Add Normalize subgraph, support for starts<0 and axis<0 in Slice, Mul broadcasting in the middle and fix Expand's unsqueeze * remove todos * remove range-based for loop * address review comments * change >> to > > in template * fix indexation * fix expand that does nothing	4 years ago
Alexander Alekhin	35e824c287	dnn(ocl): fix out of bound access in GEMM-like kernels - dropped usage of CreateSubBuffer() - buffers lifetime management issue - fixed elementwise offset - avoid out of bounds read access	4 years ago
Alexander Alekhin	5578ad5e14	dnn(ocl): fix automatic globalsize adjusting - if kernel code doesn't support that	4 years ago
Alexander Alekhin	5b2c016834	dnn(ocl): avoid out of buffer access in copyWeightsSwizzled	4 years ago
Alexander Alekhin	407adc7061	dnn(ocl): fix buffer offsets in IDLF kernel - drop CreateSubBuffer - fix FUSED_CONV_ELTWISE mode	4 years ago
rogday	d0e612dc36	Merge pull request #20647 from rogday:resize_concat_optimization Fix resize+concat optimization * fix resize+concat optimization * add comment and fix indentation	4 years ago
WJJ1995	edc442afdb	Merge pull request #20511 from wjj19950828:add_humanseg_support_0806 * support PPSeg model for dnn module * fixed README for CI * add test case * fixed bug * deal with comments * rm dnn_model_runner * update test case * fixed bug for testcase * update testcase	4 years ago
Alexander Alekhin	ae6fabc6fe	dnn(ocl): drop CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE check - it is a hint and it should not block kernel execution	4 years ago
Vincent Rabaud	38d0063c36	Do not use deprecated ReleaseCleared in protobuf library. This is to make code work with protobuf arenas for memory management (ReleaseCleared is incompatible). The cleaning of the memory is also simpler.	4 years ago
Alexander Alekhin	f28e4b86fb	dnn(ocl): fix top initialization in verifyResult	4 years ago
Vincent Rabaud	9cfa84313c	Use the one argument version of SetTotalBytesLimit. The two argument versions has been deprecated, cf https://developers.google.com/protocol-buffers/docs/reference/cpp/google.protobuf.io.coded_stream	4 years ago
Smirnov Egor	fe625a558e	fix hasDynamicShapes for batch_size and fix axis selection in Scale layer	4 years ago
Smirnov Egor	9ef41f68fb	fix Split partial sum	4 years ago
Julia Bareeva	cfb36443fb	Merge pull request #20506 from JulieBar:lstm_activations * Support activations(Sigmoid, Tanh) for LSTM * fix warning	4 years ago
Smirnov Egor	739ff84732	add Max layer to TFImporter	4 years ago
SamFC10	2a177052de	fix bug in prior-box variances	4 years ago
rogday	cff0168f3a	Merge pull request #20453 from rogday:onnx_importer_fix Split layer dispatch into functions in ONNXImporter * split layer dispatch into functions * fixes * identation and comment fixes * fix constness	4 years ago
Julia Bareeva	4e5699fa71	Merge pull request #20450 from JulieBar:lstm_inside Support non-zero hidden state for LSTM * fully support non-zero hidden state for LSTM * check dims of hidden state for LSTM * fix failed test Test_Model.TextRecognition * add new tests for LSTM w/ non-zero hidden params Co-authored-by: Julie Bareeva <julia.bareeva@xperience.ai>	4 years ago
Smirnov Egor	024b43ca06	implement asymmetric padding for conv2d, max_pool and conv2d_backprop_input	4 years ago
SamFC10	96d35f7c54	Fix convolution asymmetric padding bug in onnx importer	4 years ago
Alexander Alekhin	fbde0c6c96	dnn(ie): fix handling of 1D and non-32F outputs of InferenceEngine	4 years ago

1 2 3 4 5 ...

1075 Commits (a079acc0d92d40e23da6036eea3e042beec62523)