Alexander Alekhin
d484939c02
Merge pull request #20999 from alalek:dnn_replace_deprecated_calls
...
dnn(protobuf): replace deprecated calls
* dnn: replace deprecated ByteSize() => ByteSizeLong()
* dnn: replace deprecated calls, use GetRepeatedFieldRef
3 years ago
rogday
b3f966e2ca
Merge pull request #20883 from rogday:eltwise_refactoring
...
* backport elementwise_layers refactor
* keep NULL
3 years ago
Smirnov Egor
238dbffb48
change asserts for Sum
3 years ago
Smirnov Egor
a9d7b6eab7
fix const - input and remove unimplemented function
3 years ago
Alexander Alekhin
8c2dd5fb9a
dnn(ocl4dnn): cleanup dead code, improve logging
3 years ago
Alexander Alekhin
724e04e979
dnn(ocl4dnn): add extra checks to convolution layer
...
- prevent running code over unsupported/non-tested configurations
- prevent integer div by zero
3 years ago
Oliver Kuckertz
a3d7811f24
Merge pull request #20725 from mologie:fix-dnn-tf-on-arm
...
* dnn: fix unaligned memory access crash on armv7
The getTensorContent function would return a Mat pointing to some
member of a Protobuf-encoded message. Protobuf does not make any
alignment guarantees, which results in a crash on armv7 when loading
models while bit 2 is set in /proc/cpu/alignment (or the relevant
kernel feature for alignment compatibility is disabled). Any read
attempt from the previously unaligned data member would send SIGBUS.
As workaround, this commit makes an aligned copy via existing clone
functionality in getTensorContent. The unsafe copy=false option is
removed. Unfortunately, a rather crude hack in PReLUSubgraph in fact
writes(!) to the Protobuf message. We limit ourselves to fixing the
alignment issues in this commit, and add getTensorContentRefUnaligned
to cover the write case with a safe memcpy. A FIXME marks the issue.
* dnn: reduce amount of .clone() calls
* dnn: update FIXME comment
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
3 years ago
Alexander Alekhin
f977d10a19
dnn(ocl): fix conv DWCONV workgroup
3 years ago
Alexander Alekhin
846317ef37
dnn(ocl): fix conv BASIC workgroup
3 years ago
SamFC10
9c5d7716e2
fix for unsqueeze opset version 13
3 years ago
rogday
c410d7a97d
Merge pull request #20671 from rogday:yolov4x-mish
...
Add support for YOLOv4x-mish
* backport to 3.4 for supporting yolov4x-mish
* add YOLOv4x-mish test
* address review comments
Co-authored-by: Guo Xu <guoxu@1school.com.cn>
4 years ago
Alexander Alekhin
6e66a9222a
dnn(onnx): fix format specifier
4 years ago
Zihao Mu
51b03b87e6
BiasAdd could load Const from second place.
4 years ago
rogday
d31b93b513
Merge pull request #20674 from rogday:prelu_slope
...
Fix PReLU negative slope access pattern
* fix prelu negative slope access pattern
* change begin() to ptr()
4 years ago
rogday
4807cd8a6e
Merge pull request #20605 from rogday:split_slice_shenanigans
...
Add Normalize subgraph, fix Slice, Mul and Expand
* Add Normalize subgraph, support for starts<0 and axis<0 in Slice, Mul broadcasting in the middle and fix Expand's unsqueeze
* remove todos
* remove range-based for loop
* address review comments
* change >> to > > in template
* fix indexation
* fix expand that does nothing
4 years ago
Alexander Alekhin
35e824c287
dnn(ocl): fix out of bound access in GEMM-like kernels
...
- dropped usage of CreateSubBuffer() - buffers lifetime management issue
- fixed elementwise offset
- avoid out of bounds read access
4 years ago
Alexander Alekhin
5578ad5e14
dnn(ocl): fix automatic globalsize adjusting
...
- if kernel code doesn't support that
4 years ago
Alexander Alekhin
5b2c016834
dnn(ocl): avoid out of buffer access in copyWeightsSwizzled
4 years ago
Alexander Alekhin
407adc7061
dnn(ocl): fix buffer offsets in IDLF kernel
...
- drop CreateSubBuffer
- fix FUSED_CONV_ELTWISE mode
4 years ago
rogday
d0e612dc36
Merge pull request #20647 from rogday:resize_concat_optimization
...
Fix resize+concat optimization
* fix resize+concat optimization
* add comment and fix indentation
4 years ago
WJJ1995
edc442afdb
Merge pull request #20511 from wjj19950828:add_humanseg_support_0806
...
* support PPSeg model for dnn module
* fixed README for CI
* add test case
* fixed bug
* deal with comments
* rm dnn_model_runner
* update test case
* fixed bug for testcase
* update testcase
4 years ago
Alexander Alekhin
ae6fabc6fe
dnn(ocl): drop CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE check
...
- it is a hint and it should not block kernel execution
4 years ago
Vincent Rabaud
38d0063c36
Do not use deprecated ReleaseCleared in protobuf library.
...
This is to make code work with protobuf arenas for memory
management (ReleaseCleared is incompatible).
The cleaning of the memory is also simpler.
4 years ago
Alexander Alekhin
f28e4b86fb
dnn(ocl): fix top initialization in verifyResult
4 years ago
Vincent Rabaud
9cfa84313c
Use the one argument version of SetTotalBytesLimit.
...
The two argument versions has been deprecated, cf
https://developers.google.com/protocol-buffers/docs/reference/cpp/google.protobuf.io.coded_stream
4 years ago
Smirnov Egor
fe625a558e
fix hasDynamicShapes for batch_size and fix axis selection in Scale layer
4 years ago
Smirnov Egor
9ef41f68fb
fix Split partial sum
4 years ago
Julia Bareeva
cfb36443fb
Merge pull request #20506 from JulieBar:lstm_activations
...
* Support activations(Sigmoid, Tanh) for LSTM
* fix warning
4 years ago
Smirnov Egor
739ff84732
add Max layer to TFImporter
4 years ago
SamFC10
2a177052de
fix bug in prior-box variances
4 years ago
rogday
cff0168f3a
Merge pull request #20453 from rogday:onnx_importer_fix
...
Split layer dispatch into functions in ONNXImporter
* split layer dispatch into functions
* fixes
* identation and comment fixes
* fix constness
4 years ago
Julia Bareeva
4e5699fa71
Merge pull request #20450 from JulieBar:lstm_inside
...
Support non-zero hidden state for LSTM
* fully support non-zero hidden state for LSTM
* check dims of hidden state for LSTM
* fix failed test Test_Model.TextRecognition
* add new tests for LSTM w/ non-zero hidden params
Co-authored-by: Julie Bareeva <julia.bareeva@xperience.ai>
4 years ago
Smirnov Egor
024b43ca06
implement asymmetric padding for conv2d, max_pool and conv2d_backprop_input
4 years ago
SamFC10
96d35f7c54
Fix convolution asymmetric padding bug in onnx importer
4 years ago
Alexander Alekhin
fbde0c6c96
dnn(ie): fix handling of 1D and non-32F outputs of InferenceEngine
4 years ago
César Gouveia
167a12028d
Merge pull request #20374 from cesarpgouveia:bugfix/fix_load_onnxModel_debug
...
* Fix bug while loading onnx model in debug
* dnn: fix other .at using
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
4 years ago
mitruska
18dbac203f
Use explicit version of ngraph NormalizeL2
4 years ago
SamFC10
5b8c10f2f8
modified onnx importer to concat const input blobs
4 years ago
Alexander Alekhin
f2057ce1ab
dnn(ie): replace deprecated calls
4 years ago
Alexander Alekhin
7d842f5bcf
dnn: use OpenVINO 2021.4 defines
4 years ago
SamFC10
55e1dfb778
Fix BatchNorm reinitialization
4 years ago
rogday
7ee1816612
split if into map of functions
4 years ago
Alexander Alekhin
71bae7c23f
dnn(ie): implicit usage of IE::GPU OpenCL kernels cache
4 years ago
Aleksandr Voron
2e143b8799
Merge pull request #19961 from alvoron:dnn_ngraph_int64_fix
...
Explicit usage of int64_t in CropAndResizeLayer (IE backend)
* Update crop_and_resize_layer.cpp
4 years ago
Anastasia Murzova
cc6d48959e
Added reduce sum by channel support
4 years ago
Vitaly Tuzov
aab62aa6dd
Merge pull request #18952 from terfendail:wui_doc
...
* Updated UI documentation to address WUI
* Added documentation for vx_ calls
* Removed vx_store operation overload
* Doxyfile updated to enable wide UI
* Enable doxygen documentation for vx_ WUI functions
* Wide intrinsics definition rework
* core: fix SIMD C++ emulator build (supports 128-bit only)
4 years ago
Anastasia M
3e48a91d97
Merge pull request #19546 from LupusSanctus:am/slice_steps
...
* Added Steps support in DNN Slice layer
* Added code corrections
* dnn(slice): fix OCL and OCL_FP16 processing
4 years ago
Alexander Alekhin
86d0a86141
dnn(ocl): fix gemm kernel scheduling
4 years ago
Alexander Alekhin
56bdd7db5c
dnn: use OpenVINO 2021.3 defines
...
original commit: 6291503793
4 years ago
Anastasia Murzova
e75f1b071b
Added reshape corrections
4 years ago