Vadim Pisarevsky
77b01deb80
Merge pull request #17858 from vpisarev:dnn_depthwise_conv
...
* added depth-wise convolution; gives ~20-30% performance improvement in MobileSSD networks
* hopefully, eliminated compile warnings, errors, as well as failure in one test
* * fixed a few typos
* decreased buffer size in some cases
* added more optimal im2row branch in the case of 1x1 convolutions
* tuned fastConv to reduce the number of passes over arrays
4 years ago
Yashas Samaga B L
f53f491cd2
Merge pull request #17939 from YashasSamaga:cuda4dnn-fix-eltwise-fusion
...
* fix eltwise fusion segfault, more eltwise fusions, fix power fusion
* add assertion
4 years ago
YashasSamaga
ae293f27cf
add DetectionOutputOp
4 years ago
Sinitsina
0ac2f0e04c
mish_functor_update
4 years ago
Ilya Churaev
aaf65bb3af
Fixed removing is_parameter, is_constant, is_output
4 years ago
Ilya Churaev
0291dd3ef8
Replaced copy_with_new_args to clone_with_new_inputs
4 years ago
Liubov Batanina
a35d4f9029
Support Gather for variable inputs
4 years ago
Ilya Churaev
a46ca98c57
Revert "Fixed removing is_parameter, is_constant, is_output"
4 years ago
Ilya Churaev
33db2ac928
Fixed removing is_parameter, is_constant, is_output
4 years ago
Alexander Alekhin
971cc3354b
dnn(test): adjust tests for OpenVINO 2020.4 (4.x branch)
4 years ago
Alexander Alekhin
81e027eef7
dnn: fix OpenCL implementation of Slice layer
4 years ago
Alexander Alekhin
1c371d07b5
dnn(test): adjust tests for OpenVINO 2020.4
4 years ago
Alexander Alekhin
55e8549839
dnn: eliminate IE deprecation warning
4 years ago
Alexander Alekhin
435b6df989
dnn: use OpenVINO 2020.4 defines
...
original commit: 2813aa7eb9
4 years ago
Dmitry Kurtaev
cc584760d3
Fix TensorFlow->ONNX imports
4 years ago
YashasSamaga
1949056423
improved diagnostics for build issues
4 years ago
YashasSamaga
ead1dcf308
error if cuda4dnn depends are not resolved
4 years ago
Alexander Alekhin
5cb8619eca
dnn(ie): enable KEY_CPU_THREADS_NUM for Windows
4 years ago
YashasSamaga
37e2afb8f6
fix concat fusion for cuda4dnn
4 years ago
Yashas Samaga B L
d0e6d2438c
Merge pull request #17363 from YashasSamaga:cuda4dnn-eltwise-fusion2
...
cuda4dnn(conv): fuse eltwise with convolutions
* fuse eltwise with convolutions
* manually rebase to avoid bad git merge
4 years ago
Alexander Alekhin
d5713c657b
dnn(slice): disable buggy OCV/OCL implementation
4 years ago
Alexander Alekhin
99c4b76a6d
dnn(test): add YOLOv4-tiny tests
4 years ago
Ilya Churaev
d69a7a3bbf
Fixed header paths for some nGraph ops
...
* Added dependency on IE version
backport of commit: 992c908b56
4 years ago
Ilya Churaev
992c908b56
Merge pull request #17756 from ilyachur:feature/ichuraev/fix_ngraph_headers
...
* Fixed header paths for some nGraph ops
* Added dependency on IE version
4 years ago
YashasSamaga
cbdaa93e54
reduce slice, concat to copy; enable more concat fusions
4 years ago
YashasSamaga
b7eec216e9
generalize axis for concat fusion
4 years ago
YashasSamaga
4988e131fd
transfer output blobs in background
4 years ago
Liubov Batanina
65dbbf712d
Merge pull request #17733 from l-bat:tiny_yolov4
...
* Supported yolov4-tiny
* Added comments
4 years ago
YashasSamaga
62a63021c7
add cuDNN 8 support
4 years ago
cyy
206c843f36
Merge pull request #17499 from cyyever:fix_CUDA11
...
Fix cuda11
* use cudnn_version.h to detect version when it is available
* remove nppi from CUDA11
* use ocv_list_filterout
* dnn(cuda): temporary disable CUDNN 8.0
4 years ago
Ilya Lavrenov
e58ce89b10
Conditional compilation for IR v7 support
...
backported commit 86905754e4
4 years ago
Ilya Lavrenov
86905754e4
Conditional compilation for IR v7 support
4 years ago
Dmitry Kurtaev
3bdd7bfbea
Remove deprecated Inference Engine CPU extensions
4 years ago
Ilya Lavrenov
ad5e70f94e
Conditional compilation for network reader
...
origibal commit: 63e92cccf2
4 years ago
Dmitry Kurtaev
8e3f5fb209
Remove deprecated Inference Engine CPU extensions
4 years ago
Dmitry Kurtaev
1491934d17
Optimize Mish for CPU backend
4 years ago
YashasSamaga
6573b9ace0
use fp32 mish for fp16 mish
4 years ago
Ilya Lavrenov
63e92cccf2
Conditional compilation for network reader
4 years ago
Liubov Batanina
85c0c8c7ed
Disabling dafault NMS in yolo layer
5 years ago
Yashas Samaga B L
9ba5581d17
Merge pull request #17534 from YashasSamaga:cuda4dnn-remove-unused-funcs
...
cuda4dnn: reduce CUDA version requirements to at least CUDA 9.2
* remove half2 specializations
* do not remove atomicAdd for half in CUDA 10 and below
* remove fp16.hpp
5 years ago
Liubov Batanina
d93b6be3cc
Changed StridedSlice to VariadicSplit in Region layer
5 years ago
Ilya Lavrenov
676b818d6a
Removed plugin dispatcher
...
backport of commit 74113737f3
5 years ago
Ilya Lavrenov
74113737f3
Removed plugin dispatcher
5 years ago
YashasSamaga
87ab4ee567
improve mish performance and accuracy
5 years ago
Ilya Lavrenov
9697e3ac24
Removed error lisneter usage
5 years ago
Alexander Alekhin
d2a9efd039
dnn: use OpenVINO 2020.3 defines
5 years ago
YashasSamaga
265acccd56
allow multiple inputs to resize, fix tests
5 years ago
Maksim Shabunin
9096b1c768
dnn/NGraph: added nullptr checks
5 years ago
Maksim Shabunin
7ce518106b
dnn/NGraph: added nullptr checks
5 years ago
Alexander Alekhin
5f3012fc9a
pre: OpenCV 4.4.0 (version++)
5 years ago