Alexander Alekhin
8b4fa2605e
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
3 years ago
Anna Khakimova
c3910807c5
Merge pull request #21177 from anna-khakimova:ak/simd_mulc
...
* GAPI Fluid: SIMD for MulC kernel.
* Changes for MulDouble kernel.
3 years ago
Sergey Ivanov
c5b8b5687f
Merge pull request #21041 from sivanov-work:gin_gout_concept
...
G-API: GAPI introduce compile guard for some types for gin/gout params passing
* Initial for taged solution
* Move out tags to gtags.hpp & add protection for own::Mat
* Add compile guard to proper place
* Fix MACRO concat
* Add unit tests
* Remove class MACRO injection due to Python3
* Revert back unproper changes
* Apply comments: reuse shape from traits
* Throw away unused gtags
* Apply comments
* Handle own::*
* Fix test
* Fix test(1)
* Fix unix build
* Try on type list
* Apply comments
* Apply comments
* Fix warning
3 years ago
Alexander Alekhin
f61883b227
Merge pull request #21165 from utibenkei:fix_android_ndk_camera_order_of_u_and_v
3 years ago
Alexander Alekhin
35ff9af6ce
Merge pull request #21162 from rogday:softmax_simplification
3 years ago
Alexander Alekhin
dad2b9aac8
Merge pull request #21160 from rogday:elu_alpha
3 years ago
rogday
1613d30544
Merge pull request #21159 from rogday:ceil_mode
...
fix ceil_mode for Average/MaxPooling
* fix ceil_mode
* add a comment
3 years ago
Alexander Alekhin
b6df9debaf
Merge pull request #21174 from APrigarina:fix_qr_encoder
3 years ago
Alexander Alekhin
b9d0dc60b0
Merge pull request #21173 from alalek:3.4_dnn_test_reenable_ov_2021_4
3 years ago
APrigarina
37b1876807
qr encoder: fix memory and unused variables issues
3 years ago
Alexander Alekhin
d206350738
Merge pull request #21172 from alalek:dnn_test_drop_non_cpu_int8
3 years ago
Jong Sin Kim
2da1f9181a
Merge pull request #21170 from JJJoonngg:4.x
...
Check buffer size when frameWidth * frameHeight bigger than allocated buffer size
3 years ago
Alexander Alekhin
bd396e1fd5
dnn(test): re-enable tests which works with OpenVINO 2021.4.x (3.4)
3 years ago
Alexander Alekhin
f55c9ed1ba
dnn(test): drop non OCV/CPU cases for Int8
...
- zero code coverage and up to x3-x8 tests slowdown
- implementation executes OCV/CPU in all cases
- wrong skip conditions
3 years ago
Alexander Alekhin
5da69c0b9a
Merge pull request #21164 from rogday:sum_identity
3 years ago
Alexander Alekhin
a806e8cc58
Merge pull request #21163 from rogday:transpose_default
3 years ago
Anna Khakimova
369b260e12
Merge pull request #21158 from anna-khakimova:ak/simd_subC
...
* GAPI Fluid: SIMD for SubC kernel.
* Applied comments
3 years ago
Alexander Alekhin
d9e7c1626a
Merge pull request #21153 from alalek:build_warnings_msvs2017
3 years ago
Vincent Rabaud
1a1a7bbbfd
Merge pull request #21112 from vrabaud:3.4_luv_overflow
...
* Fix integer overflow in cv::Luv2RGBinteger::process.
For LL=49, uu=205, vv=23, we end up with x=7373056 and y=458
which overflows y*x.
* imgproc(test): adjust test parameters to cover SIMD code
3 years ago
Smirnov Egor
33e97e994d
add sum of 1 input
3 years ago
Smirnov Egor
11e6848bb9
add default order to transpose
3 years ago
Smirnov Egor
829410729c
add new (Log)SoftMax simplification passes
3 years ago
Smirnov Egor
0e2a3686c0
add alpha parameter to ELU layer
3 years ago
Alexander Alekhin
66b2140892
build: eliminate C4309 warning from protobuf files with MSVS2017
3 years ago
Alexander Alekhin
0d2857a242
Merge pull request #21152 from rogday:fix_defaults
3 years ago
Alexander Alekhin
17d99e6266
Merge pull request #21142 from alalek:dnn_two_inputs_ocl_fp16_3.4
3 years ago
Andrew Ryrie
ea7d4be3f8
Merge pull request #20658 from smbz:lstm_optimisation
...
* dnn: LSTM optimisation
This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm.
fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications:
- Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned.
- Allow for weight matrices where the number of columns is not a multiple of 8.
I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on.
* Fix warning about initialisation order
* Remove C++11 syntax
* Fix build when AVX(2) is not available
In this case the CV_TRY_X macros are defined to 0, rather than being undefined.
* Minor changes as requested:
- Don't check hardware support for AVX(2) when dispatch is disabled for these
- Add braces
* Fix out-of-bounds access in fully connected layer
The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway).
This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems.
* Improve tail mask handling
- Use static array for generating tail masks (as requested)
- Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs
* Revert whitespace change
* Improve readability of conditions for using AVX
* dnn(lstm): minor coding style changes, replaced left aligned load
3 years ago
Smirnov Egor
05db8784ae
fix Clip, LeakyReLU, LRN, Split defaults
3 years ago
utibenkei
68667d6057
fix_android_ndk_camera_order_of_u_and_v
3 years ago
Anna Khakimova
d58b5ef74b
Merge pull request #21119 from anna-khakimova:ak/simd_addc
...
* GAPI Fluid: SIMD for AddC kernel
* Final version
* Applied comments.
3 years ago
rogday
f044037ec5
Merge pull request #20733 from rogday:argmaxnd
...
Implement ArgMax and ArgMin
* add reduceArgMax and reduceArgMin
* fix review comments
* address review concerns
3 years ago
Suleyman TURKMEN
a97f21ba4e
Merge pull request #20957 from sturkmen72:update-documentation
...
Update documentation
* Update DNN-based Face Detection And Recognition tutorial
* samples(dnn/face): update face_detect.cpp
* final changes
Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>
3 years ago
Supernovae
b594ed99b8
Merge pull request #20933 from shubham-shahh:master
...
Improved overall readability of the code
* grid_nms.cu: minor fix-ups
* Update grid_stride_range.hpp
* Update tf_importer.cpp
3 years ago
Alexander Alekhin
58b06222ff
dnn(DataLayer): fix CPU/OpenCL code paths for FP16 handling
3 years ago
Alexander Alekhin
58dc397930
dnn(test): add two_inputs test with FP32/U8 data types
...
- remove similar test from IE scope under HAVE_INF_ENGINE
3 years ago
Alexander Alekhin
15073d63d9
Merge pull request #21126 from alalek:dnn_test_ie_update_4.x
3 years ago
yuki takehara
a6277370ca
Merge pull request #21107 from take1014:remove_assert_21038
...
resolves #21038
* remove C assert
* revert C header
* fix several points in review
* fix test_ds.cpp
3 years ago
Alexander Alekhin
31b2d6be75
dnn(test): update InferenceEngine tests (4.x)
3 years ago
Alexander Alekhin
57ee14d62d
Merge remote-tracking branch 'upstream/3.4' into merge-3.4
3 years ago
Alexander Alekhin
d470cfe86e
Merge pull request #21134 from alalek:gapi_fix_build_ie_msvs2015
3 years ago
Alexander Alekhin
b55d8f46f4
Merge pull request #21133 from alalek:dnn_test_ie_update_3.4
3 years ago
Alexander Alekhin
c15218e37a
gapi: fix build with MSVS2015 (with IE)
3 years ago
Alexander Alekhin
985aa0423d
dnn(test): update InferenceEngine tests
3 years ago
Alexander Alekhin
c14a8dce93
Merge pull request #21131 from cclauss:codespell
3 years ago
Alexander Alekhin
f5d45221ca
Merge pull request #21130 from cclauss:print-function
3 years ago
Christian Clauss
d668aa7c24
Merge pull request #21121 from cclauss:patch-2
...
* GitHub Action to lint Python code
* Move from automatic events to manual ones
* flake8: Do not look for undefined names
3 years ago
Orest Chura
e20fe421e7
Merge pull request #21103 from OrestChura:oc/fix_1D_Mat_RMat_View_issue
...
[G-API] Fix issue of getting 1D Mat out of RMat::View
* Fix issue of getting 1D Mat out of RMat::View
- added test
- fixed for standalone too (removed Assert(dims.empty()))
* Fixed asVeiw() function for standalone
* Put more detailed comment
3 years ago
Orest Chura
2deb38d615
Merge pull request #21083 from OrestChura:oc/fix_coverity_vino_issues
...
[G-API] Fixed Coverity issues
* Fixed Coverity issues
- VectorRef&OpaqueRef m_kind = CV_UNKNOWN
- added same-type overload for saturate()
- sanitized resize value in ByteMemoryInStream::operator>> (std::string& str)
- handled throws from ~GStreamingExecutor()
* Catching exception by const ref
* Addressing Sergey's comments
* Applied enable_if semanitcs to saturate(x, round) too
* Removed uncaught_exception, made destructor noexcept back
* Split Fluid ConvertTo to multiple functions to avoid ifs; added CV_ALWAYS_INLINE
* Added FIXME to address throwings from stop()
* Fix standalone
* Addressing comments
* Guarded SIMD optimizations properly
* Removed excess parameter from simd_impl functions
3 years ago
Maxim Pashchenkov
b95d71af2b
Merge pull request #21106 from mpashchenkov:mp/ocv-gapi-clean-samples
...
G-API: Cleaning samples
* parseSSD + removed render details from gcpukernel
* self-rev
* Applying comment
* Added operators
* warnings
3 years ago
Christian Clauss
ebe4ca6b60
Fix typos discovered by codespell
3 years ago