Anna Khakimova
beb14c70da
GAPI Fluid: SIMD optimization for sep filters 5x5 kernel size (gaussBlur)
5 years ago
Anton Potapov
b5426a8679
G-API (Fluid) core support for I420
...
- Extended NV12 support in Fluid Core engine to cover I420
5 years ago
Anna Khakimova
363976694e
GAPI Fluid: The run_sepfilter() has logic error into handler for 5x5 and larger kernels
5 years ago
Maksim Shabunin
fccf284088
Fixed relative paths handling in cap_gstreamer:
5 years ago
Lubov Batanina
cfc781949d
Merge pull request #15811 from l-bat:eltwise_div
...
Supported ONNX Squeeze, ReduceL2 and Eltwise::DIV
* Support eltwise div
* Fix test
* OpenCL support added
* refactoring
* fix code style
* Only squeeze with axes supported
5 years ago
Sebastián Gurín
dd9262c318
expose FS
5 years ago
Alexander Alekhin
f42d5399aa
core(persistence): add more checks for implementation limitations
5 years ago
Maksim Shabunin
ad5874779c
Install: added missing gapi headers
5 years ago
Dimitri Gerin
7c4158d8c2
Fix dnn::getLayerInputs
5 years ago
czgdp1807
07ef08e966
removed typo
5 years ago
Igor Murzov
cdbfdcc363
Fix OpenCL device detection when some OpenCL platform has no devices
...
It's not an error if some OpenCL platform has no devices. This makes
OpenCL device detection work correctly in the following scenario:
$ OPENCV_OPENCL_DEVICE=:GPU: ./opencv_test_dnn
OpenCV version: 4.1.2-dev
OpenCV VCS version: 4.1.2-80-g467748ee98-dirty
Build type: Debug
Compiler: /usr/bin/g++ (ver 7.4.0)
Parallel framework: pthreads
CPU features: SSE SSE2 SSE3 *SSE4.1 *SSE4.2 *FP16 *AVX *AVX2 *AVX512-SKX?
Intel(R) IPP version: ippIP AVX2 (l9) 2019.0.0 Gold (-) Jul 24 2018
OpenCL Platforms:
AMD Accelerated Parallel Processing
Portable Computing Language
CPU: pthread-AMD Ryzen 7 2700X Eight-Core Processor (OpenCL 1.2 pocl HSTR: pthread-x86_64-pc-linux-gnu-znver1)
NVIDIA CUDA
dGPU: GeForce GTX 1080 (OpenCL 1.2 CUDA)
Current OpenCL device:
Type = dGPU
Name = GeForce GTX 1080
Version = OpenCL 1.2 CUDA
Driver version = 430.26
5 years ago
Chip Kerchner
2112aa31e6
Merge pull request #15828 from ChipKerchner:momentsToHal
...
* Convert moments in tile algorithms to HAL (1.3x faster for VSX).
* Adding NEON code back in for non 64-bit platforms.
* Remove floats from post processing.
5 years ago
TH3CHARLie
2c2716de0f
core(test): add test for YAML parse multiple documents
...
- added removal of temporary file
5 years ago
Alexander Alekhin
dcf72e49e2
core(persistence): fix processing of multiple documents
5 years ago
Igor Murzov
6d5b900324
Simplify OpenCL info dumping code:
...
* Reduce code nesting
* Drop redundant .c_str() calls
5 years ago
TH3CHARLie
a165f55579
Merge pull request #15842 from TH3CHARLie:yaml-fix
...
* fix yaml parse
* add test for YAML parse multiple documents
* remove trailing whitespace in test
5 years ago
Oleg Alexandrov
53139e6ebe
Merge pull request #15838 from oleg-alexandrov:patch-2
...
Correct stereoRectify documentation
5 years ago
berak
b7c8e9e874
python: fix type error msg
5 years ago
Gael Colas
e65b51ca3c
Merge pull request #15821 from ColasGael:colasg-viz-color
...
Fix wrong definition of viz::Color::navy()
5 years ago
Oleg Alexandrov
d56535afce
Merge pull request #15820 from oleg-alexandrov:patch-1
...
Clarify stereoRectify() doc
The function stereoRectify() takes as input a coordinate transform between two cameras. It is ambiguous how it goes. I clarified that it goes from the second camera to the first.
5 years ago
yuriyluxriot
4e156a162f
Merge pull request #15812 from yuriyluxriot:fls_replaces_tls
...
* Use FlsAlloc/FlsFree/FlsGetValue/FlsSetValue instead of TlsAlloc/TlsFree/TlsGetValue/TlsSetValue to implment TLS value cleanup when thread has been terminated on Windows Vista and above
* Fix 32-bit build
* Fixed calling convention of cleanup callback
* WINAPI changed to NTAPI
* Use proper guard macro
5 years ago
Chip Kerchner
ed7e4273cd
Merge pull request #15555 from ChipKerchner:flipVectorize
...
* Vectorize flipHoriz and flipVert functions.
* Change v_load_mirror_1 to use vec_revb for VSX
* Only use vec_revb in ISA3.0
* Removing vec_revb code since some of the older compilers don't fully support it.
* Use new v_reverse intrinsic and cleanup code.
* Ensure there are no alignment issues with copies
5 years ago
Alexander Alekhin
48073e6d95
pylint: eliminate warnings
5 years ago
Alexander Alekhin
657c17bb8c
cmake: fix ITT define condition
5 years ago
Ciprian Alexandru Pitis
d2e02779c4
Merge pull request #15799 from Cpitis:feature/parallelization
...
Parallelize pyrDown & calcSharrDeriv
* ::pyrDown has been parallelized
* CalcSharrDeriv parallelized
* Fixed whitespace
* Set granularity based on amount of threads enabled
* Granularity changed to cv::getNumThreads, now each thread should receive 1/n sized stripes
* imgproc: move PyrDownInvoker<CastOp>::operator() implementation
* imgproc(pyramid): remove syloopboundary()
* video: SharrDerivInvoker replace 'Mat*' => 'Mat&' fields
5 years ago
Oleg Alexandrov
af433d0352
Merge pull request #15780 from oleg-alexandrov:master
...
* Doc bugfix
The documentation page StereoBinaryBM and StereoBinarySGBM says that it returns a disparity that is scaled multiplied by 16. This scaling must be undone before calling reprojectImageTo3D, otherwise the results are wrong. The function reprojectImageTo3D() could do this scaling internally, maybe, but at least the documentation must explain that this has to be done.
* calib3d: update reprojectImageTo3D documentation
* calib3d: add StereoBM/StereoSGBM into notes list
5 years ago
Dizhenin Vlad
edc5518f68
Merge pull request #15608 from SimpleVlad:3.4
...
* Add flags for build js
* Add poi.json
* Rebase whitelist into JSON file
* Rework generator of white_list
* Fix small typos
* Transfer opencv_js.josn in opencv_js.config.py
* Edit OPENCV_JS_WHITELIST
* Write comment
* Add description
* Fix typos in desc
* flag's append deleeted
* Fix whitespace
* variable deleted
* fix comment on lines 229 and 235
5 years ago
Alexander Alekhin
79f792ad05
ts: do not block reporting of launched "DISABLED_" tests
...
If tests are run through GTest option `--gtest_also_run_disabled_tests`
5 years ago
Anna Khakimova
c394847c35
GAPI Fluid: Dynamic window size
5 years ago
atalaman
ea64bb58a5
Merge pull request #15751 from TolyaTalamanov:at/refactor-render-tests
...
* Refactor render tests
* Fix comment to review
* Move ocv render tests stuff to specific file
* Add OCV prefix for render tests
* Fix comments to review
5 years ago
Igor Murzov
a9d23a6479
Fix wording in some tutorials
5 years ago
Chip Kerchner
a71ff50130
Merge pull request #15623 from ChipKerchner:optimizeHOGpipeline
...
* Use circular lut hustory buffer in computeGradient of HOG
* Initialize prefetch data outside main loop. Avoid code duplication.
5 years ago
Alexander Alekhin
cd6d79d106
gapi: fix opencv_world build
5 years ago
Alexander Alekhin
80c4cedd25
android: use .getRowStride() in JavaCamera2View
5 years ago
André Lippok
86a8ff6129
Fixed typo in assertion
5 years ago
Alexander Alekhin
6ec5ae0215
core(trace): add ITT control parameter
...
- OPENCV_TRACE_ITT_ENABLE
5 years ago
Dmitry Budnikov
8dfba51884
add empty implementation
5 years ago
Vitaly Tuzov
42b1d04999
StereoSGBM algorithm updated to use wide universal intrinsics
5 years ago
Vitaly Tuzov
1ce5a724c7
Fixed StereoBM uniqueness check
5 years ago
Alexander Alekhin
17e2bf5717
core(tls): implement releasing of TLS on thread termination
...
- move TLS & instrumentation code out of core/utility.hpp
- (*) TLSData lost .gather() method (to dispose thread data on thread termination)
- use TLSDataAccumulator for reliable collecting of thread data
- prefer using of .detachData() + .cleanupDetachedData() instead of .gather() method
(*) API is broken: replace TLSData => TLSDataAccumulator if gather required
(objects disposal on threads termination is not available in accumulator mode)
5 years ago
ChipKerchner
c46f119e0e
Convert demosaic functions to HAL
5 years ago
Anna Khakimova
8bf5bed0a9
GAPI:Fix for using cv makes G_TYPED_KERNEL fail
5 years ago
Ruslan Garnov
e89c1103ff
Returned GModel::mkDataNode() overload for external backends
5 years ago
Chip Kerchner
5a6a49405d
Merge pull request #15738 from ChipKerchner:bugInt64x2Comparison
...
Fixing bug with comparison of v_int64x2 or v_uint64x2
* Casting v_uint64x2 to v_float64x2 and comparing does NOT work in all cases. Rewrite using epi64 instructions - faster too.
* Fix bad merge.
* Fix equal comparsion for non-SSE4.1. Add test cases for v_int64x2 comparisons.
* Try to fix merge conflict.
* Only test v_int64x2 comparisons if CV_SIMD_64F
* Fix compiler warning.
5 years ago
Alexander Alekhin
fe4f27b99b
gapi: fix build
...
- gcc 4.8.4 (ARMv7)
5 years ago
anton-potapov
471b40040a
Merge pull request #15735 from anton-potapov:gapi_async_documentaion
...
* G-API: Doxygen documentatation for Async API
* G-API: Doxygen documentatation for Async API
- renamed local variable (reading parameter async) async ->
asyncNumReq in object_detection DNN sample
to avoid Doxygen erroneous linking the sample to cv::gapi::wip::async
documentation
5 years ago
Dmitry Kurtaev
dfe0368835
Fix custom IE layers in case of no MKLDNN plugin
5 years ago
Yashas Samaga B L
613c12e590
Merge pull request #14827 from YashasSamaga:cuda4dnn-csl-low
...
CUDA backend for the DNN module
* stub cuda4dnn design
* minor fixes for tests and doxygen
* add csl public api directory to module headers
* add low-level CSL components
* add high-level CSL components
* integrate csl::Tensor into backbone code
* switch to CPU iff unsupported; otherwise, fail on error
* add fully connected layer
* add softmax layer
* add activation layers
* support arbitary rank TensorDescriptor
* pass input wrappers to `initCUDA()`
* add 1d/2d/3d-convolution
* add pooling layer
* reorganize and refactor code
* fixes for gcc, clang and doxygen; remove cxx14/17 code
* add blank_layer
* add LRN layer
* add rounding modes for pooling layer
* split tensor.hpp into tensor.hpp and tensor_ops.hpp
* add concat layer
* add scale layer
* add batch normalization layer
* split math.cu into activations.cu and math.hpp
* add eltwise layer
* add flatten layer
* add tensor transform api
* add asymmetric padding support for convolution layer
* add reshape layer
* fix rebase issues
* add permute layer
* add padding support for concat layer
* refactor and reorganize code
* add normalize layer
* optimize bias addition in scale layer
* add prior box layer
* fix and optimize normalize layer
* add asymmetric padding support for pooling layer
* add event API
* improve pooling performance for some padding scenarios
* avoid over-allocation of compute resources to kernels
* improve prior box performance
* enable layer fusion
* add const layer
* add resize layer
* add slice layer
* add padding layer
* add deconvolution layer
* fix channelwise ReLU initialization
* add vector traits
* add vectorized versions of relu, clipped_relu, power
* add vectorized concat kernels
* improve concat_with_offsets performance
* vectorize scale and bias kernels
* add support for multi-billion element tensors
* vectorize prior box kernels
* fix address alignment check
* improve bias addition performance of conv/deconv/fc layers
* restructure code for supporting multiple targets
* add DNN_TARGET_CUDA_FP64
* add DNN_TARGET_FP16
* improve vectorization
* add region layer
* improve tensor API, add dynamic ranks
1. use ManagedPtr instead of a Tensor in backend wrapper
2. add new methods to tensor classes
- size_range: computes the combined size of for a given axis range
- tensor span/view can be constructed from a raw pointer and shape
3. the tensor classes can change their rank at runtime (previously rank was fixed at compile-time)
4. remove device code from tensor classes (as they are unused)
5. enforce strict conditions on tensor class APIs to improve debugging ability
* fix parametric relu activation
* add squeeze/unsqueeze tensor API
* add reorg layer
* optimize permute and enable 2d permute
* enable 1d and 2d slice
* add split layer
* add shuffle channel layer
* allow tensors of different ranks in reshape primitive
* patch SliceOp to allow Crop Layer
* allow extra shape inputs in reshape layer
* use `std::move_backward` instead of `std::move` for insert in resizable_static_array
* improve workspace management
* add spatial LRN
* add nms (cpu) to region layer
* add max pooling with argmax ( and a fix to limits.hpp)
* add max unpooling layer
* rename DNN_TARGET_CUDA_FP32 to DNN_TARGET_CUDA
* update supportBackend to be more rigorous
* remove stray include from preventing non-cuda build
* include op_cuda.hpp outside condition #if
* refactoring, fixes and many optimizations
* drop DNN_TARGET_CUDA_FP64
* fix gcc errors
* increase max. tensor rank limit to six
* add Interp layer
* drop custom layers; use BackendNode
* vectorize activation kernels
* fixes for gcc
* remove wrong assertion
* fix broken assertion in unpooling primitive
* fix build errors in non-CUDA build
* completely remove workspace from public API
* fix permute layer
* enable accuracy and perf. tests for DNN_TARGET_CUDA
* add asynchronous forward
* vectorize eltwise ops
* vectorize fill kernel
* fixes for gcc
* remove CSL headers from public API
* remove csl header source group from cmake
* update min. cudnn version in cmake
* add numerically stable FP32 log1pexp
* refactor code
* add FP16 specialization to cudnn based tensor addition
* vectorize scale1 and bias1 + minor refactoring
* fix doxygen build
* fix invalid alignment assertion
* clear backend wrappers before allocateLayers
* ignore memory lock failures
* do not allocate internal blobs
* integrate NVTX
* add numerically stable half precision log1pexp
* fix indentation, following coding style, improve docs
* remove accidental modification of IE code
* Revert "add asynchronous forward"
This reverts commit 1154b9da9da07e9b52f8a81bdcea48cf31c56f70.
* [cmake] throw error for unsupported CC versions
* fix rebase issues
* add more docs, refactor code, fix bugs
* minor refactoring and fixes
* resolve warnings/errors from clang
* remove haveCUDA() checks from supportBackend()
* remove NVTX integration
* changes based on review comments
* avoid exception when no CUDA device is present
* add color code for CUDA in Net::dump
5 years ago
Steve Nicholson
acb3b3bd4d
Add documentation and example program for intersectConvexConvex
5 years ago
TH3CHARLie
da0fec7308
fix incorrect imshow behavior
5 years ago