Vitaly Tuzov
3b015dfc7d
Merge pull request #14210 from terfendail:wui_512
...
AVX512 wide universal intrinsics (#14210 )
* Added implementation of 512-bit wide universal intrinsics(WIP)
* Added implementation of 512-bit wide universal intrinsics: implemented WUI vector types(WIP)
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented load/store
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented fp16 load/store
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented recombine and zip, implemented non-saturating and saturating arithmetics
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented bit operations
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented comparisons
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented lane shifts and reduction
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented absolute values
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented rounding and cast to float
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented LUT
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented type extension/narrowing and matrix operations
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented load_deinterleave for 2 and 3 channels images
* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented load_deinterleave for 2- and implemented for 4-channel images
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented store_interleave
* Added implementation of 512-bit wide universal intrinsics(WIP): implemented signmask and checks
* Added implementation of 512-bit wide universal intrinsics(WIP): build fixes
* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented popcount in case AVX512_BITALG is unavailable
* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented zip
* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented rotate for s8 and s16
* Added implementation of 512-bit wide universal intrinsics(WIP): reimplemented interleave/deinterleave for s8 and s16
* Added implementation of 512-bit wide universal intrinsics(WIP): updated v512_set macros
* Added implementation of 512-bit wide universal intrinsics(WIP): fix for GCC wrong _mm512_abs_pd definition
* Added implementation of 512-bit wide universal intrinsics(WIP): reworked v_zip to avoid AVX512_VBMI intrinsics
* Added implementation of 512-bit wide universal intrinsics(WIP): reworked v_invsqrt to avoid AVX512_ER intrinsics
* Added implementation of 512-bit wide universal intrinsics(WIP): reworked v_rotate, v_popcount and interleave/deinterleave for U8 to avoid AVX512_VBMI intrinsics
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed integral image SIMD part
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed warnings
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed load_deinterleave for u8 and u16
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed v_invsqrt accuracy for f64
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed interleave/deinterleave for u32 and u64
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed interleave_pairs, interleave_quads and pack_triplets
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed rotate_left
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed rotate_left/right, part 2
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed 512-wide universal intrinsics based resize
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed findContours by avoiding use of uint64 dependent 512-wide v_signmask()
* Added implementation of 512-bit wide universal intrinsics(WIP): fixed trailing whitespaces
* Added implementation of 512-bit wide universal intrinsics(WIP): reworked specific intrinsic sets dependent parts to check availability of intrinsics based on CPU feature group defines
* Added implementation of 512-bit wide universal intrinsics(WIP):Updated AVX512 implementation of v_popcount to avoid AVX512VPOPCNTDQ intrinsics if unavailable.
* Added implementation of 512-bit wide universal intrinsics(WIP): Fixed universal intrinsics data initialisation, v_mul_wrap, v_floor, v_ceil and v_signmask.
* Added implementation of 512-bit wide universal intrinsics(WIP): Removed hasSIMD512()
* Added implementation of 512-bit wide universal intrinsics(WIP): Fixes for gcc build
* Added implementation of 512-bit wide universal intrinsics(WIP): Reworked v_signmask, v_check_any() and v_check_all() implementation.
6 years ago
Dmitry Kurtaev
9c0af1f675
Enable more deconvolution layer configurations with IE backend
6 years ago
Vitaly Tuzov
723165f878
fix for AVX2 version of v_reduce_min intrinsic
6 years ago
Lubov Batanina
f94253b82b
Merge pull request #14627 from l-bat:demo_kinetics
...
* Support 3D ResNet-34-kinetics
* Update sample
* Remove preprocess
* Change test
* Fix sample
6 years ago
Alexander Alekhin
52548bde05
dnn(test): replace file content reading
6 years ago
Dmitry Kurtaev
081d9bc73f
Fix Identity Switch from Keras
6 years ago
Ahmed Ashour
ca8a1d2cff
java: generated code inline return
6 years ago
Alexander Alekhin
38a3c1ce6b
dnn(test): update test tags for Debug build
6 years ago
LaurentBerger
a3be8d7cd7
dead link
6 years ago
Alexey Nikolaev
05563f5bc4
Merge pull request #14592 from aleksey-nikolaev:master
...
VideoCapture Direct Show. Added getter for CV_CAP_PROP_CONVERT_RGB, CAP_PROP_CHANNEL properties. (#14592 )
* Added getter for CV_CAP_PROP_CONVERT_RGB, CAP_PROP_CHANNEL properties.
Some refactoring.
* One space less.
6 years ago
Lubov Batanina
61d3222a22
Merge pull request #14537 from l-bat:fix_network_vizualizer
...
Fix dnn visualizer (#14537 )
* Fixed print layer params
* Fix print
6 years ago
Dmitry Kurtaev
44d21e5a79
Enable Slice layer on Inference Engine backend
6 years ago
Rostislav Vasilikhin
8c698262ea
rgb2hls_b: out of bounds read fixed
6 years ago
Rostislav Vasilikhin
791ebd05fc
out of bounds read fixed in rgb2luv_b
6 years ago
Alexander Alekhin
483f28723c
calib3d: initialize local vars, fix indentation in for loops
6 years ago
utibenkei
898cf70210
Merge pull request #14597 from utibenkei:fix_java_missing_consts
...
* java: fix java missing_consts
* java: test Calib3d constants
6 years ago
Alexander Alekhin
b958498074
java(test): test package filtering
...
Usage example:
- run.py -a -t java --package=calib3d
6 years ago
Alexander Alekhin
171dd9eff5
java(test): enable debug mode to show useful stacktraces
6 years ago
Rostislav Vasilikhin
e07ffe902e
Merge pull request #14616 from savuor:hsv_wide
...
HSV and HLS color conversions rewritten to wide intrinsics (#14616 )
* RGB2HSV_b vectorized
* RGB2HSV_f: widen
* RGB2HSV_f: shorten, more intuitive
* HSV2RGB_f and HSV2RGB_b widen
* hls2rgb_f widen
* instrumentation instead vx_cleanup
* RGB2HLS_f widen
* RGB2HLS_b rewritten to wide universal intrinsics
* define guard against no SIMD code
* hls2rgb_b rewritten
* extra define removed
* warning fixed
* hls2rgb_b: performance fixed
6 years ago
Alexander Alekhin
22701f0c27
imgcodecs(CAP_IMAGES): fix handling of input pattern
6 years ago
Vitaly Tuzov
f0fb91f2d4
Fixed v_signmask implementation for AVX2, updated universal intrinsics tests.
6 years ago
Ahmed Ashour
f9564e053d
java: test: use assertNotNull and assertFalse
6 years ago
Jan Starzynski
fb9c083a51
handle all orientations properly
6 years ago
Ahmed Ashour
f3319f6140
java: remove redundant declaration of java.lang package
6 years ago
catree
7ed858e38e
Fix issue with solvePnPRansac and Nx3 1-channel input when the number of points is 5. Try to uniform the input shape of projectPoints and undistortPoints.
6 years ago
Dmitry Kurtaev
26e426adb1
StridedSlice from TensorFlow
6 years ago
Thang Tran
eb243847eb
js: added floodFill function to JS binding
...
previously floodFill() is white-list without any implementation.
floodFill() is now fully functional in JS.
6 years ago
Rostislav Vasilikhin
e90e0ef9aa
Merge pull request #14106 from savuor:lab_wide
...
Lab, Luv and XYZ conversions rewritten to wide intrinsics (#14106 )
* rgb2xyz<float> re-vectorized
* rgb2xyz_i vectorized for ushort and uchar
* xyz2rgb<float> vectorized
* xyz2rgb_i vectorized for both uchar and ushort
* intermediate conversions (int->float) rewritten
* packed rgb2luv rewritten
* (some) float conversions rewritten
* burnt volatile int _3 and similar
* RGB2Lab_b rewritten
* tests: logging made better
* RGB2Lab_f (LRGB path) rewritten
* Lab2RGBfloat rewritten
* Lab2RGBinteger and Lab2RGB_b rewritten to wide universal intrinsics
* Luv2RGBinteger wide vectorized
* RGB2Lab_b fixed: v_sub_wrap instead of saturated sub
* warnings fixed
* trying to fix compilation on older compilers
* using 16x8 registers for 8-element dot product
* cleanup added
* splineInterpolate: loop unrolled, perf fix for f32x4
* Lab2RGBfloat: grab 2x more data to process on f32x4
* nrepeats for Luv2RGBfloat, +20% perf
* minor
* nrepeats to RGB2Lab_f
* Lab2RGBinteger: no tab for linear BGR
* nrepeats for RGB2Luvfloat
* Luv2RGBinteger: no tab for linear RGB
* +10% more to perf of Luv2RGBfloat
* nrepeats for 256-simd for Lab2RGBfloat
* less warnings
* BOM removed
* CV_SIMD_WIDTH used for lanes number checking
* trilinearPackedInterpolate: 128-bit specialization added
* fix build; no vx_cleanup(), instrumentation instead
6 years ago
dianlujitao
f0f50b757d
Fix LogSoftmax for ONNX
...
Fix wrong indentation as well while at it
6 years ago
catree
33cb9c5ca5
Add SOLVEPNP_IPPE for planar pose estimation. Add solvePnPGeneric function that returns all the pose solutions and the reprojection errors.
6 years ago
catree
b5e2ec4ea4
Fix typo in NormTypes documentation.
6 years ago
Kohei Yoshida
7b4aefedea
Add support for loading ONNX model from in-memory buffer.
6 years ago
Vitaly Tuzov
7a55f2af3b
Updated AVX2 implementation of v_popcount for u8.
6 years ago
Daniel Ingram
962d57b4d6
Merge pull request #14559 from daniel-s-ingram:master
...
* Fix typo: 'divisble' -> 'divisible'
* Fix typo: 'One of arguments' -> 'One of the arguments'
6 years ago
Alexander Alekhin
8936a0f3a5
android: fix JavaCamera2View UV plane handling
6 years ago
Alexander Alekhin
68012bfbf9
calib3d: fix uninitialized fields in ctors
6 years ago
Vitaly Tuzov
1220dd4877
Updated v_popcount description, reference implementation and test.
6 years ago
Vitaly Tuzov
96ab78dc4f
Reworked v_popcount implementation to provide number of bits in a single lane
6 years ago
Liubov Batanina
dfa753c6b4
Support OCV backend
6 years ago
Dmitry Kurtaev
6389dfe49c
Fixed DetectionOutput output blob shape
6 years ago
Liubov Batanina
dadb1473c1
Add BatchNorm3d layer
6 years ago
Dmitry Kurtaev
c3b0a68a2b
Async mode for dnn's object detection sample
6 years ago
Maksim Shabunin
9a81ff7315
Backported AVFoundation fixes from master branch
6 years ago
Dmitry Kurtaev
471b83ccd5
Modify paddings computation for SAME pad mode
6 years ago
Sayed Adel
5a77f4cee3
Merge pull request #14007 from seiko2plus:core_avx512_infa
...
* core: improve AVX512 infrastructure by adding more CPU features groups
* cmake: use groups for AVX512 optimization flags
* core: remove gap in CPU flags enumeration
* cmake: restore default CPU_DISPATCH
6 years ago
Dmitry Kurtaev
adc1ef9308
Fix uint8 input data for Async mode of dnn
6 years ago
seungju
ceff82877d
fix a typo in md file
...
fix a typo in ml_intro.md, tress -> trees
6 years ago
Thang Tran
1aff378ae8
imgproc: fixed bug from intersectConvexConvex
...
Added checks for all of vertices from each contour instead of checking
only for the first vertex.
6 years ago
Dmitry Kurtaev
a6ed8f268a
Remove extra weights cloning from TensorFlow importer
6 years ago
Lubov Batanina
77fa59c3da
Merge pull request #14301 from l-bat:conv3d
...
Support Convolution3D layer on IE backend (#14301 )
* Add Convolution3D layer
* Disable CXX11
* Fixed tests
* Add Pooling3D layer
* Merge Conv2d with Conv3d and Pool2d with Pool3d layers
* Split pads
* Add Deconvolution layer
* Refactoring
* Deduplication
* Refactoring
* Add utils for Convolution and Pooling layers
6 years ago