Alexander Alekhin
f96569da1e
Merge pull request #20746 from easonycwang:easonycwang/inpaint_perf_opt
3 years ago
easonycwang
91ff45fbde
Tile:
...
This submission is used to improve the performance of the inpaint algorithm for 3 channels images(RGB or BGR).
Reason:
The original algorithm implementation did not consider the cache hits.
The loop of channels is outside the core loop, so the perfmance is not very good.
Moving the channel loop inside the core loop can significantly improve cache hits, thereby improving performance.
Performance:
360P, about >= 30% improvement
iphone8P: 5.52ms -> 3.75ms
iphone6s: 14.04ms -> 9.15ms
3 years ago
Alexander Alekhin
45aabc5d0d
Merge pull request #20742 from UnaNancyOwen:fix_text_detection
3 years ago
Tsukasa Sugiura
f9bd83c854
fix cast in text detection sample
3 years ago
Amir Tulegenov
86a51015b1
Merge pull request #19554 from amirtu:OCV-215_cvtColorTwoPlane_wrong_output_when_Y_Plane_Mat_has_step
...
different paddings in cvtColorTwoPlane() for biplane YUV420
* Different paddings support in cvtColorTwoPlane() for biplane YUV420
* Build fix for dispatch case.
* Resoted old behaviour for y.step==uv.step to exclude perf regressions.
Co-authored-by: amir.tulegenov <amir.tulegenov@xperience.ai>
Co-authored-by: Alexander Smorkalov <alexander.smorkalov@xperience.ai>
3 years ago
Alexander Alekhin
998406d20e
Merge pull request #20729 from MikaelUrankar:3.4
3 years ago
mikael
f7b4b750d8
Detect FP16 on FreeBSD aarch64
3 years ago
Alexander Alekhin
a4e2c56317
Merge pull request #20713 from SamFC10:unsqueeze-opset13
3 years ago
SamFC10
9c5d7716e2
fix for unsqueeze opset version 13
3 years ago
Alexander Alekhin
46fd26e366
Merge pull request #20699 from alalek:dnn_perf_update_convolution_tests
3 years ago
rogday
c410d7a97d
Merge pull request #20671 from rogday:yolov4x-mish
...
Add support for YOLOv4x-mish
* backport to 3.4 for supporting yolov4x-mish
* add YOLOv4x-mish test
* address review comments
Co-authored-by: Guo Xu <guoxu@1school.com.cn>
3 years ago
Alexander Alekhin
6fa63dcc0c
Merge pull request #20700 from alalek:cmake_fix_handling_ie_version
3 years ago
Alexander Alekhin
96f25332ea
Merge pull request #20692 from sturkmen72:update_convexhull_cpp
3 years ago
Alexander Alekhin
3385d38648
cmake: fix handling of INF_ENGINE_RELEASE
...
- default value should be handled earlier
3 years ago
Alexander Panov
1618c963e4
Merge pull request #20676 from AleksandrPanov:delete_createConvexHull_convertTo
...
* deleted dublicated createConvexHull and convertTo
* replaced checkVector(2) with points.empty()
3 years ago
Alexander Alekhin
1e3be09b3b
Merge pull request #20693 from alalek:backport_merge34_fix
3 years ago
Alexander Alekhin
6e66a9222a
dnn(onnx): fix format specifier
3 years ago
Suleyman TURKMEN
9d1e8b1e1d
Update convexhull.cpp
3 years ago
Alexander Alekhin
f605373a2b
Merge pull request #20690 from alalek:build_opencv_winpack_dldt_2021.4.1
3 years ago
Alexander Alekhin
56b7622612
Merge pull request #20687 from alalek:core_ocl_fix_intel_gemm
3 years ago
Alexander Alekhin
696a6ccd57
Merge pull request #20686 from zihaomu:tf_biasadd_bug
...
DNN: BiasAdd tf_importer.cpp could load Const from First Place
3 years ago
Zihao Mu
51b03b87e6
BiasAdd could load Const from second place.
3 years ago
Alexander Alekhin
aa7ba0bc1a
build: winpack_dldt with dldt 2021.4.1
3 years ago
Alexander Alekhin
07e4076585
Merge pull request #20689 from diablodale:fix20613-selectocldevice
3 years ago
Dale Phurrough
de1a459879
fix opencv/opencv#20613
...
* copy 4.x selectOpenCLDevice() -- it is compatible
* filter platforms rather than trying only first matching
* this works on 3.4 and 4.x master
3 years ago
Alexander Alekhin
1aacb9bb15
dnn(perf): update convolution tests
3 years ago
Alexander Alekhin
9b4ecc96f6
core(ocl): buffer bounds in intelblas_gemm_buffer_NT
3 years ago
Alexander Alekhin
e3f4f874c5
Merge pull request #20670 from alalek:core_ocl_fix_intel_gpu_gemm_requirements
...
core(OpenCL): fix intel_gpu_gemm kernel requirements
* core(ocl): fix intel_gpu_gemm integration
- allow bailout to generic OpenCL kernel
* core(ocl): avoid failures of generic OpenCL gemm kernel
* core(ocl): define alignment requirements of intel_gpu_gemm kernels
3 years ago
Alexander Alekhin
6ace801418
Merge pull request #20661 from alalek:dnn_ocl_fix_gemm_like_kernel
3 years ago
rogday
d31b93b513
Merge pull request #20674 from rogday:prelu_slope
...
Fix PReLU negative slope access pattern
* fix prelu negative slope access pattern
* change begin() to ptr()
3 years ago
Alexander Alekhin
ac0fd6aa9a
Merge pull request #20675 from diablodale:fix2-20544
3 years ago
Dale Phurrough
068f33cfdf
add nodiscard to features2d clone funcs
3 years ago
rogday
4807cd8a6e
Merge pull request #20605 from rogday:split_slice_shenanigans
...
Add Normalize subgraph, fix Slice, Mul and Expand
* Add Normalize subgraph, support for starts<0 and axis<0 in Slice, Mul broadcasting in the middle and fix Expand's unsqueeze
* remove todos
* remove range-based for loop
* address review comments
* change >> to > > in template
* fix indexation
* fix expand that does nothing
3 years ago
Alexander Alekhin
35e824c287
dnn(ocl): fix out of bound access in GEMM-like kernels
...
- dropped usage of CreateSubBuffer() - buffers lifetime management issue
- fixed elementwise offset
- avoid out of bounds read access
3 years ago
Alexander Alekhin
1e0d290f2e
Merge pull request #20655 from alalek:dnn_ocl_fix_globalsize
3 years ago
Alexander Alekhin
0097a8d097
Merge pull request #20657 from zchrissirhcz:upd-doc-for-py_fast
3 years ago
Zhuo Zhang
36cc43170d
docs: fix image path for py_fast doc
3 years ago
Alexander Alekhin
5578ad5e14
dnn(ocl): fix automatic globalsize adjusting
...
- if kernel code doesn't support that
3 years ago
Alexander Alekhin
d11f0a709d
Merge pull request #20652 from alalek:ocl_debug
3 years ago
Alexander Alekhin
0a43b23275
Merge pull request #20651 from alalek:issue_18361
3 years ago
Alexander Alekhin
7967683296
Merge pull request #20648 from alalek:issue_20615
3 years ago
Alexander Alekhin
5b2c016834
dnn(ocl): avoid out of buffer access in copyWeightsSwizzled
3 years ago
Alexander Alekhin
aaff125608
core(ocl): debug capabilities
3 years ago
Alexander Alekhin
407adc7061
dnn(ocl): fix buffer offsets in IDLF kernel
...
- drop CreateSubBuffer
- fix FUSED_CONV_ELTWISE mode
3 years ago
rogday
d0e612dc36
Merge pull request #20647 from rogday:resize_concat_optimization
...
Fix resize+concat optimization
* fix resize+concat optimization
* add comment and fix indentation
3 years ago
Alexander Alekhin
7c23ec90a9
Merge pull request #20646 from VadimLevin:dev/vlevin/fix-vector-conversion
3 years ago
Vadim Levin
390957fec4
fix: NumPy array allocation error message in vector conversion
3 years ago
Alexander Alekhin
060a76dc3e
Merge pull request #20573 from rogday:onnx_scale_fix
3 years ago
Alexander Alekhin
6625810d2a
Merge pull request #20618 from VadimLevin:dev/vlevin/fix-vector-conversion
3 years ago
WJJ1995
edc442afdb
Merge pull request #20511 from wjj19950828:add_humanseg_support_0806
...
* support PPSeg model for dnn module
* fixed README for CI
* add test case
* fixed bug
* deal with comments
* rm dnn_model_runner
* update test case
* fixed bug for testcase
* update testcase
3 years ago