Alexander Alekhin
e3f4f874c5
Merge pull request #20670 from alalek:core_ocl_fix_intel_gpu_gemm_requirements
...
core(OpenCL): fix intel_gpu_gemm kernel requirements
* core(ocl): fix intel_gpu_gemm integration
- allow bailout to generic OpenCL kernel
* core(ocl): avoid failures of generic OpenCL gemm kernel
* core(ocl): define alignment requirements of intel_gpu_gemm kernels
3 years ago
Dale Phurrough
8be86cbdfd
add usageFlags to UMat static factories
...
- add abi compatible overloads
- add test case
3 years ago
Alexander Alekhin
3d394943e6
core(ocl): avoid limit of Image kernel args
4 years ago
Alexander Alekhin
87e607a19b
core(ocl): skip SPIR test on AMD devices if problem detected
4 years ago
Dale Phurrough
96a15434a2
add move construct/assigns to cv::ocl main classes
...
- enables inline construct and assigns with r-values
- enables compiler-created default move
construct/assigns
- includes test cases
4 years ago
Alexander Alekhin
198b5096aa
Merge pull request #16754 from alalek:issue_16752
...
* core(test): FP16 norm test
* core: norm()-FP16 disable OpenCL
* core(norm): fix 16f32f local buffer size
5 years ago
Dmitry Kurtaev
24ab751547
Merge pull request #12565 from dkurt:dnn_non_intel_gpu
...
* Remove isIntel check from deep learning layers
* Remove fp16->fp32 fallbacks where it's not necessary
* Fix Kernel::run to prevent localsize > globalsize
6 years ago
Vadim Pisarevsky
6d7f5871db
added basic support for CV_16F (the new datatype etc.) ( #12463 )
...
* added basic support for CV_16F (the new datatype etc.). CV_USRTYPE1 is now equal to CV_16F, which may break some [rarely used] functionality. We'll see
* fixed just introduced bug in norm; reverted errorneous changes in Torch importer (need to find a better solution)
* addressed some issues found during the PR review
* restored the patch to fix some perf test failures
6 years ago
Tomoaki Teshima
8fd1bbde76
fix test error on Tinker Board (OpenCL on Arm platform)
...
* loosen some test threshold mainly for integer types
* use relative error for floating points result
* avoid division by zero by following the comment
* fix the indentation
7 years ago
luz.paz
5718d09e39
Misc. modules/ typos
...
Found via `codespell`
7 years ago
Alexander Alekhin
4a297a2443
ts: refactor OpenCV tests
...
- removed tr1 usage (dropped in C++17)
- moved includes of vector/map/iostream/limits into ts.hpp
- require opencv_test + anonymous namespace (added compile check)
- fixed norm() usage (must be from cvtest::norm for checks) and other conflict functions
- added missing license headers
7 years ago
Alexander Alekhin
15b909e80b
ocl: add SPIR Program loading test
...
SPIR kernels are located in opencv_extra
7 years ago
Alexander Alekhin
13c4a02157
ocl: low-level API to support OpenCL binary programs
7 years ago
Alexander Alekhin
d6c5e18e24
core(test): fix input data for OCL FP16 test
7 years ago
Alexander Alekhin
a4a47b538c
build: detect Android via '__ANDROID__' macro
...
https://sourceforge.net/p/predef/wiki/OperatingSystems
7 years ago
Tomoaki Teshima
d81cdb8e1c
add OpenCL version of convertFp16 and test
...
* disable vector operation for now
* brush up the implementation based on comment
8 years ago
Alexander Alekhin
dd304dbe05
test: fix min/max name mismatching
8 years ago
Alexander Alekhin
5da8d65371
eliminate compiler warnings
8 years ago
Alexander Alekhin
17e4d33b0e
added regression test for #5876
9 years ago
Philippe FOUBERT
b668d3f693
Modification of the related tests in core/tests/ocl
9 years ago
Ilya Lavrenov
cf4c79ebdd
disabled some kernels for Android && NVidia
10 years ago
Ilya Lavrenov
3a78a22733
convertScaleAbs for s8, f64
10 years ago
Alexander Karsakov
237cb93143
Added extra checks to ocl::Image2D
10 years ago
Alexander Karsakov
3a263c6326
Added tests for Image2D
10 years ago
Alexander Karsakov
ec827bed8e
Changed check to NORM_RELATIVE for OCL.Mat_Scalar_Scale tests
10 years ago
Alexander Karsakov
a89ff402fc
Refactoring of OCL_FftPlan class
10 years ago
VBystricky
c5632d412e
Change epsilon for NORM_INF_2args accuracy tests
10 years ago
vbystricky
1fe403f461
Enable OpenCL version of norm and convertScaleAbs or 32F data
...
Fix error in minmaxloc.cl
Change test for convertScaleAbs
Fix minMaxIdx for _src2 align
Change epsilon on the tests
10 years ago
Vadim Pisarevsky
d97f9fc477
now the tests pass with loop_times==30
10 years ago
Alexander Karsakov
c3100eeb19
Fixed buffer initialization in reduce kernel. Enabled OCL version of reduce for SUM, MAX, MIN modes.
10 years ago
Alexander Karsakov
713ddb89bd
Set minimum matrix size for AmdBlas::gemm to 20 since it works incorrect for small sizes
10 years ago
Alexander Karsakov
6ad4521b78
Fixed typos
10 years ago
Alexander Alekhin
55188fe991
world fix
10 years ago
vbystricky
2637b18b67
Fix epsilon for OpenCL function ConverTo, for convertation from float to integer types
10 years ago
vbystricky
2d587ad691
Prevent generating test case with mask filled 0 only
10 years ago
VBystricky
2026094691
Increase epsilon for OpenCL version UMatDot and ReduceAvg accurency tests
10 years ago
vbystricky
aad4104297
Fix accurency test for OpenCL matrix operations
10 years ago
vbystricky
184c5917bf
Fix accurency test of OpenCL merge function.
10 years ago
Vadim Pisarevsky
5267ed46c7
1. disabled OpenCL acceleration for cv::multiply() (CV_32F), cv::divide (CV_32F), cv::convertScaleAbs (CV_32F) and cv::reduce (SUM, CV_32F), cv::reduce (MIN & MAX), cv::flip (3-channel case).
...
2. changed the number of test loops from 1 to 30 (except for cv::pow() test, which fails for yet unknown reason)
3. disabled IPP acceleration for 3-channel norms.
4. modified relativeNorm test function to handle very small values
10 years ago
Alexander Karsakov
e51c0810b6
Added accuracy and performance tests for DFT all modes.
10 years ago
Alexander Karsakov
66ac46214d
Final refactoring, fixes
10 years ago
Alexander Karsakov
1d2cf0e20e
Added nonzero_rows support
11 years ago
Alexander Karsakov
52f76a3283
Added rest Elena's changes
11 years ago
Alexander Karsakov
7791264525
Added multi-block scheme
11 years ago
Alexander Karsakov
2b9e556055
Added Elena's changes with implemented DFT_INVERSE C2C mode.
11 years ago
Alexander Karsakov
b17bf031f6
Added DFT_SCALE for forward transforms
11 years ago
Alexander Karsakov
6c8b6bd0c7
Added packing to CCS format
11 years ago
Alexander Karsakov
ed07241f89
Completed all forward transforms.
11 years ago
Alexander Karsakov
e5a3ab3cb9
Added fftplan cache
11 years ago
Alexander Karsakov
0318d27720
Enabled precalculated wave
11 years ago