Namgoo Lee
5a2faab2e6
CUDA 10.1 Build Issue Fix
6 years ago
Namgoo Lee
2b6be3cb0f
cudev - Rework some code
...
- Use shfl_down, instead of __shfl_down, on warp scan
- Remove race conditions
6 years ago
Namgoo Lee
21eb60f88b
cudalegacy: Use safe block scan function
6 years ago
Namgoo Lee
970293a229
__shfl_up_sync with mask for CUDA >= 9
...
* __shfl_up_sync with proper mask value for CUDA >= 9
* BlockScanInclusive for CUDA >= 9
* compatible_shfl_up for use in integral.hpp
* Use CLAHE in cudev
* Add tests for BlockScan
6 years ago
Tomoaki Teshima
e6ef9221cb
fix test failure of cudev
...
* follow the implementation of Luv2RGBfloat in imgproc/src/color_lab.cpp
* loosen threshold in cudaimgproc
6 years ago
Tomoaki Teshima
6a5266df79
fix CvFp16Test failure
6 years ago
cyy
8f78a1123b
fix uninitialized read errors reported by CUDA-INITCHECK
6 years ago
Hamdi Sahloul
a39e0daacf
Utilize CV_UNUSED macro
6 years ago
luz.paz
d05714995c
Misc. modules/ cont. pt2
...
Found via `codespell`
7 years ago
Namgoo Lee
25c36fb05f
cv::cuda::cvtColor bug fix (#10640)
...
* cuda::cvtColor bug fix
Fixed bug in conversion formula between RGB space and LUV space.
Testing with opencv_test_cudaimgproc.exe, this commit reduces the number
of failed tests from 191 to 95. (96 more tests pass)
* Rename variables
7 years ago
catree
6d06fcb414
Fix CUDA integral.
7 years ago
Peter J. Stieber
5669ee815b
Replace private.cuda.hpp with conditional include of cuda_fp16.h.
7 years ago
Boris Fomitchev
c48807c383
Merge pull request #9418 from borisfom:cuda9
...
CUDA9 build fixed, added detection (#9418)
* CUDA9 build fixed, added detection
* Replacing deprecated __shfl_xxx with __shfl_sync, fixing bogus CUDA9 warnings
7 years ago
nnorwitz
9210cefb36
Use %% for inline assembly rather than % so this compiles with clang.
8 years ago
sourin
a34fbf7bb1
Fixed identifier warnings
8 years ago
Tomoaki Teshima
2974b049e7
cudev: add feature to convert FP32(float) from/to FP16(half) on GPU
...
* add feature of Fp16 on GPU (cudev)
* add test
* leave template function as unimplemented to raise error
8 years ago
aravind
f4f1561781
Fixed cv::cuda::reduce bug.
9 years ago
Vladislav Vinogradov
2afb02fcb4
fix BORDER_WRAP processing on Maxwell generation
9 years ago
Vladislav Vinogradov
e22979f334
fix #4343: cv::cuda::findMinMaxLoc incorrect output for single row matrix
10 years ago
Vladislav Vinogradov
9b8c3fd675
rewrite cuda::cvtColor with new device layer and fix test failures
10 years ago
Vladislav Vinogradov
8237418be6
add Allocator parameter to cudev::GpuMat_ constructors
10 years ago
Vladislav Vinogradov
53862687d5
rename CudaMem -> HostMem to better reflect its purpose
10 years ago
Vladislav Vinogradov
b5ab82fdbd
mark old CUDA device layer as deprecated and remove it from doxygen documentation
...
add a note to use new cudev module as a replacement
10 years ago
Vladislav Vinogradov
25f33a7e30
update cudev color conversions according to the latest changes in CPU code
10 years ago
Maksim Shabunin
ceb6e8bd94
Doxygen documentation: cuda
10 years ago
Vladislav Vinogradov
f1e44fa5ca
fix bug #3678 (cuda::integral failures)
11 years ago
Roman Donchenko
bfa40e180f
Removed another usage of __func__, following #1763.
11 years ago
Vladislav Vinogradov
23cc31e041
used new device layer for cv::cuda::LUT
11 years ago
Vladislav Vinogradov
1ef211b889
used new device layer for cv::gpu::reduce
11 years ago
Vladislav Vinogradov
e1aa2fd06c
added gridMinMaxLoc function
11 years ago
Vladislav Vinogradov
bbd519be42
fixed warnings
11 years ago
Vladislav Vinogradov
045a856c24
used new device layer for cv::gpu::minMax
11 years ago
Vladislav Vinogradov
b705e0d886
used new device layer for cv::gpu::sum
11 years ago
Vladislav Vinogradov
9fe92e2111
renamed grid/glob_reduce.hpp -> grid/reduce.hpp
11 years ago
Vladislav Vinogradov
7b3bbcea71
used new device layer for cv::gpu::transpose
11 years ago
Vladislav Vinogradov
6dbb32a05d
switched to new device layer in split/merge
11 years ago
Vladislav Vinogradov
7c8c836a7b
switched to new device layer in polar <-> cart
11 years ago
Vladislav Vinogradov
b11cccaaca
switched to new device layer in bitwise operations
11 years ago
Vladislav Vinogradov
ef9917ecf1
used new device layer for cv::gpu::compare
11 years ago
Vladislav Vinogradov
9c5da2ea22
used new device layer for cv::gpu::add
11 years ago
Vladislav Vinogradov
32d578f5f0
fixed gridTransform overloads problems
11 years ago
Vladislav Vinogradov
f4fb7fe1be
fixed compilation error "ambiguous symbol" on CUDA 5.0:
...
disabled Texture Reference API for old CUDA toolkits
11 years ago
Vladislav Vinogradov
776c0cb08c
switched to new device layer in gpucodec module
11 years ago
Vladislav Vinogradov
20f636fcee
fixed cudev compilation for old pre-Fermi archs
11 years ago
Vladislav Vinogradov
cfe4a71dc6
renamed gpu* source to cuda* in core module
11 years ago
Vladislav Vinogradov
e895b7455e
renamed gpu namespace -> cuda
11 years ago
Vladislav Vinogradov
7bda6992e5
fixed gpu BGR <-> YUV color conversion
11 years ago
Roman Donchenko
2304a561ca
Fixed a -Wenum-compare warning in cudev.
11 years ago
Vladislav Vinogradov
2311b0b43b
added cudev::GpuMat_<T> support to Input/Output arrays
11 years ago
Vladislav Vinogradov
1c0d2b37a7
added globPtr for GpuMat
11 years ago