Vladislav Vinogradov
8fcef225fb
switched to Input/Output Array in reductions operations
12 years ago
Vladislav Vinogradov
3eeaa9189c
Revert "Revert "Merge pull request #836 from jet47:gpu-modules""
12 years ago
Vladislav Vinogradov
58e472754a
fixed norm diff function (it uses pre-allocated buffer now)
12 years ago
Andrey Kamaev
416fb50594
Revert "Merge pull request #836 from jet47:gpu-modules"
...
This reverts commit fba72cb60d
, reversing
changes made to 02131ffb62
.
12 years ago
Vladislav Vinogradov
b4f3d08725
gpuarithm module fixes
12 years ago
Vladislav Vinogradov
71eeaa7276
gpuarithm module fixes
12 years ago
Vladislav Vinogradov
d08ebfe4d3
moved rectStdDev to gpuarithm
12 years ago
Vladislav Vinogradov
c2402053b9
removed columnSum function (it is a duplicate for reduce)
12 years ago
Vladislav Vinogradov
31c8b527c6
gpuarithm module for arithmetics operations on matrices
12 years ago
Vladislav Vinogradov
54e7c76d99
fixed compilation errors
12 years ago
Vladislav Vinogradov
b50090f850
restore cudaSafeCall
12 years ago
Vladislav Vinogradov
204a19b431
moved common gpu utility functionality to gpu_private.hpp
12 years ago
marina.kolpakova
84f45e1e03
fixed build with CUDA after latest merge
12 years ago
Vladislav Vinogradov
f0b19d4659
updated license header in whole gpu module
12 years ago
Vladislav Vinogradov
61b54149b1
added mask support to gpu norm and sum
12 years ago
Vadim Pisarevsky
efd00238e2
fixed warnings; restored fixed_size parameter in AutoBuffer
12 years ago
Vladislav Vinogradov
281d036fcf
optimizations:
...
- new reduce implementation (with kepler optimizations)
- saturate_cast via asm command
- video SIMD instructions in element operations
- float arithmetics instead of double
- new deviceSupports function
12 years ago
Vadim Pisarevsky
c197a46e7e
removed rarely used fixed_size parameter from AutoBuffer type, added optional AutoBuffer* but to cvarrToMat in order to speedup CvSeq->Mat conversion; finished conversion of convex hull and related functions to C++
12 years ago
Vladislav Vinogradov
dd8e442bda
replaced DeviceInfo().supports with deviceSupports
12 years ago
Vladislav Vinogradov
e8f9762ef3
matrix reduction
13 years ago
OpenCV Buildbot
81f826db2b
Normalize line endings and whitespace
13 years ago
OpenCV Buildbot
04384a71e4
Normalize line endings and whitespace
13 years ago
Anatoly Baksheev
55b1d7c097
added CUDA_DISABLER preprocessor commands
13 years ago
Anatoly Baksheev
70204a8e68
Removed PtrElemStep, Marked DevMem1D as deprecated, now should use PtrStepSz now
13 years ago
Anatoly Baksheev
4ce8de5174
Merged revision(s) 8455 from trunk/opencv/modules/gpu/src:
...
compilation with CUDA5.0 (#1982 , thanks to Rodrigo)
........
13 years ago
Anatoly Baksheev
543825f2f5
compilation with CUDA5.0 ( #1982 , thanks to Rodrigo)
13 years ago
Vladislav Vinogradov
eaea6782d5
added more assertion on device features to gpu functions and tests
...
moved TargerArchs and DeviceInfo to core
fixed bug in GpuMat::copy with mask (incorrect index in function tab)
13 years ago
Vladislav Vinogradov
26691e00d4
fixed gpu core tests (added additional check for device's feature support)
...
added assertion on double types for old devices
13 years ago
Vladislav Vinogradov
2d30480982
created wrappers for new NPP functions
...
removed void integral(const GpuMat& src, GpuMat& sum, GpuMat& sqsum, Stream& stream) - it fails with NPP_NOT_IMPLEMENTED error
updated docs, accuracy and performance tests
13 years ago
Vladislav Vinogradov
f8aba8608d
fixed build under CUDA 4.1
13 years ago
Vladislav Vinogradov
0f53f2993e
removed BEGIN_OPENCV_DEVICE_NAMESPACE macros
14 years ago
Vladislav Vinogradov
fcfa72081e
moved GpuMat and DevMem2D to core module, some code refactoring
14 years ago
Anatoly Baksheev
7076dfd7d6
gpu module: refactored devmem2d.hpp (atomic bomb)
14 years ago
Vladislav Vinogradov
8b23c79294
implemented gpu::reduce
14 years ago
Vladislav Vinogradov
42ced17c2c
fixed some warnings under win64
14 years ago
Vladislav Vinogradov
926a6bba00
modified according to CUDA 4.0 API updates
14 years ago
Vladislav Vinogradov
6a03be2632
added lightweight class DeviceBuffer to matrix_reductions.cpp
14 years ago
Vladislav Vinogradov
1c9f4e7ca2
fixed gpu::meanStdDev and gpu::norm under CUDA 4.0
...
fixed compilation under Win64
14 years ago
Anatoly Baksheev
6b34532901
minor + warnings
14 years ago
Alexey Spizhevoy
63806c9ab9
renamed gpu::DeviceInfo::has into gpu::DeviceInfo::supports
14 years ago
Alexey Spizhevoy
16e74ab306
added buffered version of norm, updated performance sample and docs
14 years ago
Alexey Spizhevoy
3795142604
implemented gpu::norm via absSum, sqrSum, and minMax (removed norm_diff call), added support of other data types
14 years ago
Alexey Spizhevoy
ae529f4bc6
added absSum function
14 years ago
Alexey Spizhevoy
575fd1fe4c
reafactoring: replaced query device props functions with the DeviceInfo class
14 years ago
Alexey Spizhevoy
891e2ff310
replaced has* methods in the GPU module with the TargetArchs monostate
14 years ago
Vladislav Vinogradov
811ba31897
added synchronization after NPP calls
14 years ago
Alexey Spizhevoy
db41449be8
fixed errors under MacOS
14 years ago
Alexey Spizhevoy
01dafce1a1
fixed some bugs in GPU matrix reductions, removed <functional> into precomp.hpp
14 years ago
Alexey Spizhevoy
0da71a01ff
fixed some GPU tests failing when compiled for 1.1(no doubles) and run on 1.3(with doubles)
14 years ago
Alexey Spizhevoy
8779306800
updated main CMakeLists.txt gpu module section, now user can manage binary and intermediate code versions of the gpu module image
...
added more functions to check version of gpu code in runtime
14 years ago