cudawarped
10f6cbf41c
Fix 24789
11 months ago
cudawarped
7d681cf80d
build: first class cuda support
11 months ago
chacha21
6d2064b32f
added URL to understand CUDA options
...
A reference to https://docs.opencv.org/4.x/d2/dbc/cuda_intro.html is given.
This page is not up-to-date but it can evolve independently
11 months ago
Stefan Isak
5bffcdf7e8
Merge pull request #24382 from sisakat:cuda-compile-multicore
...
Enable multicore CUDA compilation #24382
CUDA source files are compiled single threaded. The option `--threads` was introduced in NVCC 11.2. The option specifies the number of threads to be used for compilation (see [NVIDIA NVCC Documentation](https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/index.html#threads-number-t )).
With CMake 3.12 the environment variable `CMAKE_BUILD_PARALLEL_LEVEL` was introduced (see [CMake Documentation](https://cmake.org/cmake/help/latest/envvar/CMAKE_BUILD_PARALLEL_LEVEL.html )). This variable is used to set the NVCC `--threads` option.
### Pull Request Readiness Checklist
See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request
- [x] I agree to contribute to the project under Apache 2 License.
- [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
- [x] The PR is proposed to the proper branch
- [ ] There is a reference to the original bug report and related work
- [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
- [ ] The feature is well documented and sample code can be built with the project CMake
1 year ago
cudawarped
358e3065ab
cuda: add default ptx when CUDA_ARCH_BIN is missing and allow CUDA_ARCH_PTX to be passed in isolation
1 year ago
CSBVision
6ea22535e2
Fixes #23187 (part 1)
...
Supports delay-loading for Ninja generators and Python bindings.
2 years ago
cudawarped
927aa1e617
cuda: add ada lovelace and hopper cuda arch
2 years ago
Alexander Alekhin
1bc3077890
cmake: VERSION_GREATER_EQUAL is not supported in CMake 3.5.1
2 years ago
cudawarped
5e03305da5
build: only disable win32 cuda warnings on CUDA SDK >= 11.2
2 years ago
cudawarped
b1288dad40
Enable cudacodec libs by default, prevent their locations being lost when cmake is re-run and add log info when the libs/headers cannot be found.
2 years ago
CSBVision
332ff4bf1c
Update OpenCVDetectCUDA.cmake
...
Adds the option to enable delay loading of CUDA DLLs on Windows. This is particularly useful to use the same binary on systems with and without CUDA support without distributing the CUDA DLLs to systems that cannot use them at all due to missing CUDA-supported hardware.
Resolves #13509
2 years ago
cudawarped
45d04175d4
Suppress
...
warning #1394-D: field of class type without a DLL interface used in a class with a DLL interface
from windows CUDA builds.
2 years ago
cudawarped
be670e168a
Merge pull request #22615 from cudawarped:nvcuvenc
...
Update CMake rules to include Nvidia Video Codec SDK encoding libs
2 years ago
Tomoaki Teshima
2336b0706d
add support for Orin GPU
2 years ago
Xerxes Battiwalla
1e1984a586
Fixed typo in error message in OpenCVDetectCUDA.cmake
...
There was a minor typo in the FATAL error message when the specified CUDA generation does not match any known generation
3 years ago
Alexander Alekhin
171fbf879f
cmake: fix typo in CUDA_GENERATION=Auto cache
4 years ago
Tomoaki Teshima
ac58b2f857
compute capability 8.6
...
- CC for RTX3090, RTX3080 and RTX3070
4 years ago
Tomoaki Teshima
d92af2aa85
* stop showing old generations
...
* keep it possible to build for old CC
* make sure old generations don't come up for the choice
* remove related version check of old one
4 years ago
Tomoaki Teshima
269b810601
re-enable automatic CC detection on Jetson
...
* treat both CMAKE_C_COMPILER and c_compiler_realpath as candidate
4 years ago
Alexander Alekhin
8bf1b9a422
cmake(cuda): repair ccbin, re-implement execute_process() cache
...
- preventive fix for arch "11.0" (CUDA_ARCH_BIN_OR_PTX_10 bug)
- new var: OPENCV_CUDA_DETECTION_NVCC_FLAGS
- new var: OPENCV_CMAKE_CUDA_DEBUG
4 years ago
Alexander Alekhin
e0f9eac521
cmake: backport CUDA scripts
4 years ago
Alexander Alekhin
eb6678ebef
Merge pull request #17699 from alalek:build_core_cuda
...
* core(cuda): fix build
- MSVS 19.25.28612.0
- CUDA release 11.0, V11.0.167
* cmake(cuda): backport workaround for CUDA 11
* cmake(cuda): call CUDA_BUILD_CLEAN_TARGET() on finalize
* cmake(cuda): use CMAKE_SUPPRESS_REGENERATION with MSVS
4 years ago
Alexander Smorkalov
0ccc839397
Restored compatibility with CMake older than 3.7.
4 years ago
Alexander Alekhin
98ad7d01d5
cmake(cuda): update handling of -std=c++11/14 flags
4 years ago
cyy
206c843f36
Merge pull request #17499 from cyyever:fix_CUDA11
...
Fix cuda11
* use cudnn_version.h to detect version when it is available
* remove nppi from CUDA11
* use ocv_list_filterout
* dnn(cuda): temporary disable CUDNN 8.0
4 years ago
Tomoaki Teshima
4cec9e5ff1
fix native detection on Jetson
4 years ago
Tomoaki Teshima
95ac650af6
make the fail safe path actually safe
...
* use only supported CC in the list
* workaround of #17526
4 years ago
Tomoaki Teshima
52844614c4
add Ampere CC
...
* Ampere has CC 8.0
* Arm64 server support has been added in CUDA 11 (only V100 for now)
5 years ago
Tomoaki Teshima
1cba763189
fix build error of automatic CC detection
5 years ago
Alexander Alekhin
d01cbe9320
cudacodec(build): fix detection in CMake, cleanup duplicate includes
5 years ago
Tomoaki Teshima
d4af89781b
fix corner case of libnvcuvid
...
* detect header automatically and not based on version number
5 years ago
cyy
c6638d62e6
respect CUDA_HOST_COMPILER when detecting CUDA arch
5 years ago
Tomoaki Teshima
156406b56c
select the architecture based on nvcc result
...
* cache the result
* DRY
* brush up based on review
5 years ago
Alexander Alekhin
a45928045a
Merge pull request #16150 from alalek:cmake_avoid_deprecated_link_private
...
* cmake: avoid deprecated LINK_PRIVATE/LINK_PUBLIC
see CMP0023 (CMake 2.8.12+)
* cmake: fix 3rdparty list
- don't include OpenCV modules
5 years ago
Vishal Chiluka
bca49b99ca
Removing static linking of cuda library
5 years ago
Alexander Alekhin
0fc584961c
cmake: workaround to fix link issues with stubs/libcuda.so.1
5 years ago
Yashas Samaga B L
ae279966c2
Merge pull request #14660 from YashasSamaga:dnn-cuda-build
...
add cuDNN dependency and setup build for cuda4dnn (#14660 )
* update cmake for cuda4dnn
- Adds FindCUDNN
- Adds new options:
* WITH_CUDA
* OPENCV_DNN_CUDA
- Adds CUDA4DNN preprocessor symbol for the DNN module
* FIX: append EXCLUDE_CUDA instead of overwrite
* remove cuDNN dependency for user apps
* fix unused variable warning
6 years ago
Peter J. Stieber
50ef9830e2
Added Turing to the _generations list.
6 years ago
Tomoaki Teshima
3e710d8eec
use correct CC value for Jetson Xavier
6 years ago
Alexander Alekhin
d6a8e08acc
cmake: fix variable expand in CMake conditions
6 years ago
jasjuang
a66fd527b0
add support for latest Turing gpu and cuda 10
6 years ago
Hamdi Sahloul
0e80f56e44
Bugfix: Detect CUDA archs for Windows
6 years ago
Sacha
9ff4475695
Support WITH_CUDA with clang compiler.
6 years ago
Alexander Alekhin
e79c729b20
CUDA: allow to use external FindCUDA from modern CMake
...
CMake 3.9.0+ is required
6 years ago
Alexander Alekhin
d4a8a9ee96
cmake: filter NVCC compiler flags (Wimplicit-fallthrough, OPENCV_CUDA_NVCC_FILTEROUT_OPTIONS)
...
issue 11552
7 years ago
Alexander Alekhin
2a330e3006
cuda: eliminate warnings
7 years ago
Tomoaki Teshima
779a42678d
fix CUDA build crash
...
* add --std=c++11 flag to nvcc definition
* remove not used abs functions
7 years ago
Alexander Alekhin
08941b7890
cmake: avoid amending of CMAKE_COMPILER_IS_[GNUCXX|CLANGCXX|CCACHE] vars
...
- Recommended compiler checks:
- GCC: CV_GCC
- Clang: CV_CLANG
- fixed problem with CMAKE_CXX_COMPILER_ID=Clang/AppleClang mess on MacOSX
Details: cmake --help-policy CMP0025
- do not declare Clang as GCC compiler
7 years ago
Balint Cristian
0c4e294504
Enable CUDA_NVCC_FLAGS at arch detect step.
7 years ago
Boris Fomitchev
c48807c383
Merge pull request #9418 from borisfom:cuda9
...
CUDA9 build fixed, added detection (#9418 )
* CUDA9 build fixed, added detection
* Replacing deprecated __shfl_xxx with __shfl_sync, fixing bogus CUDA9 warnings
7 years ago