opencv

Commit Graph

Author	SHA1	Message	Date
Vadim Pisarevsky	8b3d6603d5	another round of dnn optimization (#9011 ) * another round of dnn optimization: * increased malloc alignment across OpenCV from 16 to 64 bytes to make it AVX2 and even AVX-512 friendly * improved SIMD optimization of pooling layer, optimized average pooling * cleaned up convolution layer implementation * made activation layer "attacheable" to all other layers, including fully connected and addition layer. * fixed bug in the fusion algorithm: "LayerData::consumers" should not be cleared, because it desctibes the topology. * greatly optimized permutation layer, which improved SSD performance * parallelized element-wise binary/ternary/... ops (sum, prod, max) * also, added missing copyrights to many of the layer implementation files * temporarily disabled (again) the check for intermediate blobs consistency; fixed warnings from various builders	8 years ago
Alexander Alekhin	f8a75c4361	dispatch: added CV_TRY_${OPT} macro, fix dnn build - 1: OPT is available directly or via dispatcher - 0: optimization is not compiled at all	8 years ago
Maksim Shabunin	32d4af36e2	Fixing some static analysis issues	8 years ago
Alexander Alekhin	006966e629	trace: initial support for code trace	8 years ago
James Clarke	25020f2672	fast_math.hpp: Use __asm__ rather than asm; fixes including with -std=c99	8 years ago
Alexander Alekhin	3e3e2dd512	android: make optional "cpufeatures", build fixes for NDK r15	8 years ago
Rostislav Vasilikhin	29593635ed	licence updated	8 years ago
Alexander Alekhin	e23b59da5c	build: fix v_reduce_sum4 (requires SSE3)	8 years ago
Vadim Pisarevsky	fbafc700ea	added v_reduce_sum4() universal intrinsic; corrected number of threads in cv::getNumThreads() in the case of GCD	8 years ago
Alexander Alekhin	71517a910a	build: fix errors for MSVS2010-2013, reduce default softfloat scope	8 years ago
Rostislav Vasilikhin	c6a3a18894	SoftFloat integrated (#8668 ) * everything is put into softfloat.cpp and softfloat.hpp * WIP: try to integrate softfloat into OpenCV * extra functions removed * softfloat made stateless * CV_EXPORTS added * operators fixed * exp added, log: WIP * log32 fixed * shorter names; a lot of TODOs * log64 rewritten * cbrt32 added * minors, refactoring * "inline" -> "CV_INLINE" * cast to bool warnings fixed * several warnings fixed * fixed warning about unsigned unary minus * fixed warnings on type cast * inline -> CV_INLINE * special cases processing added (NaNs, Infs, etc.) * constants for NaN and Inf added * more macros and helper functions added * added (or fixed) tests for pow32, pow64, cbrt32 * exp-like functions fixed * minor changes * fixed random number generation for tests * tests for exp32 and exp64: values are compared to SoftFloat-based naive implementation * minor warning fix * pow(f, i) 32/64: special cases handling added * unused functions removed * refactoring is in progress (not compiling) * CV_inline added * unions {uint_t, float_t} removed * tests compilation fixed * static const members -> static methods returning const * reinterpret_cast * warning fixed * const-ness fixed * all FP calculations (even compile-time) are done in SoftFloat + minor fixes * pow(f, i) removed from interface (can cause incorrect cast) to internals of pow(f, f), tests fixed * CV_INLINE -> inline * internal constants moved to .cpp file * toInt_minMag() methods merged into toInt() methods * macros moved to .cpp file * refactoring: types renamed to softfloat and softdouble; explicit constructors, etc. * toFloat(), toDouble() -> operator float(), operator double() * removed f32/f64 prefixes from functions names * toType() methods removed, round() and trunc() functions added * minor change * minors * MSVC: warnings fixed * added int cvRound(), cvFloor, cvCeil, cvTrunc, saturate_cast<T>() * typo fixed * type cast fixed	8 years ago
catree	542cdb2c39	Improve solvePnP doc, add assert >= 4 in solvePnP, escape underscore character for Scalar_ documentation. Add reference to SOLVEPNP_ITERATIVE in the doc.	8 years ago
Vitaly Tuzov	1d62a025b3	Moved size restrictions for OpenVX processed images to corresponding cpp files	8 years ago
mschoeneck	4a4d94f266	Merge pull request #8694 from mschoeneck:Canny Parallelize Canny with custom gradient (#8694) * New Canny implementation. Restructuring code in parallelCanny class. Align mag buffer and map. * Fix warnings. * Missing SIMD check added. * Replaced local trailingZeros in contours.cpp. Use alignSize in canny.cpp * Fix warnings in alignSize and allocate just minimum extra columns. * Fix another warning in map.create. * Exchange for loop by do loop to avoid double check at the beginning. Define extra SIMD CANNY_CHECK to avoid unnecessary continue.	8 years ago
krishraghuram	9ea2f5211e	Correct the existing documented T-API functions to match the doxygen format (#8758 ) * Correct the existing documented T-API functions to match the doxygen format. * docs: fix comments style * T-API documentation: minor formatting changes	8 years ago
Maksim Shabunin	b04ed5956e	Fixed several issues found by static analysis in core module	8 years ago
Alexander Alekhin	c5e9d1adae	macro for static analysis tools	8 years ago
cDc	003745432f	fix Mat_ release #8680	8 years ago
André Mewes	70e6391f38	create homogeneous affine matrix when constructing from 4x3 cv::Mat	8 years ago
Robert Bragg	8f5ea7deda	core: avoid clash with _N define from ctype.h in headers This updates the public headers to use _Nm instead of _N in templates since _N is defined by the widely used ctype.h.	8 years ago
Pavel Vlasov	11c2ffaf1c	Update for IPP for OpenCV 2017u2 integration; Updated integrations for: cv::split cv::merge cv::insertChannel cv::extractChannel cv::Mat::convertTo - now with scaled conversions support cv::LUT - disabled due to performance issues Mat::copyTo Mat::setTo cv::flip cv::copyMakeBorder - currently disabled cv::polarToCart cv::pow - ipp pow function was removed due to performance issues cv::hal::magnitude32f/64f - disabled for <= SSE42, poor performance cv::countNonZero cv::minMaxIdx cv::norm cv::canny - new integration. Disabled for threaded; cv::cornerHarris cv::boxFilter cv::bilateralFilter cv::integral	8 years ago
Peter Würtz	4c095a76c0	Add docstring for UMat::handle	8 years ago
Pavel Vlasov	35c7216846	IPP for OpenCV 2017u2 initial enabling patch;	8 years ago
Vadim Pisarevsky	dd54f7a22a	got rid of Blob and BlobShape completely; use cv::Mat and std::vector<int> instead	8 years ago
Arnaud Brejeon	636ab095b0	Merge pull request #8535 from arnaudbrejeon:std_array Add support for std::array<T, N> (#8535) * Add support for std::array<T, N> * Add std::array<Mat, N> support * Remove UMat constructor with std::array parameter	8 years ago
insoow	2922738b6d	Merge pull request #8104 from insoow:master Gemm kernels for Intel GPU (#8104) * Fix an issue with Kernel object reset release when consecutive Kernel::run calls Kernel::run launch OCL gpu kernels and set a event callback function to decreate the ref count of UMat or remove UMat when the lauched workloads are completed. However, for some OCL kernels requires multiple call of Kernel::run function with some kernel parameter changes (e.g., input and output buffer offset) to get the final computation result. In the case, the current implementation requires unnecessary synchronization and cleanupMat. This fix requires the user to specify whether there will be more work or not. If there is no remaining computation, the Kernel::run will reset the kernel object Signed-off-by: Woo, Insoo <insoo.woo@intel.com> * GEMM kernel optimization for Intel GEN The optimized kernels uses cl_intel_subgroups extension for better performance. Note: This optimized kernels will be part of ISAAC in a code generation way under MIT license. Signed-off-by: Woo, Insoo <insoo.woo@intel.com> * Fix API compatibility error This patch fixes a OCV API compatibility error. The error was reported due to the interface changes of Kernel::run. To resolve the issue, An overloaded function of Kernel::run is added. It take a flag indicating whether there are more work to be done with the kernel object without releasing resources related to it. Signed-off-by: Woo, Insoo <insoo.woo@intel.com> * Renaming intel_gpu_gemm.cpp to intel_gpu_gemm.inl.hpp Signed-off-by: Woo, Insoo <insoo.woo@intel.com> * Revert "Fix API compatibility error" This reverts commit `2ef427db91`. Conflicts: modules/core/src/intel_gpu_gemm.inl.hpp * Revert "Fix an issue with Kernel object reset release when consecutive Kernel::run calls" This reverts commit `cc7f9f5469`. * Fix the case of uninitialization D When C is null and beta is non-zero, D is used without initialization. This resloves the issue Signed-off-by: Woo, Insoo <insoo.woo@intel.com> * fix potential output error due to 0 * nan Signed-off-by: Woo, Insoo <insoo.woo@intel.com> * whitespace fix, eliminate non-ASCII symbols * fix build warning	8 years ago
Vitaly Tuzov	9dc36a1ece	Tuned restrictions for Canny, Warp, FAST, Accumulate and Convolution OpenVX HAL calls on small images	8 years ago
Vitaly Tuzov	87bb74312b	Disabled vxuConvolution call for Sobel, GaussianBlur and Box filter evaluation	8 years ago
Matthias Grundmann	fce7469961	Update utility.hpp Adding missing header for ostream decl. in line 384	8 years ago
Fangjun KUANG	4065778255	fix typos.	8 years ago
Vitaly Tuzov	0f1a56da7c	Changed restrictions for OpenVX HAL calls on small images	8 years ago
jveitchmichaelis	8f19363c07	Update documentation for getCudaEnabledDeviceCount Inform users that getCudaEnabledDeviceCount can return -1 in some cases.	8 years ago
nnorwitz	24e8cd1a78	Use %% for inline assembly rather than % so this compiles with clang. Same as `9210cefb36` but for this file too.	8 years ago
Vitaly Tuzov	bf62dca45a	Extended restrictions for OpenVX HAL calls on small images	8 years ago
Vitaly Tuzov	bf5b7843e8	Extended set of OpenVX HAL calls disabled for small images	8 years ago
Alexander Alekhin	e5d9b608c4	cmake: fix fp16 support	8 years ago
Sergiu Deitsch	4f31759965	prevent copying in cv::Mat_<T> move assignment	8 years ago
Tomoaki Teshima	507071cc6f	suppress warnings on Jetson TK1	8 years ago
berak	3e0b63f65b	fix comment in optim.hpp	8 years ago
jexner	b45e784beb	Fix segmentation fault in cv::Mat::forEach This issue concerns only matrices with more dimensions than columns. See https://github.com/opencv/opencv/issues/8447	8 years ago
Fangjun KUANG	da94d85789	add more info to the error code.	8 years ago
Fangjun KUANG	f82d64c6e5	Add more info to the error code.	8 years ago
Alexander Alekhin	17e5e4cd5a	core: CPU target dispatcher update - use suffixes like '.avx.cpp' - added CMake-generated files for '.simd.hpp' optimization approach - wrap HAL intrinsic headers into separate namespaces for different build flags - automatic vzeroupper insertion (via CV_INSTRUMENT_REGION macro)	8 years ago
Fangjun KUANG	94521629ab	fix issue 8411.	8 years ago
KUANG, Fangjun	eae1ebfd29	fix issue 8411.	8 years ago
Naba Kumar	29680100ac	Support for creating streams with custom allocator	8 years ago
Naba Kumar	00f3ad7217	Implement DFT as cv::Algorithm to support concurrent streams	8 years ago
Naba Kumar	cdcf44b3ef	Expose BufferPool class for external use also	8 years ago
Hamdi Sahloul	171e705ba4	Fixes the constructor of 1x14, 2x7, 7x2 or 14x1 matrix	8 years ago
Fangjun KUANG	3ad6d13ff3	Fix an error in the documentation.	8 years ago

1 2 3 4 5 ...

1263 Commits (bbb14d3746232a2c5e93c87e648f5fdfb9bad604)