opencv

Commit Graph

Author	SHA1	Message	Date
Paul Murphy	1c4a64f0a1	Merge pull request #16138 from pmur:reg_16137 * imgproc: Prevent 1B overrun of 8C3 SIMD optimization The fourth value read via v_load_q is essentially ignored, but can cause trouble if it happens to cross page boundaries. The final few iterations may attempt to read the most extreme elements of S, which will read 1B beyond the array in most aligment cases. Dynamically compute the stop. This could be hoised from the loop, but will require a more extensive change. Likewise, cleanup the iteration increment statements to make it more obvious they do channel count (3) elements per pass. This should resolve #16137 * imgproc(resize): extra check	5 years ago
shimat	b89581960c	s/Voroni/Voronoi/g	5 years ago
Maksim Shabunin	435c97c7a2	imgproc: add parameter checks in calcHist and calcBackProj	5 years ago
RAJKIRAN NATARAJAN	b9435b9e38	Merge pull request #16094 from saskatchewancatch:issue-16053 * Add eps error checking for approxPolyDP to allow sensible values only for epsilon value of Douglas-Peucker algorithm. * Review changes for PR	5 years ago
Paul Murphy	a011035ed6	Merge pull request #15257 from pmur:resize * resize: HResizeLinear reduce duplicate work There appears to be a 2x unroll of the HResizeLinear against k, however the k value is only incremented by 1 during the unroll. This results in k - 1 duplicate passes when k > 1. Likewise, the final pass may not respect the work done by the vector loop. Start it with the offset returned by the vector op if implemented. Note, no vector ops are implemented today. The performance is most noticable on a linear downscale. A set of performance tests are added to characterize this. The performance improvement is 10-50% depending on the scaling. * imgproc: vectorize HResizeLinear Performance is mostly gated by the gather operations for x inputs. Likewise, provide a 2x unroll against k, this reduces the number of alpha gathers by 1/2 for larger k. While not a 4x improvement, it still performs substantially better under P9 for a 1.4x improvement. P8 baseline is 1.05-1.10x due to reduced VSX instruction set. For float types, this results in a more modest 1.2x improvement. * Update U8 processing for non-bitexact linear resize * core: hal: vsx: improve v_load_expand_q With a little help, we can do this quickly without gprs on all VSX enabled targets. * resize: Fix cn == 3 step per feedback Per feedback, ensure we don't overrun. This was caught via the failure observed in Test_TensorFlow.inception_accuracy.	5 years ago
Alexander Alekhin	734de34b7a	Merge pull request #16085 from alalek:imgproc_threshold_to_zero_ipp_bug * imgproc(IPP): wrong result from threshold(THRESH_TOZERO) * imgproc(IPP): disable IPP code to pass THRESH_TOZERO test	5 years ago
Alexander Alekhin	b369c456f2	imgproc(color): clarify error message	5 years ago
Brian Wignall	af997529a1	Fix some typos	5 years ago
Brian Wignall	9276f1910b	Fix some typos	5 years ago
Everton Constantino	75315fb297	Merge pull request #15494 from everton1984:hal_vector_get_n Improving VSX performance of integral function * Adding support for vector get function on VSX datatypes so the integral function gains a bit of performance. * Removing get as a datatype member function and implementing a new HAL instruction v_extract_n to get the n-th element of a vector register. * Adding SSE/NEON/AVX intrinsics. * Implement new HAL instruction v_broadcast_element on VSX/AVX/NEON/SSE. * core(simd): add tests for v_extract_n/v_broadcast_element - updated docs - commented out code to repair compilation - added WASM and MSA default implementations * core(simd): fix compilation - x86: avoid _mm256_extract_epi64/32/16/8 with MSVS 2015 - x86: _mm_extract_epi64 is 64-bit only * cleanup	5 years ago
clunietp	2185bce4b7	Fix 13577	5 years ago
Alexander Alekhin	f4d55d512f	imgproc: fix bit-exact GaussianBlur() / sepFilter2D() (#15855 ) * imgproc: fix bit-exact GaussianBlur() / sepFilter2D() - avoid kernels with bad approximation - GaussiabBlur - apply error-diffusion approximation for kernel (8-bit fraction) * java(test): update features2d ref data * test: update test_facedetect	5 years ago
ChipKerchner	1d33335e33	Convert demosiacing with variable number of gradients to HAL - 5.5x faster	5 years ago
Alexander Alekhin	763b80d5fa	imgproc(IPP): disable ippiDistanceTransform_3x3_8u32f_C1R	5 years ago
Alexander Alekhin	7ecdcf6ca6	build: GCC9 compilation	5 years ago
Chip Kerchner	2112aa31e6	Merge pull request #15828 from ChipKerchner:momentsToHal * Convert moments in tile algorithms to HAL (1.3x faster for VSX). * Adding NEON code back in for non 64-bit platforms. * Remove floats from post processing.	5 years ago
Ciprian Alexandru Pitis	d2e02779c4	Merge pull request #15799 from Cpitis:feature/parallelization Parallelize pyrDown & calcSharrDeriv * ::pyrDown has been parallelized * CalcSharrDeriv parallelized * Fixed whitespace * Set granularity based on amount of threads enabled * Granularity changed to cv::getNumThreads, now each thread should receive 1/n sized stripes * imgproc: move PyrDownInvoker<CastOp>::operator() implementation * imgproc(pyramid): remove syloopboundary() * video: SharrDerivInvoker replace 'Mat*' => 'Mat&' fields	5 years ago
Alexander Alekhin	17e2bf5717	core(tls): implement releasing of TLS on thread termination - move TLS & instrumentation code out of core/utility.hpp - () TLSData lost .gather() method (to dispose thread data on thread termination) - use TLSDataAccumulator for reliable collecting of thread data - prefer using of .detachData() + .cleanupDetachedData() instead of .gather() method () API is broken: replace TLSData => TLSDataAccumulator if gather required (objects disposal on threads termination is not available in accumulator mode)	5 years ago
ChipKerchner	c46f119e0e	Convert demosaic functions to HAL	5 years ago
Steve Nicholson	acb3b3bd4d	Add documentation and example program for intersectConvexConvex	5 years ago
jasjuang	4c7db02925	document CC_STAT_MAX in ConnectedComponentsTypes	5 years ago
Everton Constantino	9ca9249992	Merge pull request #15527 from everton1984:faster_acc * Adding support for vectorized masking for uchar/ushort. * Fixing bug where mask was zeroing the dst. Improved the way to calculate the mask and tweaked for further performance improvements. * Fixing mask comparison test. * Restricting to one channel. * Adding support for 3 channels, switch old approach to start using HAL's v_select.	5 years ago
Alexander Alekhin	a007220c52	imgproc: update histogram test	5 years ago
Alexander Alekhin	f301f17b61	imgproc: accurate histogram value thresholding	5 years ago
Alexander Alekhin	c69245da1f	imgproc: fix fitLine() implementation - update optimal solutions on each iteration	5 years ago
Alexander Alekhin	f81e401cd0	imgproc: fix indexing issue in pyramids UBSAN violation expression: 'tab = tabR - x;'	5 years ago
Vitaly Tuzov	1c17b3281a	Fixed OOB reading in pyrDown	5 years ago
Vitaly Tuzov	7b3a752012	Fixed universal intrinsic undistort() implementation	5 years ago
Alexander Alekhin	e7b6753a10	imgproc: avoid manual memory allocation in connectedcomponents.cpp	5 years ago
Everton Constantino	76e403cf25	Merge pull request #15440 from everton1984:new_integral_tests * Adding all possible data type interactions to the perf tests since some use SIMD acceleration and others do not. * Disabling full tests by default. * Giving proper names, removing magic numbers and sanity checks of new performance tests for the integral function. * Giving proper names, making array static.	5 years ago
atinfinity	3b9f981358	removed tegra optimization	5 years ago
Chip Kerchner	26228e6b4d	Merge pull request #15358 from ChipKerchner:imgwarpToHal * Convert ImgWarp from SSE SIMD to HAL - 2.8x faster on Power (VSX) and 15% speedup on x86 * Change compile flag from CV_SIMD128 to CV_SIMD128_64F for use of v_float64x2 type * Changing WarpPerspectiveLine from class functions and dispatching to static functions. * Re-add dynamic runtime and dispatch execution. * RRestore SSE4_1 optimizations inside opt_SSE4_1 namespace	5 years ago
atinfinity	824465ea27	Merge pull request #15388 from atinfinity:impl-turbo-colormap Implementation of colormap "Turbo" (#15388) * implemented turbo colormap * add colormap image * changed float value to avoid cast * sorted flag check alphabetically	5 years ago
Alexander Alekhin	29dbeb253c	build: fix build with ICC	5 years ago
luz.paz	fcc7d8dd4e	Fix modules/ typos Found using `codespell -q 3 -S ./3rdparty -L activ,amin,ang,atleast,childs,dof,endwhile,halfs,hist,iff,nd,od,uint` backporting of commit: `ec43292e1e`	5 years ago
luz.paz	ec43292e1e	Fix modules/ typos Found using `codespell -q 3 -S ./3rdparty -L activ,amin,ang,atleast,childs,dof,endwhile,halfs,hist,iff,nd,od,uint`	5 years ago
Alexander Alekhin	32772a5436	3.4: backported changes from 'master' branch	5 years ago
Maksim Shabunin	6d5ac67681	Restored IPP call reduction	5 years ago
dcouwenh	d3cf0d2c06	Bayer VNG Demosaicing Fix #2 (Merge pull request #15086 ) * Update demosaicing.cpp Fixed calculation of Bs for non-green pixels. * Fixed cvtColor perf test for bayer VNG	5 years ago
Vitaly Tuzov	e0f8bb83a6	Merge pull request #14994 from terfendail:wintr_undistort WUI based implementation to initUndistortRectifyMap (#14994) * Add initUndistortRectifyMap performance test * Move cv namespace boundaries * Add wide universal intrinsics based implementation to initUndistortRectifyMap * Dispatch undistort	5 years ago
Chip Kerchner	c9fcc12e3b	Merge pull request #15048 from ChipKerchner:reduceStoreGatheringThreshold * Reduce store gathering pressures - speeds thresholds by up to 20% * Rename temporary histogram array and initialize so that MACOSX builder is happy	5 years ago
Vitaly Tuzov	894ad33bf4	Fix pixel value evaluation overflow in bit-exact GaussianBlur implementation	5 years ago
Alexander Alekhin	32c6e58bdb	imgproc: fix unaligned memory access may cause crashes on ARM platform	5 years ago
Tomoaki Teshima	594a95839c	fix test failure of OCL_ImgProc/CvtColor8u.mRGBA2RGBA	5 years ago
Vitaly Tuzov	82e5b961d3	Fixed initUndistortRectifyMap AVX2 implementation	5 years ago
arnaudbrejeon	a37201abee	Fix crash, add assert and test	6 years ago
Vitaly Tuzov	9befb7a1d7	Merge pull request #14916 from terfendail:wsignmask_deprecated * Avoid using v_signmask universal intrinsic and mark it as deprecated * Renamed v_find_negative to v_scan_forward	6 years ago
StefanBruens	3e4a195b61	Merge pull request #14936 from StefanBruens:crosscorr_cleanup Crosscorr cleanup (#14936) * Simplify code for convolution destination type/size For the 2d filter code, destination size equals source size, and the crossCorr function even (re-)creates the output matrix with the given size. The number of channels also have to match. The destination type() is the one used to create the output matrix, so we can use its type() here. This is a preparatory patch. Signed-off-by: Stefan Brüns <stefan.bruens@rwth-aachen.de> * Remove redundant destination size and type parameters from crossCorr All calling sites of crossCorr already use (..., mat, mat.size(), mat.type(), ...), so the parameters are redundant. Signed-off-by: Stefan Brüns <stefan.bruens@rwth-aachen.de>	6 years ago
Alexander Alekhin	4a6888ccf6	imgproc: fix kmeans() call from grabCut()	6 years ago
Alexander Alekhin	5ac55fc132	core: eliminate AVX512 build warnings from MSVS2017 and GCC8 -O1 mode	6 years ago

1 2 3 4 5 ...

3051 Commits (f270e8d04030bcb6e99a9905b224e2f7e0c5581f)