opencv

Commit Graph

Author	SHA1	Message	Date
Andrew Ryrie	ea7d4be3f8	Merge pull request #20658 from smbz:lstm_optimisation * dnn: LSTM optimisation This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm. fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications: - Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned. - Allow for weight matrices where the number of columns is not a multiple of 8. I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on. * Fix warning about initialisation order * Remove C++11 syntax * Fix build when AVX(2) is not available In this case the CV_TRY_X macros are defined to 0, rather than being undefined. * Minor changes as requested: - Don't check hardware support for AVX(2) when dispatch is disabled for these - Add braces * Fix out-of-bounds access in fully connected layer The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway). This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems. * Improve tail mask handling - Use static array for generating tail masks (as requested) - Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs * Revert whitespace change * Improve readability of conditions for using AVX * dnn(lstm): minor coding style changes, replaced left aligned load	3 years ago
Smirnov Egor	05db8784ae	fix Clip, LeakyReLU, LRN, Split defaults	3 years ago
Alexander Alekhin	58b06222ff	dnn(DataLayer): fix CPU/OpenCL code paths for FP16 handling	3 years ago
Alexander Alekhin	58dc397930	dnn(test): add two_inputs test with FP32/U8 data types - remove similar test from IE scope under HAVE_INF_ENGINE	3 years ago
yuki takehara	a6277370ca	Merge pull request #21107 from take1014:remove_assert_21038 resolves #21038 * remove C assert * revert C header * fix several points in review * fix test_ds.cpp	3 years ago
Alexander Alekhin	985aa0423d	dnn(test): update InferenceEngine tests	3 years ago
Christian Clauss	ebe4ca6b60	Fix typos discovered by codespell	3 years ago
Christian Clauss	cdbb042ce4	Use print() function in both Python 2 and Python 3	3 years ago
Alexander Alekhin	61f1ee2d2d	core(logger): dump timestamp information with message	3 years ago
Vincent Rabaud	d4741eece1	Fix H clamping for very small negative values. In case of very small negative h (e.g. -1e-40), with the current implementation, you will go through the first condition and end up with h = 6.f, and will miss the second condition.	3 years ago
nickjackolson	b696928a5b	add !empty assertion in seamlessClone() issue #20617 addresses lack of warnings on seamlessClone() function when src is None. This commit adds source check using CV_Assert therefore debugging would be easier. Signed-off-by: nickjackolson <metedurlu@gmail.com>	3 years ago
nickjackolson	79d4e865fe	Add warning message to imread() Add a warning message using CV_LOG__WARNING(). This way api behaviour is preserved. Outputs are the same but user gets an extra warning in case fopen() fails to access image file for some reason. This would help new users and also debugging complex apps which use imread() Signed-off-by: nickjackolson <metedurlu@gmail.com>	3 years ago
Alexander Alekhin	de7f8eec04	js(test): pin cli-table dependency	3 years ago
Alexander Alekhin	473f10877c	doc(videoio): fix apiPreference note, replace DSHOW(deprecated)->MSMF	3 years ago
Qiushi Zheng	3e51448ef0	Merge pull request #17889 from ZhengQiushi:my_3.4 QR code (encoding process) * add qrcode encoder * qr encoder fixes * qr encoder: fix api and realization * fixed qr encoder, added eci and kanji modes * trigger CI * qr encoder constructor fixes Co-authored-by: APrigarina <ann73617@gmail.com>	3 years ago
Alexander Alekhin	8041ab8a61	Merge pull request #21025 from alalek:issue_21004 * dnn(ocl4dnn): fix LRN layer accuracy problems - FP16 intermediate computation is not accurate and may provide NaN values * dnn(test): update tolerance for FP16	3 years ago
tv3141	cb286a66be	Merge pull request #21030 from tv3141:fix_seg_fault_houghlinespointset Fix seg fault houghlinespointset * Clarify parameter doc for HoughLinesPointSet * Fix seg fault. * Add regression test. * Fix latex typo	3 years ago
Piotr Kubaj	68e425f869	Add support for runtime CPU feature check on POWER on FreeBSD. 1. Code uses PPC_FEATURE_HAS_VSX, but it's not checked similarly to PPC_FEATURE2_ARCH_3_00 and PPC_FEATURE2_ARCH_3_00 for availability. FreeBSD has those macros in machine/cpu.h, but I went with the way chosen for PPC_FEATURE2_ARCH_3_00 and PPC_FEATURE2_ARCH_3_00. Other than that, FreeBSD also has sys/auxv.h and that's where elf_aux_info() is defined. 2. getauxval() is actually Linux-only, but code checked for __unix__. It won't work on all UNIX, so change it back to __linux__. Add another code variant strictly for FreeBSD. 3. Update comment. This commit adds code for FreeBSD, but recently there appeared support for powerpc64 in OpenBSD.	3 years ago
ZaKiiiiiiiii	98b6ce353c	Merge pull request #20904 from Crayon-new:fix_bug_in_maxLayer fix bug: wrong output dimension when "keep_dims" is false in pooling layer. * fix bug in max layer * code align * delete permute layer and add test case * add name assert * check other cases * remove c++11 features * style:add "const" remove assert * style:sanitize file names	3 years ago
Vincent Rabaud	ffd010767f	Only use fma functions when CV_FMA3 is set. In practice, processors offering AVX2/AVX512 also FMA, that is why it got unnoticed.	3 years ago
Alexander Alekhin	c1d61c88e9	dnn(cmake): don't hijack OpenCL options with Tengine	3 years ago
APrigarina	8e72e1ed88	add test case for QR detect fix	3 years ago
cpengu	66dd871288	Update qrcode.cpp Fixed issue #20880, QRDetect::searchHorizontalLines() boundary condition will skip the matched qrcode near the end	3 years ago
Alexander Alekhin	d484939c02	Merge pull request #20999 from alalek:dnn_replace_deprecated_calls dnn(protobuf): replace deprecated calls * dnn: replace deprecated ByteSize() => ByteSizeLong() * dnn: replace deprecated calls, use GetRepeatedFieldRef	3 years ago
Alexander Alekhin	b3e16c6423	videoio(dshow): eliminate build warnings from MSVC-Clang	3 years ago
Souriya Trinh	30d6766db4	Add conventional Bayer naming.	3 years ago
Alexander Alekhin	0ee61d178f	highgui: drop invalid cvGetWindowImageRect - return type is C++ template - removal from 'extern "C"' scope broke ABI anyway, so this symbols is removed completelly	3 years ago
Alexander Alekhin	a49cda6523	core: eliminate Winvalid-noreturn in base.hpp	3 years ago
Alexander Alekhin	d612c72405	build: fix MSVC-Clang warnings about unused parameters in stubs	3 years ago
Alexander Alekhin	75e2ba5af3	core(simd): fix compilation with MSVC-Clang	3 years ago
Alexander Alekhin	1726bb6c0d	build(icc): fix nodiscard attribute handling	3 years ago
Noah Stier	84a81579ba	tvl1 cuda optflow optimization	3 years ago
berak	a6f5717567	resolves #20913 imgproc: remove asserts for circles_ in HoughCircles	3 years ago
AleksandrPanov	d21622bef4	fix findMinEnclosingTriangle and add tests	3 years ago
Harvey	ce68291d83	32bit rgb bmp file should not copy data as rgba	3 years ago
Zhuo Zhang	7da51787b9	Merge pull request #20900 from zchrissirhcz:3.4-hwfeatures-support-qnx * fix: correctly check neon flags for QNX platform * refactor: change __QNXNTO__ to __QNX__	3 years ago
rogday	b3f966e2ca	Merge pull request #20883 from rogday:eltwise_refactoring * backport elementwise_layers refactor * keep NULL	3 years ago
Michel Promonet	9a9e457dd6	Allow to set av_log_set_level to reduce ffmpeg level below AV_LOG_ERROR	3 years ago
Alexander Alekhin	b5fcb06a76	core(SIMD): update int64 SSE constructor	3 years ago
Sergiu Deitsch	f8f9f3c438	fixed AVX compile error Some older compilers do not allow to pass a `const int` as an immediate. Use an unnamed enum instead.	3 years ago
Wehzie	f9e747dbc6	Fixed typo in CV_Error message Error was "Input parameters must be a matrices!", but "matrices" is plural and doesn't allow the unspecific article "a".	3 years ago
Nicholas Ho	bd0732b1d0	Merge pull request #20740 from Nicholas-Ho-arm:3.4_SymmColumnVec_32f8u * Add SymmColumnVec_32f8u * Fix double to float warnings	3 years ago
Alexander Alekhin	982503e9a8	core: ensure 'int' result from CV_POPCNT_U64(x)	3 years ago
Stanislaw Halik	3d93675ff9	fix link error on shared libs with -MT	3 years ago
Smirnov Egor	238dbffb48	change asserts for Sum	3 years ago
Smirnov Egor	a9d7b6eab7	fix const - input and remove unimplemented function	3 years ago
Alexander Alekhin	b1cf550123	release: OpenCV 3.4.16	3 years ago
Alexander Alekhin	4985311d46	core(tls): avoid destruction of TlsAbstraction singleton	3 years ago
Jonas Vautherin	9537a909f7	Merge pull request #20801 from JonasVautherin:fix-gst-error-handling * Fix gst error handling * Use the return value instead of the error, which gives no guarantee of being NULL in case of error * Test err pointer before accessing it * Remove unreachable code * videoio(gstreamer): restore check in writer code Co-authored-by: Alexander Alekhin <alexander.a.alekhin@gmail.com>	3 years ago
Alexander Alekhin	8c2dd5fb9a	dnn(ocl4dnn): cleanup dead code, improve logging	3 years ago

1 2 3 4 5 ...

20346 Commits (0d2857a2425e08eb0296001d27dbf3da0de5bbf8)