FFmpeg

Commit Graph

Author	SHA1	Message	Date
Ganesh Ajjanagadde	55d3e97970	avutil/intmath: use de Bruijn based ff_ctz It has already been demonstrated that the de Bruijn method has benefits over the current implementation: commit `971d12b7f9`. That commit implemented it for long long, this extends it to the int version. Tested with FATE. Signed-off-by: Ganesh Ajjanagadde <gajjanagadde@gmail.com> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	9 years ago
Ronald S. Bultje	93866c2aa2	intmath: remove av_ctz. It's a non-installed header and only used in one place (flacenc). Since ff_ctz is static inline, it's fine to use that instead.	9 years ago
Michael Niedermayer	2a4d1a66e8	avutil/intmath: Change debruijn_ctz64 to use 8bit elements This reduces the memory & cache need from 256 to 64 bytes the code also seems faster with this change Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
Ganesh Ajjanagadde	971d12b7f9	avutil/mathematics: speed up av_gcd by using Stein's binary GCD algorithm This uses Stein's binary GCD algorithm: https://en.wikipedia.org/wiki/Binary_GCD_algorithm to get a roughly 4x speedup over Euclidean GCD on standard architectures with a compiler intrinsic for ctzll, and a roughly 2x speedup otherwise. At the moment, the compiler intrinsic is used on GCC and Clang due to its easy availability. Quick note regarding overflow: yes, subtractions on int64_t can, but the llabs takes care of that. The llabs is also guaranteed to be safe, with no annoying INT64_MIN business since INT64_MIN being a power of 2, is shifted down before being sent to llabs. The binary GCD needs ff_ctzll, an extension of ff_ctz for long long (int64_t). On GCC, this is provided by a built-in. On Microsoft, there is a BitScanForward64 analog of BitScanForward that should work; but I can't confirm. Apparently it is not available on 32 bit builds; so this may or may not work correctly. On Intel, per the documentation there is only an intrinsic for _bit_scan_forward and people have posted on forums regarding _bit_scan_forward64, but often their documentation is woeful. Again, I don't have it, so I can't test. As such, to be safe, for now only the GCC/Clang intrinsic is added, the rest use a compiled version based on the De-Bruijn method of Leiserson et al: http://supertech.csail.mit.edu/papers/debruijn.pdf. Tested with FATE, sample benchmark (x86-64, GCC 5.2.0, Haswell) with a START_TIMER and STOP_TIMER in libavutil/rationsl.c, followed by a make fate. aac-am00_88.err: builtin: 714 decicycles in av_gcd, 4095 runs, 1 skips de-bruijn: 1440 decicycles in av_gcd, 4096 runs, 0 skips previous: 2889 decicycles in av_gcd, 4096 runs, 0 skips Signed-off-by: Ganesh Ajjanagadde <gajjanagadde@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	9 years ago
Timothy Gu	c5d9e9b354	doxygen: Remove lavu_internal group There is no use in an internal group for a public API documentation.	9 years ago
James Almer	78347549a4	avutil/intmath: check for ICC before GCC Intel compiler also defines __GNUC__, so the Intel specific intrinsics were not really being used. Reviewed-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: James Almer <jamrial@gmail.com>	9 years ago
James Almer	bc65abc8d7	libavutil: add x86 optimized av_popcount Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>	10 years ago
Michael Niedermayer	f8607cfb0a	avutil/intmath: Add () to protect the ff_log2() argument Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Matthew Oliver	2060f4cbba	avutil/intmath: enable builtin intrinsics for icl and msvc. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	10 years ago
Reimar Döffinger	1a558cec64	intmath.h: Remove duplicated ARM include. Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>	10 years ago
Justin Ruggles	dfde8a34e5	lavu: add av_ctz() for trailing zero bit count	12 years ago
Mans Rullgard	ebe46b8063	ARM: reinstate optimised intmath.h Use of the ARM optimised intmath.h was accidentally dropped in `9734b8b`. Signed-off-by: Mans Rullgard <mans@mansr.com>	12 years ago
Mans Rullgard	8c0a3d5fe0	avutil: remove inline av_log2 from public API This removes inline av_log2 and av_log2_16bit from the public API, instead exporting them as regular functions. In-tree code still gets the inline and otherwise optimised variants. Signed-off-by: Mans Rullgard <mans@mansr.com>	12 years ago
Diego Biurrun	9734b8ba56	Move avutil tables only used in libavcodec to libavcodec.	12 years ago
Mans Rullgard	5b170c0bea	x86: remove FASTDIV inline asm GCC 4.3 and later do the right thing with the plain C code. Earlier versions in 32-bit mode generate one extra instruction, needlessly zeroing what would be the high half of the shifted value. At least two gcc configurations miscompile the inline asm in some situations. In 64-bit mode, all gcc versions generate imul r64, r64 followed by shr. On Intel i7 and later, this imul is faster 32-bit mul. On older Intel and all AMD, it is slightly slower. On Atom it is much slower. Considering where the FASTDIV macro is used, any overall negative performance impact of this change should be negligible. If anyone cares, they should file a bug against gcc and get the instruction selection fixed. Signed-off-by: Mans Rullgard <mans@mansr.com>	12 years ago
Diego Biurrun	66baa45801	configure: Drop fastdiv option There is no point in having the user disable any fastdiv macros. Besides the condition implementation was broken and only disabled the C implementation, but no platform specific assembly versions.	12 years ago
Luca Barbato	757cd8d876	doxy: provide a start page and document libavutil Introduce a basic layout, the subpages are currently left empty. Split libavutil in multiple groups as example of the structure	13 years ago
Mans Rullgard	2912e87a6c	Replace FFmpeg with Libav in licence headers Signed-off-by: Mans Rullgard <mans@mansr.com>	14 years ago
Måns Rullgård	a955b59658	Remove macro duplication between common.h and intmath.h Originally committed as revision 24086 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	2e874c7704	intmath: whitespace cosmetics Originally committed as revision 24085 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	b90b1b4c3c	Fix build on configurations without fast av_log2() This is a bit hackish. I will try to think of something nicer, but this will do for now. Originally committed as revision 22366 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	94ca624fbc	Move ff_sqrt() to libavutil/intmath.h Originally committed as revision 22345 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	75fb5c24ed	Move FASTDIV macro to intmath.h Originally committed as revision 21335 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Måns Rullgård	544f5a922f	Optimise av_log2 with clz when available 10% faster flac decoding on x86 and ARM. Originally committed as revision 21217 to svn://svn.ffmpeg.org/ffmpeg/trunk	15 years ago
Stefano Sabatini	987903826b	Globally rename the header inclusion guard names. Consistently apply this rule: the guard name is obtained from the filename by stripping the leading "lib", converting '/' and '.' to '_' and uppercasing the resulting name. Guard names in the root directory have to be prefixed by "FFMPEG_". Originally committed as revision 15120 to svn://svn.ffmpeg.org/ffmpeg/trunk	16 years ago
Måns Rullgård	3540b950ec	add missing #include "common.h" to libavutil headers Originally committed as revision 12502 to svn://svn.ffmpeg.org/ffmpeg/trunk	17 years ago
Zuxy Meng	85074d3c93	Reapply r12489: Add pure, const and malloc attributes to proper functions in libavutil. Fix a compilation failure in r12489. Originally committed as revision 12498 to svn://svn.ffmpeg.org/ffmpeg/trunk	17 years ago
Benoit Fouet	2119bb8f51	revert r12489. Originally committed as revision 12490 to svn://svn.ffmpeg.org/ffmpeg/trunk	17 years ago
Zuxy Meng	6544f48f03	Pure, const and malloc attributes to libavutil. Patch by Zuxy Meng: zuxy meng gmail com Original thread: [FFmpeg-devel] [PATCH] Pure, const and malloc attributes to libavutil Date: 03/18/2008 6:09 AM Originally committed as revision 12489 to svn://svn.ffmpeg.org/ffmpeg/trunk	17 years ago
Diego Biurrun	5b21bdabe4	Add FFMPEG_ prefix to all multiple inclusion guards. Originally committed as revision 10765 to svn://svn.ffmpeg.org/ffmpeg/trunk	17 years ago
Måns Rullgård	99545457bf	include all prerequisites in header files Originally committed as revision 9344 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Diego Biurrun	b78e7197a8	Change license headers to say 'FFmpeg' instead of 'this program/this library' and fix GPL/LGPL version mismatches. Originally committed as revision 6577 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Diego Biurrun	04d7f60143	Add official LGPL license headers to the files that were missing them. Originally committed as revision 6219 to svn://svn.ffmpeg.org/ffmpeg/trunk	18 years ago
Måns Rullgård	b9a73d8d2f	move adler32 to libavutil Originally committed as revision 5731 to svn://svn.ffmpeg.org/ffmpeg/trunk	19 years ago

30 Commits (216cc1f6fe33b256ce708fade5e6638b2bb54d2b)