536 Commits (2831837182fe26f0a19a4d366f3f0553311f1291)

Author SHA1 Message Date
Lynne 27cffd16aa
x86/tx_float: replace fft_sr_avx with fft_sr_fma3 3 years ago
Lynne 0938ff9701
x86/tx_float: improve temporary register allocation for loads 3 years ago
Lynne 19c0bb2aa9
x86/tx_float: add AV_CPU_FLAG_AVXSLOW/SLOW_GATHER flags where appropriate 3 years ago
Lynne 9e94c35941
Revert "x86/tx_float: remove vgatherdpd usage" 3 years ago
Lynne 82a68a8771
x86/tx_float: remove vgatherdpd usage 3 years ago
Wu Jianhua f629ea2e18 avutil/cpu: add AVX512 Icelake flag 3 years ago
Andreas Rheinhardt 636631d9db Remove unnecessary libavutil/(avutil|common|internal).h inclusions 3 years ago
Andreas Rheinhardt 6c694074e1 avutil/x86/emms: Don't unnecessarily include lavu/cpu.h 3 years ago
Alexander Kanavin 91326dc942 libavutil: include assembly with full path from source root 3 years ago
Lynne 3bbe9c5e38
lavu/tx: refactor assembly codelet definition 3 years ago
Lynne 2e82c61055
x86/tx_float: avoid redefining macros 3 years ago
Lynne 35080149ef
x86/tx_float: mark AVX2 functions as AVXSLOW 3 years ago
Lynne 6c397f6bb5
x86/tx_float: add missing FF_TX_OUT_OF_PLACE flag to functions 3 years ago
Lynne 9787005846
x86/tx_float: do not build tx_float_init.c if x86 assembly is disabled 3 years ago
Lynne 28bff6ae54
x86/tx_float: add permute-free FFT versions 3 years ago
Lynne ef4bd81615
lavu/tx: rewrite internal code as a tree-based codelet constructor 3 years ago
James Almer 8c2d2fd6cc avutil/cpu: move slow gather checks below in the function 3 years ago
Alan Kelly ffbab99f2c libavutil/cpu: Add AV_CPU_FLAG_SLOW_GATHER. 3 years ago
James Almer 67b92d68c6 x86/intmath: add VEX encoded versions of av_clipf() and av_clipd() 3 years ago
Mark Reid c3502f4f75 libavutil/common: clip nan value to amin 3 years ago
Lynne 997f9bdb99
x86/tx_float: correctly load the transform length 3 years ago
James Almer 7a6ea6ce2a x86/tx_float: remove ff_ prefix from external constant tables 4 years ago
Lynne bb40f800bd
x86/tx_float: fix forgotten 2-argument mulps 4 years ago
Lynne e2cf0a1f68
x86/tx_float: use all arguments on vperm2f and vpermilps and reindent comments 4 years ago
James Almer fddddc7ec2 x86/tx_float: Fixes compilation with old yasm 4 years ago
Lynne e448a4b4ea
lavu/x86/tx_float: fix FMA3 implying AVX2 is available 4 years ago
Lynne 119a3f7e8d
lavu/x86: add FFT assembly 4 years ago
Andreas Rheinhardt f3c197b129 Include attributes.h directly 4 years ago
Henrik Gramner 0b2b03568f avutil/x86inc: fix warnings when assembling with Nasm 2.15 4 years ago
Martin Storsjö 1001b6a750 libavutil: x86: Include stdlib.h before using _byteswap_ulong 5 years ago
James Almer 9d002d7818 x86/float_dsp: add ff_vector_dmul_{sse2,avx} 6 years ago
James Almer 481741ece0 x86/pixelutils: don't use the AVX2 functions on CPUs known to be slow with them 6 years ago
James Almer d5b3077ecf x86/pixelutils: add missing preprocessor wrapper to the AVX2 functions 6 years ago
Jun Zhao d36b8394f4 avutil/pixelutils: sad_32x32 sse2/avx2 optimizations. 6 years ago
alexander schmid b23c4a9dbd lavu/x86/cpu: Fix aesni detection 6 years ago
Jun Zhao 09628cb1b4 avutil/pixelutils: correct the function name in comments 6 years ago
Diego Biurrun 4cf84e254a Drop some unnecessary config.h #includes 7 years ago
Henrik Gramner 6f62b0bd4f x86inc: Drop cpuflags_slowctz 7 years ago
Henrik Gramner eb5f063e7c x86inc: Correctly set mmreg variables 7 years ago
Henrik Gramner 6b6edd1216 x86inc: Support creating global symbols from local labels 7 years ago
Henrik Gramner 9e4b3675f2 x86inc: Use .rdata instead of .rodata on Windows 7 years ago
Henrik Gramner 3a02cbe3fa x86inc: Enable AVX emulation for floating-point pseudo-instructions 7 years ago
James Almer 90d216cb90 x86inc: set the correct amount of simd regs in x86_64 when avx512 is enabled but not used 7 years ago
Henrik Gramner f7197f68dc x86inc: AVX-512 support 7 years ago
James Darnley e2218ed8ce avutil: add alignment needed for AVX-512 7 years ago
James Darnley 4783a01c11 avutil: detect when AVX-512 is available 7 years ago
James Darnley 8b81eabe57 avutil: add AVX-512 flags 7 years ago
Martin Vignali b37196adff avutil/x86util : add macro for loading a 128 bits constants in an xmm or in each part of an ymm in order to simplify avx2 asm func 7 years ago
Dale Curtis 50e30d9bb7 Don't use _tzcnt instrinics with clang for windows w/o BMI. 7 years ago
James Almer 3d828c9fd5 cpu: split flag checks per arch in av_cpu_max_align() 7 years ago