565 Commits (2ad468ed1f7a7c67f1a2ab37c9e304b0011f0aae)

Author SHA1 Message Date
Lynne bbe95f7353 x86: replace explicit REP_RETs with RETs 2 years ago
Lynne 90c17a05aa x86/tx_float: fix stray change in 15xM FFT and replace imul->lea 2 years ago
Lynne 87bae6b018 lavu/tx: refactor to explicitly track and convert lookup table order 2 years ago
Lynne fab97faf02 x86/tx_float: implement striding in fft_15xM 2 years ago
Lynne 92100eee5b x86/tx_float_init: properly specify the supported factors of 15xM FFTs 2 years ago
Lynne cc1df4045e x86/tx_float: add a standalone 15-point AVX2 transform 2 years ago
Lynne 877e575b5d x86/tx_float: optimize and macro out FFT15 2 years ago
Johannes Kauffmann a11e745b97 lavu/fixed_dsp: add missing av_restrict qualifiers 2 years ago
Lynne f21899db7d x86/tx_float: enable AVX-only split-radix FFT codelets 2 years ago
James Almer d2f482965f x86/tx_float: fix some symbol names 2 years ago
James Almer 0d8f43c74d x86/tx_float: change a condition in a preprocessor check 2 years ago
James Almer 750f378bec x86/tx_float: add missing preprocessor wrapper for AVX2 functions 2 years ago
Lynne 74e8541bab x86/tx_float: generalize iMDCT 2 years ago
Lynne ace42cf581 x86/tx_float: add 15xN PFA FFT AVX SIMD 2 years ago
Lynne 3241e9225c x86/tx_float: adjust internal ASM call ABI again 2 years ago
Lynne 4ba68639ca x86/tx_float: add asm call versions of the 2pt and 4pt transforms 2 years ago
Lynne 892548e6a1 x86/tx_float: fully support 128bit regs in LOAD64_LUT 2 years ago
Lynne af42bb3d61 x86/tx_float: simplify and describe the intra-asm call convention 2 years ago
James Almer bda3a9faf4 x86/float_dsp: use three operand form for some instructions 2 years ago
Paul B Mahol 72acff9f59 avutil/x86/float_dsp: add fma3 for scalarproduct 2 years ago
Andreas Rheinhardt 29c4c0886d avutil/x86/intreadwrite: Add ability to detect whether MMX code is used 2 years ago
James Almer f4097e4c1f x86/tx_float: add missing check for AVX2 2 years ago
James Almer 74f5fb6db8 x86/tx_float: set all operands for shufps 2 years ago
Martin Storsjö e4759fa951 x86/tx_float: Fix building for platforms with a symbol prefix 2 years ago
Lynne 4537d9554d x86/tx_float: implement inverse MDCT AVX2 assembly 2 years ago
Lynne 2425d5cd7e x86/tx_float: add support for calling assembly functions from assembly 2 years ago
Lynne 98b32ef462 x86/tx_float: save a branch during coefficient deinterleaving 2 years ago
Andreas Rheinhardt 2718a3be1f avutil/x86/float_dsp: Remove obsolete 3dnowext function 2 years ago
Andreas Rheinhardt ea043cc53e avutil/x86/pixelutils: Remove obsolete MMX(EXT) functions 2 years ago
Lynne 27cffd16aa x86/tx_float: replace fft_sr_avx with fft_sr_fma3 3 years ago
Lynne 0938ff9701 x86/tx_float: improve temporary register allocation for loads 3 years ago
Lynne 19c0bb2aa9 x86/tx_float: add AV_CPU_FLAG_AVXSLOW/SLOW_GATHER flags where appropriate 3 years ago
Lynne 9e94c35941 Revert "x86/tx_float: remove vgatherdpd usage" 3 years ago
Lynne 82a68a8771 x86/tx_float: remove vgatherdpd usage 3 years ago
Wu Jianhua f629ea2e18 avutil/cpu: add AVX512 Icelake flag 3 years ago
Andreas Rheinhardt 636631d9db Remove unnecessary libavutil/(avutil|common|internal).h inclusions 3 years ago
Andreas Rheinhardt 6c694074e1 avutil/x86/emms: Don't unnecessarily include lavu/cpu.h 3 years ago
Alexander Kanavin 91326dc942 libavutil: include assembly with full path from source root 3 years ago
Lynne 3bbe9c5e38 lavu/tx: refactor assembly codelet definition 3 years ago
Lynne 2e82c61055 x86/tx_float: avoid redefining macros 3 years ago
Lynne 35080149ef x86/tx_float: mark AVX2 functions as AVXSLOW 3 years ago
Lynne 6c397f6bb5 x86/tx_float: add missing FF_TX_OUT_OF_PLACE flag to functions 3 years ago
Lynne 9787005846 x86/tx_float: do not build tx_float_init.c if x86 assembly is disabled 3 years ago
Lynne 28bff6ae54 x86/tx_float: add permute-free FFT versions 3 years ago
Lynne ef4bd81615 lavu/tx: rewrite internal code as a tree-based codelet constructor 3 years ago
James Almer 8c2d2fd6cc avutil/cpu: move slow gather checks below in the function 3 years ago
Alan Kelly ffbab99f2c libavutil/cpu: Add AV_CPU_FLAG_SLOW_GATHER. 3 years ago
James Almer 67b92d68c6 x86/intmath: add VEX encoded versions of av_clipf() and av_clipd() 3 years ago
Mark Reid c3502f4f75 libavutil/common: clip nan value to amin 3 years ago
Lynne 997f9bdb99 x86/tx_float: correctly load the transform length 3 years ago