36 Commits (d89930f8666c563a96f1dc1835b450522b4cd6da)

Author SHA1 Message Date
Lynne bd3e71b21e
x86/tx_float: enable SIMD for sizes over 131072 11 months ago
Lynne 90c17a05aa
x86/tx_float: fix stray change in 15xM FFT and replace imul->lea 2 years ago
Lynne fab97faf02
x86/tx_float: implement striding in fft_15xM 2 years ago
Lynne cc1df4045e
x86/tx_float: add a standalone 15-point AVX2 transform 2 years ago
Lynne 877e575b5d
x86/tx_float: optimize and macro out FFT15 2 years ago
Lynne f21899db7d
x86/tx_float: enable AVX-only split-radix FFT codelets 2 years ago
James Almer d2f482965f x86/tx_float: fix some symbol names 2 years ago
James Almer 0d8f43c74d x86/tx_float: change a condition in a preprocessor check 2 years ago
James Almer 750f378bec x86/tx_float: add missing preprocessor wrapper for AVX2 functions 2 years ago
Lynne 74e8541bab
x86/tx_float: generalize iMDCT 2 years ago
Lynne ace42cf581
x86/tx_float: add 15xN PFA FFT AVX SIMD 2 years ago
Lynne 3241e9225c
x86/tx_float: adjust internal ASM call ABI again 2 years ago
Lynne 4ba68639ca
x86/tx_float: add asm call versions of the 2pt and 4pt transforms 2 years ago
Lynne 892548e6a1
x86/tx_float: fully support 128bit regs in LOAD64_LUT 2 years ago
Lynne af42bb3d61
x86/tx_float: simplify and describe the intra-asm call convention 2 years ago
James Almer f4097e4c1f x86/tx_float: add missing check for AVX2 2 years ago
James Almer 74f5fb6db8 x86/tx_float: set all operands for shufps 2 years ago
Martin Storsjö e4759fa951 x86/tx_float: Fix building for platforms with a symbol prefix 2 years ago
Lynne 4537d9554d
x86/tx_float: implement inverse MDCT AVX2 assembly 2 years ago
Lynne 2425d5cd7e
x86/tx_float: add support for calling assembly functions from assembly 2 years ago
Lynne 98b32ef462
x86/tx_float: save a branch during coefficient deinterleaving 2 years ago
Lynne 27cffd16aa
x86/tx_float: replace fft_sr_avx with fft_sr_fma3 3 years ago
Lynne 0938ff9701
x86/tx_float: improve temporary register allocation for loads 3 years ago
Lynne 9e94c35941
Revert "x86/tx_float: remove vgatherdpd usage" 3 years ago
Lynne 82a68a8771
x86/tx_float: remove vgatherdpd usage 3 years ago
Alexander Kanavin 91326dc942 libavutil: include assembly with full path from source root 3 years ago
Lynne 2e82c61055
x86/tx_float: avoid redefining macros 3 years ago
Lynne 28bff6ae54
x86/tx_float: add permute-free FFT versions 3 years ago
Lynne ef4bd81615
lavu/tx: rewrite internal code as a tree-based codelet constructor 3 years ago
Lynne 997f9bdb99
x86/tx_float: correctly load the transform length 4 years ago
James Almer 7a6ea6ce2a x86/tx_float: remove ff_ prefix from external constant tables 4 years ago
Lynne bb40f800bd
x86/tx_float: fix forgotten 2-argument mulps 4 years ago
Lynne e2cf0a1f68
x86/tx_float: use all arguments on vperm2f and vpermilps and reindent comments 4 years ago
James Almer fddddc7ec2 x86/tx_float: Fixes compilation with old yasm 4 years ago
Lynne e448a4b4ea
lavu/x86/tx_float: fix FMA3 implying AVX2 is available 4 years ago
Lynne 119a3f7e8d
lavu/x86: add FFT assembly 4 years ago