465 Commits (c369ddb720bfb7b8f1780b875067b900285ebfc2)

Author SHA1 Message Date
Ramiro Polla c6a908be58 dct32: mark xmm registers in clobber list in ff_dct32_float_sse() 14 years ago
Ramiro Polla b32c9ca9a3 h264dsp: merge some asm blocks 14 years ago
Reimar Döffinger 6c2142809c Add d modifier to asm argument to fix nasm compilation. 14 years ago
Ramiro Polla 326bf69acc fft: mark xmm registers as clobbered in ff_imdct_calc_sse 14 years ago
Ronald S. Bultje dd68d4db43 MMX, MMX2, SSE2 and SSSE3 optimizations for pred16x16/8x8_plane H264 intra 14 years ago
İsmail Dönmez 9276bdddca snowdsp: Explicitly state the operand sizes 14 years ago
Ronald S. Bultje a52ffc3f54 Move static inline function to a macro, so that constant propagation in 14 years ago
Eli Friedman 329d689f75 Use sse2 variant of put_pixels16() for no_rnd also. Provides a minor speed 14 years ago
Ronald S. Bultje cd17285e6c Merge b_idx and edge variables, and optimize the ASM to directly load variables 14 years ago
Ronald S. Bultje 0cc8a5d088 Remove mv_mask variable. Replace the related pand -1/0 instructions by either 14 years ago
Ronald S. Bultje c0673f2cf4 Remove d_idx as a variable, and instead load it as a constant in the asm. 14 years ago
Ronald S. Bultje 2c3135f6d3 Unroll inner bidir loop in h264_loop_filter_strength_mmx2(), which gets rid 14 years ago
Ronald S. Bultje 4b81511cab Unloop the outer loop in h264_loop_filter_strength_mmx2(), which allows 14 years ago
Reimar Döffinger 02b424d9c8 Add d suffix to movd target register to make it work with nasm. 14 years ago
Reimar Döffinger dc77e985b7 Split and then simplify address generation macro. 14 years ago
Ronald S. Bultje 7e117771cd Remove unused variable. 14 years ago
Ronald S. Bultje ae11291865 Unroll loop in h264_idct_add16intra_sse2(). Basically identical to r25171, this 14 years ago
Ronald S. Bultje 4bca677494 Unroll loop in h264_idct_add8_sse2(). This means we can inline scan8[] in the 14 years ago
Måns Rullgård c0bc8b9afb x86: disable SSE functions using stack when stack is not aligned 14 years ago
Måns Rullgård f41237c9db x86: remove hack disabling sse2 h264 loop filter with 32-bit icc 14 years ago
Ronald S. Bultje ada65af9d1 Don't access upper 32 bits of a 32-bit int on 64-bit systems. 14 years ago
Ronald S. Bultje 6c3d021891 Properly add HAVE_YASM around yasmified symbols. Should fix compile error 14 years ago
Ronald S. Bultje e2e341048e Move hadamard_diff{,16}_{mmx,mmx2,sse2,ssse3}() from inline asm to yasm, 14 years ago
Ronald S. Bultje d0acc2d2e9 Move sse16_sse2() from inline asm to yasm. It is one of the functions causing 14 years ago
Ronald S. Bultje 1d16a1cf99 Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from 14 years ago
Jason Garrett-Glaser 8acb554aff LGPL SSE2 H.264 iDCT 14 years ago
Stefano Sabatini c6c98d0897 Move mm_support() from libavcodec to libavutil, make it a public 14 years ago
Reimar Döffinger b1c32fb5e5 Use "d" suffix for general-purpose registers used with movd. 14 years ago
Stefano Sabatini 7160bb716b Rename FF_MM_ symbols related to CPU features flags as AV_CPU_FLAG_ 14 years ago
Ronald S. Bultje 2c166c3af1 Port latest x264 deblock asm (before they moved to using NV12 as internal 14 years ago
Eli Friedman a10a9f5cd0 Fix typo in r25019. 14 years ago
Ronald S. Bultje 615da9b1d9 Unscrew breakage after my last commit because of symbol prefixes. 14 years ago
Ronald S. Bultje a33a2562c1 Rename h264_weight_sse2.asm to h264_weight.asm; add 16x8/8x16/8x4 non-square 14 years ago
Ronald S. Bultje 14bc1f2485 Split h264dsp_mmx.c (which was #included in dsputil_mmx.c) in h264_qpel_mmx.c, 14 years ago
Ronald S. Bultje 5929b3a651 Fix vertical align. 14 years ago
Ronald S. Bultje 79ce0f002e Fix compilation failure if yasm is disabled (missing vp3 symbols). 14 years ago
Ronald S. Bultje de1c253bab Split intra prediction initialization (i.e. assigning of function pointers) 14 years ago
Ronald S. Bultje d0eb5a1174 Move H264 chroma MC from inline asm to yasm. This fixes VP3/5/6 and VC-1 14 years ago
Ronald S. Bultje e9f5f020c6 Move VP3 IDCT functions from inline ASM to YASM. This fixes part of the VP3/5/6 14 years ago
Ronald S. Bultje 7e7c4b6008 Put ff_ prefix on non-static {put_signed,put,add}_pixels_clamped_mmx() 14 years ago
Loren Merritt 19d929f9a3 cosmetics in imdct_sse 14 years ago
Ronald S. Bultje 4eca52ed19 Fix typos when converting inline asm to yasm, fixes MMX-only fate-ea-vp61. 14 years ago
Ronald S. Bultje 6697bc33e2 Revert r24931, it broke Win32 and some BSD compiles (yay fate). 14 years ago
Ronald S. Bultje 72f642400b Mark xmm6 and xmm7 as clobbered in ff_vp3_idct_sse2(), which is contributing 14 years ago
Måns Rullgård 69dad87c48 VP6: fix vp6_filter_diag4_mmx/sse on 64-bit 14 years ago
Ronald S. Bultje 89fa3504ed Move vp6_filter_diag4() x86 SIMD code from inline ASM to YASM. This should 14 years ago
Ronald S. Bultje 3a0885146c Move vp6_filter_diag4() from DSPContext to VP56DSPContext. 14 years ago
Måns Rullgård c0ec9918b0 Remove global mm_flags variable 14 years ago
Ronald S. Bultje 3611c45ab7 Mark xmm registers as clobbered in simple loopfilter. Should fix the last 14 years ago
Alex Converse cb4f12466b imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits". 14 years ago