257 Commits (6b133d7e9c767c7d0bc39f2b301afd17c491c178)

Author SHA1 Message Date
Ramiro Polla 4f9d25ddc8 dnxhd_mmx: prefer xmm registers below xmm6 when they are available 14 years ago
İsmail Dönmez 80e33d2451 dsputil: Use explicit movzbl instead of movzx 14 years ago
Ramiro Polla a4ece893e1 lpc_mmx: add xmm registers to clobber list 14 years ago
Ramiro Polla e5d5407e26 lpc_mmx: merge some asm blocks 14 years ago
Ramiro Polla eed299b897 sad16_sse2: merge 2 asm blocks 14 years ago
Ramiro Polla 153ca56b38 xmm_clobbers: list xmm registers first in clobber list 14 years ago
Ramiro Polla ba40452095 idct_sse2_xvid: only mark xmm>=8 as clobbered on x86_64 14 years ago
Ramiro Polla 05c018078c motion_est_mmx: prefer xmm registers below xmm6 when they are available 14 years ago
Ramiro Polla 5d543a3d13 dsputil_mmx: add xmm registers to clobber list 14 years ago
Ramiro Polla e2d13c5882 cosmetics: split long line 14 years ago
Ramiro Polla 0d729e0de2 fdct_mmx: add xmm registers to clobber list 14 years ago
Ramiro Polla 616735eb97 idct_sse2_xvid: add xmm registers to clobber list 14 years ago
Ramiro Polla 9943f3b91c mpegvideo_mmx: add xmm registers to clobber list 14 years ago
Ramiro Polla 559738eff3 dsputil_mmx: prefer xmm registers below xmm6 when they are available 14 years ago
Ramiro Polla 51d592dbcb h264dsp: add xmm registers to clobber list 14 years ago
Ramiro Polla ac19f4a3e8 indent 14 years ago
Ramiro Polla cae05859e1 h264dsp: merge some more asm blocks 14 years ago
Ramiro Polla c6a908be58 dct32: mark xmm registers in clobber list in ff_dct32_float_sse() 14 years ago
Ramiro Polla b32c9ca9a3 h264dsp: merge some asm blocks 14 years ago
Reimar Döffinger 6c2142809c Add d modifier to asm argument to fix nasm compilation. 14 years ago
Ramiro Polla 326bf69acc fft: mark xmm registers as clobbered in ff_imdct_calc_sse 14 years ago
Ronald S. Bultje dd68d4db43 MMX, MMX2, SSE2 and SSSE3 optimizations for pred16x16/8x8_plane H264 intra 14 years ago
İsmail Dönmez 9276bdddca snowdsp: Explicitly state the operand sizes 14 years ago
Ronald S. Bultje a52ffc3f54 Move static inline function to a macro, so that constant propagation in 14 years ago
Eli Friedman 329d689f75 Use sse2 variant of put_pixels16() for no_rnd also. Provides a minor speed 14 years ago
Ronald S. Bultje cd17285e6c Merge b_idx and edge variables, and optimize the ASM to directly load variables 14 years ago
Ronald S. Bultje 0cc8a5d088 Remove mv_mask variable. Replace the related pand -1/0 instructions by either 14 years ago
Ronald S. Bultje c0673f2cf4 Remove d_idx as a variable, and instead load it as a constant in the asm. 14 years ago
Ronald S. Bultje 2c3135f6d3 Unroll inner bidir loop in h264_loop_filter_strength_mmx2(), which gets rid 14 years ago
Ronald S. Bultje 4b81511cab Unloop the outer loop in h264_loop_filter_strength_mmx2(), which allows 14 years ago
Reimar Döffinger 02b424d9c8 Add d suffix to movd target register to make it work with nasm. 14 years ago
Reimar Döffinger dc77e985b7 Split and then simplify address generation macro. 14 years ago
Ronald S. Bultje 7e117771cd Remove unused variable. 14 years ago
Ronald S. Bultje ae11291865 Unroll loop in h264_idct_add16intra_sse2(). Basically identical to r25171, this 14 years ago
Ronald S. Bultje 4bca677494 Unroll loop in h264_idct_add8_sse2(). This means we can inline scan8[] in the 14 years ago
Måns Rullgård c0bc8b9afb x86: disable SSE functions using stack when stack is not aligned 14 years ago
Måns Rullgård f41237c9db x86: remove hack disabling sse2 h264 loop filter with 32-bit icc 14 years ago
Ronald S. Bultje ada65af9d1 Don't access upper 32 bits of a 32-bit int on 64-bit systems. 14 years ago
Ronald S. Bultje 6c3d021891 Properly add HAVE_YASM around yasmified symbols. Should fix compile error 14 years ago
Ronald S. Bultje e2e341048e Move hadamard_diff{,16}_{mmx,mmx2,sse2,ssse3}() from inline asm to yasm, 14 years ago
Ronald S. Bultje d0acc2d2e9 Move sse16_sse2() from inline asm to yasm. It is one of the functions causing 14 years ago
Ronald S. Bultje 1d16a1cf99 Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from 14 years ago
Jason Garrett-Glaser 8acb554aff LGPL SSE2 H.264 iDCT 15 years ago
Stefano Sabatini c6c98d0897 Move mm_support() from libavcodec to libavutil, make it a public 15 years ago
Reimar Döffinger b1c32fb5e5 Use "d" suffix for general-purpose registers used with movd. 15 years ago
Stefano Sabatini 7160bb716b Rename FF_MM_ symbols related to CPU features flags as AV_CPU_FLAG_ 15 years ago
Ronald S. Bultje 2c166c3af1 Port latest x264 deblock asm (before they moved to using NV12 as internal 15 years ago
Eli Friedman a10a9f5cd0 Fix typo in r25019. 15 years ago
Ronald S. Bultje 615da9b1d9 Unscrew breakage after my last commit because of symbol prefixes. 15 years ago
Ronald S. Bultje a33a2562c1 Rename h264_weight_sse2.asm to h264_weight.asm; add 16x8/8x16/8x4 non-square 15 years ago