358 Commits (efd8f22a167197fbaf2b23fb29bf7096fade65e4)

Author SHA1 Message Date
Loren Merritt 1b87c40245 slightly faster ff_imdct_calc_3dn2() on amd64. (gcc added a bunch of useless movsxd) 19 years ago
Michael Niedermayer 21bb884fb7 change vorbis_inverse_coupling_sse2() so it works on sse1 cpus 19 years ago
Loren Merritt bcfa3e58ee 3dnow2 implementation of imdct. 19 years ago
Loren Merritt cd035a6051 10l, vorbis_inverse_coupling_sse() was really 3dnow 19 years ago
Loren Merritt 2dac4acfc0 sse & sse2 implementations of vorbis channel coupling. 19 years ago
Stefan Gehrer 595e7bd940 some MMX optimizations for the CAVS decoder 19 years ago
Michael Niedermayer 5ced7b80ad disable the vp3 mmx and sse2 idcts, their output doesnt match the c idct (tested with -f crc) and the theora spec does not allow different idcts not to mention the difference is quite vissible ... 19 years ago
Måns Rullgård 98d417cbcd #define SBUTTERFLY outside CONFIG_ENCODERS 19 years ago
Luca Abeni 9c39071d6d Move REG_* macros from libavcodec/i386/mmx.h to libavutil/x86_cpu.h 19 years ago
Måns Rullgård 3f8674a902 remove redundant macro definitions 19 years ago
Måns Rullgård 8fb0d07339 kill warning 19 years ago
Michael Niedermayer e27b6e62f7 missmatch control for mpeg2 intra dequantization if bitexact=1 19 years ago
Zuxy Meng 392f6da897 Remove unused and unsupported Cyrix's "Extended MMX", 19 years ago
Wolfram Gloger f42635f558 gcc-2.95 compile fix, patch by Wolfram Gloger <wmglo A dent PIS med PIS uni-muenchen PIS de> 19 years ago
Loren Merritt 75ca1a5f70 gmc_mmx tweaks 19 years ago
Loren Merritt 703c8195a8 mmx implementation of 3-point GMC. (5x faster than C) 19 years ago
Luca Barbato 22b48b85b6 altivec support for snow 19 years ago
Loren Merritt 5e8b787afa simplified and slightly faster h264_chroma_mc8_mmx 19 years ago
Loren Merritt 513fbd8e5a prefetch pixels for future motion compensation. 2-5% faster h264. 19 years ago
Loren Merritt 5e6a5c4daf 10l 19 years ago
Loren Merritt fdd3057981 added mmx implementation of h264_chroma_mc2 19 years ago
Robert Edele e8600e5edc add MMX and SSE versions of ff_snow_inner_add_yblock 19 years ago
Robert Edele 2c9a0285d4 snow mmx+sse2 optimizations, part 4 19 years ago
Robert Edele 4567b4bdab Add the mmx and sse2 implementations of ff_snow_vertical_compose(). 19 years ago
Robert Edele 059715a41c First part of a series of speed-enchancing patches. 19 years ago
Zuxy Meng 82eb4b0f1b 3DNow! & Extended 3DNow! versions of FFT 19 years ago
Loren Merritt 548a1c8a35 h264_idct8_add_mmx 19 years ago
Loren Merritt 6da971f160 h264_idct_add only needs mmx1 19 years ago
Zuxy Meng 2ffb22d2ad use xorps instead of mulps to toggle the sign of a float, as suggested by Software Optimization Guide for AMD64 Processors. 19 years ago
Loren Merritt d84f7c61ee gcc2.95 workaround 19 years ago
Loren Merritt 7a5b2fa812 remove some useless instructions 19 years ago
Loren Merritt 6a8eb0f45a 4% faster h264_qpel_mc 19 years ago
Loren Merritt ef9d1d1575 h264: special case dc-only idct. ~1% faster overall 19 years ago
Loren Merritt 4e295993ba 10l in 1.12 19 years ago
Loren Merritt 6ee669732d 10l (x86_64) 19 years ago
Loren Merritt e545f37527 18% faster put_h264_qpel16_mc[13]2_mmx2 19 years ago
Loren Merritt c03ce51dfb 11% faster put_h264_qpel16_v_lowpass_mmx2 19 years ago
Loren Merritt 0331f09237 15% faster put_h264_qpel16_hv_lowpass_mmx2 19 years ago
Steve L'Homme 68b51e58ce MSVC-compatible __align8/__align16 declaration 19 years ago
Diego Biurrun 5509bffa88 Update licensing information: The FSF changed postal address. 19 years ago
Loren Merritt e8b562087d tweak h264_biweight 19 years ago
Loren Merritt cec9395977 fix some potential arithmetic overflows in pred_direct_motion() and 19 years ago
Diego Biurrun bb270c0896 COSMETICS: tabs --> spaces, some prettyprinting 19 years ago
Diego Biurrun 115329f160 COSMETICS: Remove all trailing whitespace. 19 years ago
Guillaume Poirier f6d1338cb5 Add the rest of missing Reg_* macros to support both AMD-64 style regs and IA32 regs. 19 years ago
Loren Merritt ea15df8048 use sse16_sse2() in nsse 19 years ago
Loren Merritt a6624e21cb faster h264_chroma_mc8_mmx, added h264_chroma_mc4_mmx. 19 years ago
Loren Merritt b926572aa9 h264 mmx weighted prediction. up to 3% overall speedup. 19 years ago
Loren Merritt 5693c08356 sse2 16x16 sum squared diff (306=>268 cycles on a K8) 19 years ago
Michael Niedermayer 12e9668119 replace a few mov + psrlq with pshufw, there are more cases which could benefit from this but they would require us to duplicate some functions ... 19 years ago