Loren Merritt
1b87c40245
slightly faster ff_imdct_calc_3dn2() on amd64. (gcc added a bunch of useless movsxd)
...
Originally committed as revision 5962 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Michael Niedermayer
21bb884fb7
change vorbis_inverse_coupling_sse2() so it works on sse1 cpus
...
Originally committed as revision 5957 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
bcfa3e58ee
3dnow2 implementation of imdct.
...
6% faster vorbis and wma.
Originally committed as revision 5954 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
cd035a6051
10l, vorbis_inverse_coupling_sse() was really 3dnow
...
Originally committed as revision 5903 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
2dac4acfc0
sse & sse2 implementations of vorbis channel coupling.
...
9% faster vorbis (on a K8).
Originally committed as revision 5898 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Stefan Gehrer
595e7bd940
some MMX optimizations for the CAVS decoder
...
Originally committed as revision 5846 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Michael Niedermayer
5ced7b80ad
disable the vp3 mmx and sse2 idcts, their output doesnt match the c idct (tested with -f crc) and the theora spec does not allow different idcts not to mention the difference is quite vissible ...
...
Originally committed as revision 5788 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Måns Rullgård
98d417cbcd
#define SBUTTERFLY outside CONFIG_ENCODERS
...
Originally committed as revision 5628 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Luca Abeni
9c39071d6d
Move REG_* macros from libavcodec/i386/mmx.h to libavutil/x86_cpu.h
...
Originally committed as revision 5595 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Måns Rullgård
3f8674a902
remove redundant macro definitions
...
Originally committed as revision 5589 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Måns Rullgård
8fb0d07339
kill warning
...
Originally committed as revision 5588 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Michael Niedermayer
e27b6e62f7
missmatch control for mpeg2 intra dequantization if bitexact=1
...
Originally committed as revision 5328 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Zuxy Meng
392f6da897
Remove unused and unsupported Cyrix's "Extended MMX",
...
Add SSE3 support.
Patch by Zuxy Meng < zuxy POIS meng AH gmail POIS com >
Original thread:
04/26/06 13:13:
[Ffmpeg-devel] [PATCH] Bug fix, SSE3 support in i386/cputest.c and dsputil.h
Originally committed as revision 5326 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Wolfram Gloger
f42635f558
gcc-2.95 compile fix, patch by Wolfram Gloger <wmglo A dent PIS med PIS uni-muenchen PIS de>
...
Originally committed as revision 5298 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
75ca1a5f70
gmc_mmx tweaks
...
Originally committed as revision 5269 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
703c8195a8
mmx implementation of 3-point GMC. (5x faster than C)
...
Originally committed as revision 5265 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Luca Barbato
22b48b85b6
altivec support for snow
...
Originally committed as revision 5228 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
5e8b787afa
simplified and slightly faster h264_chroma_mc8_mmx
...
Originally committed as revision 5214 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
513fbd8e5a
prefetch pixels for future motion compensation. 2-5% faster h264.
...
Originally committed as revision 5203 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
5e6a5c4daf
10l
...
Originally committed as revision 5201 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
fdd3057981
added mmx implementation of h264_chroma_mc2
...
Originally committed as revision 5200 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Robert Edele
e8600e5edc
add MMX and SSE versions of ff_snow_inner_add_yblock
...
Patch by Robert Edele < yartrebo AH earthlink POIS net >
Original Thread:
Date: Mar 22, 2006 3:24 AM
Subject: [Ffmpeg-devel] [PATCH] snow mmx + sse2 part 5
Originally committed as revision 5197 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Robert Edele
2c9a0285d4
snow mmx+sse2 optimizations, part 4
...
Patch by Robert Edele, yartrebo <<at>> earthlink <<dot>> net
Originally committed as revision 5191 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Robert Edele
4567b4bdab
Add the mmx and sse2 implementations of ff_snow_vertical_compose().
...
Patch by Robert Edele < yartrebo AH earthlink POIS net >
Original thread:
Date: Mar 20, 2006 5:54 PM
Subject: [Ffmpeg-devel] [PATCH] snow mmx + sse2 part 3
Originally committed as revision 5185 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Robert Edele
059715a41c
First part of a series of speed-enchancing patches.
...
This one sets up a snow.h and makes snow use the dsputil function pointer
framework to access the three functions that will be implemented in asm
in the other parts of the patchset.
Patch by Robert Edele < yartrebo AH earthlink POIS net>
Original thread:
Subject: [Ffmpeg-devel] [PATCH] Snow mmx+sse2 asm optimizations
Date: Sun, 05 Feb 2006 12:47:14 -0500
Originally committed as revision 5172 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Zuxy Meng
82eb4b0f1b
3DNow! & Extended 3DNow! versions of FFT
...
Patch by Zuxy Meng, zuxy <<dot>> meng >>at<< gmail <<dot>> com
Minor non-functional diff-related fixes by me.
Originally committed as revision 5125 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
548a1c8a35
h264_idct8_add_mmx
...
Originally committed as revision 5123 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
6da971f160
h264_idct_add only needs mmx1
...
Originally committed as revision 5122 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Zuxy Meng
2ffb22d2ad
use xorps instead of mulps to toggle the sign of a float, as suggested by Software Optimization Guide for AMD64 Processors.
...
Patch by Zuxy Meng < zuxy POIS meng AH gmail POIS com > OKed by Michael
Original thread:
Date: Mar 5, 2006 8:15 PM
Subject: [Ffmpeg-devel] [PATCH] Little optimization to fft_sse.c
Originally committed as revision 5112 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
d84f7c61ee
gcc2.95 workaround
...
Originally committed as revision 5111 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
7a5b2fa812
remove some useless instructions
...
Originally committed as revision 5109 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
6a8eb0f45a
4% faster h264_qpel_mc
...
Originally committed as revision 5094 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
ef9d1d1575
h264: special case dc-only idct. ~1% faster overall
...
Originally committed as revision 4971 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
4e295993ba
10l in 1.12
...
Originally committed as revision 4965 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
6ee669732d
10l (x86_64)
...
Originally committed as revision 4952 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
e545f37527
18% faster put_h264_qpel16_mc[13]2_mmx2
...
Originally committed as revision 4951 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
c03ce51dfb
11% faster put_h264_qpel16_v_lowpass_mmx2
...
Originally committed as revision 4950 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
0331f09237
15% faster put_h264_qpel16_hv_lowpass_mmx2
...
Originally committed as revision 4949 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Steve L'Homme
68b51e58ce
MSVC-compatible __align8/__align16 declaration
...
patch by Steve Lhomme, steve .dot. lhomme .at. free .dot. fr
Originally committed as revision 4942 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Diego Biurrun
5509bffa88
Update licensing information: The FSF changed postal address.
...
Originally committed as revision 4842 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
e8b562087d
tweak h264_biweight
...
Originally committed as revision 4835 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
cec9395977
fix some potential arithmetic overflows in pred_direct_motion() and
...
ff_h264_weight_WxH_mmx2().
Originally committed as revision 4795 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Diego Biurrun
bb270c0896
COSMETICS: tabs --> spaces, some prettyprinting
...
Originally committed as revision 4764 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Diego Biurrun
115329f160
COSMETICS: Remove all trailing whitespace.
...
Originally committed as revision 4749 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Guillaume Poirier
f6d1338cb5
Add the rest of missing Reg_* macros to support both AMD-64 style regs and IA32 regs.
...
Not used yet, but should be once the SIMD code to accelerate Snow decoding is merged.
Originally committed as revision 4731 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
ea15df8048
use sse16_sse2() in nsse
...
Originally committed as revision 4688 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
a6624e21cb
faster h264_chroma_mc8_mmx, added h264_chroma_mc4_mmx.
...
2-4% overall speedup.
Originally committed as revision 4666 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
b926572aa9
h264 mmx weighted prediction. up to 3% overall speedup.
...
Originally committed as revision 4630 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Loren Merritt
5693c08356
sse2 16x16 sum squared diff (306=>268 cycles on a K8)
...
faster 8x8 mmx ssd (77=>70 cycles)
Originally committed as revision 4623 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago
Michael Niedermayer
12e9668119
replace a few mov + psrlq with pshufw, there are more cases which could benefit from this but they would require us to duplicate some functions ...
...
the trick is from various places (my own code in libpostproc, a patch on the x264 list, ...)
Originally committed as revision 4608 to svn://svn.ffmpeg.org/ffmpeg/trunk
19 years ago