FFmpeg

Commit Graph

Author	SHA1	Message	Date
Ben Chang	8de3458a07	avcodec/nvenc: surface allocation reduction This patch aims to reduce the number of input/output surfaces NVENC allocates per session. Previous default sets allocated surfaces to 32 (unless there is user specified param or lookahead involved). Having large number of surfaces consumes extra video memory (esp for higher resolution encoding). The patch changes the surfaces calculation for default, B-frames, lookahead scenario respectively. The other change involves surface selection. Previously, if a session allocates x surfaces, only x-1 surfaces are used (due to combination of output delay and lock toggle logic). To prevent unused surfaces, changing surface rotation to using predefined fifo. Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Timo Rothenpieler	d84c2298e2	avcodec/nvenc: apply quantization factors to cqp	8 years ago
Timo Rothenpieler	7fb2a7afa1	avcodec/nvenc: Deprecate usage of global_quality, introducing qp	8 years ago
Konda Raju	3df77b58e3	nvenc: Allow different const qps for I, P and B frames Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Clément Bœsch	b7cc4eb303	lavc/nvenc: misc cosmetics to reduce diff with Libav	8 years ago
Konda Raju	2db5ab73d4	avcodec/nvenc: allow different const-qps for I, P and B frames Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Konda Raju	f6790b5e10	add initial QP value options Signed-off-by: Diego Biurrun <diego@biurrun.de>	8 years ago
Ganapathy Kasi	3303f86467	nvenc: Remove qmin and qmax constraints for nvenc vbr qmin and qmax are not necessary for nvenc vbr. Also fix for using 2 pass vbr mode for slow preset through ctx->flag NVENC_TWO_PASSES. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Konda Raju	5f44a4a0a9	avcodec/nvenc: add initial QP value options Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Ganapathy Raman Kasi	a549243b89	avcodec/nvenc: remove qmin and qmax constraints for vbr qmin and qmax are not necessary for nvenc vbr. Enforcing this constraint, doesn't allow user to use vbr 2 pass mode without explicity setting the qmin and qmax options Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Ben Chang	d8f36a6aa3	nvenc: Fix the preset mapping list The map is a sparse array and does not need a empty element to terminate it. The empty element is stored after the last one inserted in the list, overwriting whichever element was next with zeros. Bug-Id: 1029 Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Anton Khirnov	984736dd9e	lavc: make sure not to return EAGAIN from codecs This error is treated specially by the API. CC: libav-stable@libav.org	8 years ago
Diego Biurrun	00b160af11	nvenc: Fix nvec vs. nvenc typo	8 years ago
Timo Rothenpieler	be74ba648c	avcodec/nvenc: push cuda context before encoding a frame Thanks to Miroslav Slugeň for figuring out what was going on here.	8 years ago
Timo Rothenpieler	8a3fea14ae	avcodec/nvenc: set frame buffer format for mapped frames	8 years ago
Timo Rothenpieler	a52976c0fe	nvenc: make gpu indices independent of supported capabilities Do not allocate a CUDA context for every available gpu. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Timo Rothenpieler	6b0a3ee6f8	avcodec/nvenc: add logging for more error cases	8 years ago
Timo Rothenpieler	5403d90f32	avcodec/nvenc: make gpu indices independend of supported capabilities	8 years ago
Luca Barbato	fb59f87ce7	nvenc: Explicitly push the cuda context on encoding Make sure that NVENC does not misbehave if other cuda usages happen in the application.	8 years ago
Miroslav Slugen	9b425bd24c	avcodec/nvenc: Add bluray_compat basic implementation Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Miroslav Slugen	1841eda679	avcodec/nvenc: Make AUD optional for h264_nvenc and hevc_nvenc Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Miroslav Slugeň	f8c503d927	avcodec/nvenc: round qpIntra and qpInter calculation Round qpIntra and qpInter calculation instead of old floor behavior. Adopted from vaapi_encode_h264.c Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Ruta Gadkari	67db4ff3b6	NVENC: Update check for Lookahead Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Ruta Gadkari	5b26d3b789	nvenc: Update check for lookahead By default it is -1. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Timo Rothenpieler	c2f3af57a5	avcodec/nvenc: mark intentional fall through	8 years ago
Miroslav Slugeň	f2dd6aee80	avcodec/nvenc: always reduce DAR width and height Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Philip Langdale	27038693bb	avcodec/nvenc: Delay identification of underlying format of cuda frames When input surfaces are cuda frames, we will not know what the actual underlying format (nv12, p010, etc) is at surface allocation time. On the other hand, we will know when the input frames are actually registered and associated with a surface. So, let's delay format discovery until registration time, which is actually how we handle other frame properties, such as dimensions. By itself, this change doesn't allow for transcoding of 10bit content from cuvid, but it reduces the problem to the hardcoding of the sw format in ffmpeg_cuvid.c Signed-off-by: Philip Langdale <philipl@overt.org> Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Philip Langdale	829db8effd	avcodec/nvenc: Remove aspect-ratio decompensation logic This dubious behaviour in nvenc was finally removed by nvidia, and as we refuse to run on anything older than 7.0, we don't need to keep it around for old versions.	8 years ago
Miroslav Slugeň	de2faec2fa	avcodec/nvenc: better surface allocation alghoritm, fix rc_lookahead User selectable surfaces are not working correctly, if you set number of surfaces on cmdline, it will always use minimum 32 or 48 depends on selected resolution, but in nvenc it is not necessary to use so many surfaces. So from now you can define as low as 1 surface and nvenc will still work, it will ofcourse lower GPU memory usage by 95% and async_delay to zero That was the easy part, now littlebit more... Next part of this patch is to always prefer rc_lookahead to be more important for number of surfaces, than user defined surfaces value. Maximum rc_lookahead from nvidia documentation is 32, but could increase in future generations so there is no limit for this yet. Value async_depth is still accepted and prefered over rc_lookahead. There were also bug when you request more than rc_lookahead > 31, it will always set maximum 31, because surface numbers recalculation was after setting lookahead, which is now fixed. Results: If you set -rc_lookahead 32 and -bf 3 it will now use only 40 surfaces and lower GPU memory usage by 20%, also it will now increase PSNR by 0.012dB Two more comments: 1. from my internal test, i don't understand addition of 4 more surfaces when lookahead is calculated, i didn't used this and everything works as with those 4 more extra surfaces, does anybody know what is going on there? I looks like it was used for B frames which are calculated separately, because B frames maximum is 4. 2. rc_lookahead is defined default to -1, but in test condition if (ctx->rc_lookahead) which sets lookahead it will be always true, i don't know if this is intended behavior, so in default behavior is lookahead always on! This is default condition when rc_lokkahead is -1 (not defined on cmdline), whis is maybe something that is not intended: ctx->encode_config.rcParams.enableLookahead = 1; ctx->encode_config.rcParams.lookaheadDepth = 0; ctx->encode_config.rcParams.disableIadapt = 0; ctx->encode_config.rcParams.disableBadapt = 0; Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Timo Rothenpieler	a66835bcb1	avcodec/nvenc: use dynamically loaded CUDA	8 years ago
Matt Oliver	6ead033bca	avcodec/nvenc.c: Use new safe dlopen code. Signed-off-by: Matt Oliver <protogonoi@gmail.com>	8 years ago
Sven C. Dack	da4d0fa86b	avcodec/nvenc: add test for Temporal AQ support Adds a check to see if the hardware supports temporal aq. Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Timo Rothenpieler	30c5587503	avcodec/nvenc: add support for forcing intra/idr frames	8 years ago
Yogender Gupta	cbd84b8a51	nvenc: Fix error log Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Yogender Gupta	da2848375a	nvenc: Force high_444 profile for 444 input Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Yogender Gupta	facc19ef06	avcodec/nvenc: Extended rate-control support as provided by SDK 7 Merged from libav commit by Yogender Gupta: https://git.libav.org/?p=libav.git;a=commitdiff;h=70de2ea4261f860457a04e3d0c58c5543f403325	8 years ago
Timo Rothenpieler	033f98c902	avcodec/nvenc: add HEVC REXT profile	8 years ago
Timo Rothenpieler	a81b000a39	avcodec/nvenc: Make sure that enum and array index match Based on libav commits by Luca Barbato and Yogender Gupta: https://git.libav.org/?p=libav.git;a=commit;h=352741b5ead1543d775ccf6040f33023e4491186 https://git.libav.org/?p=libav.git;a=commit;h=e02e2515b24bfc37ede6ca1744696230be55e50b	8 years ago
James Almer	dc48248ea8	avcodec/nvenc: use AVERROR_BUFFER_TOO_SMALL instead of ENOBUFS Should fix compilation with mingw32 Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: James Almer <jamrial@gmail.com>	8 years ago
Yogender Gupta	70de2ea426	nvenc: Extended rate-control support as provided by SDK 7 Signed-off-by: Luca Barbato <lu_zero@gentoo.org> Signed-off-by: Diego Biurrun <diego@biurrun.de>	8 years ago
Yogender Gupta	358c887a9f	nvenc: Add support for high bitdepth Signed-off-by: Luca Barbato <lu_zero@gentoo.org> Signed-off-by: Diego Biurrun <diego@biurrun.de>	8 years ago
Yogender Gupta	e02e2515b2	nvenc: Add some easier to understand presets that match x264 terminology Signed-off-by: Luca Barbato <lu_zero@gentoo.org> Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Luca Barbato	352741b5ea	nvenc: Make sure that enum and array index match And use a macro to reduce the boilerplate. Signed-off-by: Luca Barbato <lu_zero@gentoo.org> Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	8 years ago
Timo Rothenpieler	8ebe1dddfb	avcodec/nvenc: use frame size instead of surface size	8 years ago
Sven C. Dack	4aeb7a88ec	avcodec/nvenc: support RGB input nvenc still encodes as yuv, but does the conversion internally which brings some performance gains. Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	8 years ago
Timo Rothenpieler	fa3ecad071	avcodec/nvenc: correctly set inputPitch	8 years ago
Timo Rothenpieler	96cba1c552	avcodec/nvenc: use av_image_copy for copying frame data	8 years ago
Timo Rothenpieler	cac2df230e	avcodec/nvenc: update license header	8 years ago
Timo Rothenpieler	26a5cbd781	avcodec/nvenc: use proper soname for cuda/nvenc libraries	8 years ago
Timo Rothenpieler	df615efcf2	avcodec/nvenc: check maximum driver API version	8 years ago

1 2 3 4 5

232 Commits (3aeeee1597abd6c12308fbc2f4087d7c943166df)