vulkan_decode: halve execution pool size

Determined experimentally, on various videos and hardware.
On Intel, using less resources in-flight is around 15% faster,
with similar results on Nvidia hardware.
pull/389/head
Lynne 1 year ago
parent 5f1be341c2
commit 24c4307b80
No known key found for this signature in database
GPG Key ID: A2FEA5F03F034464
  1. 5
      libavcodec/vulkan_decode.c

@ -1105,8 +1105,9 @@ int ff_vk_decode_init(AVCodecContext *avctx)
session_create.pVideoProfile = &prof->profile_list.pProfiles[0];
/* Create decode exec context.
* 4 async contexts per thread seems like a good number. */
err = ff_vk_exec_pool_init(s, &qf_dec, &ctx->exec_pool, 4*avctx->thread_count,
* 2 async contexts per thread was experimentally determined to be optimal
* for a majority of streams. */
err = ff_vk_exec_pool_init(s, &qf_dec, &ctx->exec_pool, 2*avctx->thread_count,
nb_q, VK_QUERY_TYPE_RESULT_STATUS_ONLY_KHR, 0,
session_create.pVideoProfile);
if (err < 0)

Loading…
Cancel
Save