Drastic reduction in trt plan cache size #946

hyln9 · 2024-06-08T21:50:22Z

Hello!

As NVIDIA has finally released TensorRT 10.0 and made it publicly available on their website, I did some research on the now improved engine refitting API.

The result is very promising and the size of the plan cache is reduced by ~30x on my laptop. Support for the newer CUDA 12.x has been added as well.

inisis · 2024-06-13T08:38:03Z

Hi, I'm a little bit curious why the plan cache became 30x smaller, I refer to the doc, it seems that refitter is used to change engine weight dynamically. Thanks.

hyln9 · 2024-06-13T12:26:11Z

Hi, I'm a little bit curious why the plan cache became 30x smaller, I refer to the doc, it seems that refitter is used to change engine weight dynamically. Thanks.

The kSTRIP_PLAN flag enables weight-stripping and works well with refitting at runtime.

ActiveIce · 2024-06-18T06:00:56Z

Thanks for your work. I ran into a problem when compile it with TensorRT 10.1.0 . The CMakeLists.txt cannot read version number in NvInferVersion.h since it changed the encoding to utf16-le. Should I mod the CMakeLists.txt or do anything else?

hyln9 · 2024-06-19T11:38:08Z

Thanks for your work. I ran into a problem when compile it with TensorRT 10.1.0 . The CMakeLists.txt cannot read version number in NvInferVersion.h since it changed the encoding to utf16-le. Should I mod the CMakeLists.txt or do anything else?

It should be fixed now.

hyln9 added 2 commits June 9, 2024 04:32

Require TensorRT 10.0 or greater

ffcabe5

Minimize trt plan cache size

79563bd

Fix trt version detection

2fd4a8b

Fix trt deprecation warning

5a1ef44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Drastic reduction in trt plan cache size #946

Drastic reduction in trt plan cache size #946

hyln9 commented Jun 8, 2024

inisis commented Jun 13, 2024

hyln9 commented Jun 13, 2024

ActiveIce commented Jun 18, 2024

hyln9 commented Jun 19, 2024

Drastic reduction in trt plan cache size #946

Are you sure you want to change the base?

Drastic reduction in trt plan cache size #946

Conversation

hyln9 commented Jun 8, 2024

inisis commented Jun 13, 2024

hyln9 commented Jun 13, 2024

ActiveIce commented Jun 18, 2024

hyln9 commented Jun 19, 2024