[https://nvbugs/6329052][fix] Add attn_backend: FLASHINFER and model_kwargs: {num_hidden_layers: 4} to… #15464
Open
tensorrt-cicd wants to merge 2 commits into