
How do I propagate intermediate results in an ensemble model? #6537


Closed
callmezhangchenchenokay opened this issue Nov 8, 2023 · 3 comments

Comments

@callmezhangchenchenokay

Because of this problem:
triton-inference-server/tensorrtllm_backend#71

How do I propagate intermediate results in an ensemble model?

I want to get the output of the first model.

I tried to use the output of the first model directly as an input to the third model, but it didn't work.

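For reference, the usual way to expose an intermediate tensor in a Triton ensemble is to declare it in the ensemble's top-level output section and map the producing step's output onto that same name. Below is a minimal config.pbtxt sketch of that pattern; the model and tensor names (preprocessing, QUERY, INPUT_ID, REQUEST_INPUT_LEN) are assumptions taken from the tensorrt_llm example and may differ from your setup:

```
# ensemble/config.pbtxt -- a sketch only; model and tensor names are
# assumptions based on the tensorrt_llm example ensemble.
name: "ensemble"
platform: "ensemble"
max_batch_size: 1
input [
  {
    name: "text_input"
    data_type: TYPE_STRING
    dims: [ -1 ]
  }
]
output [
  {
    name: "text_output"
    data_type: TYPE_STRING
    dims: [ -1 ]
  },
  {
    # The intermediate tensor becomes visible to the client simply by
    # being listed as an ensemble output.
    name: "REQUEST_INPUT_LEN"
    data_type: TYPE_INT32
    dims: [ 1 ]
  }
]
ensemble_scheduling {
  step [
    {
      model_name: "preprocessing"
      model_version: -1
      input_map {
        key: "QUERY"
        value: "text_input"
      }
      # Map the step output onto the same tensor name declared in the
      # ensemble output section above, so Triton returns it to the client.
      output_map {
        key: "REQUEST_INPUT_LEN"
        value: "REQUEST_INPUT_LEN"
      }
      output_map {
        key: "INPUT_ID"
        value: "_INPUT_ID"
      }
    }
    # ... the tensorrt_llm and postprocessing steps follow unchanged ...
  ]
}
```

With that mapping in place, the client receives REQUEST_INPUT_LEN alongside the ensemble's final outputs.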
@oandreeva-nv (Contributor)

Did the solution provided in triton-inference-server/tensorrtllm_backend#71 work for your case?

@callmezhangchenchenokay (Author)

Thanks for your reply!
At the bottom of that thread is my comment confirming that the problem has been solved.

@callmezhangchenchenokay (Author)

Sorry to bother you again!

The solution mentioned above requires model_transaction_policy to be set to True, and it can only be used when stream = False.

However, this problem occurs when stream = True.

So there needs to be a way to export REQUEST_INPUT_LEN when stream = True and model_transaction_policy = True.
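For the streaming case, one commonly suggested workaround is to replace the ensemble with a decoupled BLS model (Python backend) that calls preprocessing once, keeps REQUEST_INPUT_LEN, and attaches it to every response it streams back from the decoupled tensorrt_llm model. The sketch below only illustrates that idea; it is not the actual tensorrt_llm_bls implementation, and the model names, tensor names, and input set are assumptions:

```python
# model.py for a hypothetical decoupled BLS model -- a sketch only.
import triton_python_backend_utils as pb_utils


class TritonPythonModel:
    def execute(self, requests):
        for request in requests:
            # This model must itself be decoupled
            # (model_transaction_policy { decoupled: True }).
            sender = request.get_response_sender()

            # 1) Run preprocessing synchronously and keep REQUEST_INPUT_LEN.
            query = pb_utils.get_input_tensor_by_name(request, "text_input")
            pre_req = pb_utils.InferenceRequest(
                model_name="preprocessing",
                requested_output_names=["INPUT_ID", "REQUEST_INPUT_LEN"],
                inputs=[pb_utils.Tensor("QUERY", query.as_numpy())],
            )
            pre_resp = pre_req.exec()
            if pre_resp.has_error():
                raise pb_utils.TritonModelException(pre_resp.error().message())
            input_ids = pb_utils.get_output_tensor_by_name(pre_resp, "INPUT_ID")
            input_len = pb_utils.get_output_tensor_by_name(
                pre_resp, "REQUEST_INPUT_LEN")

            # 2) Call the decoupled tensorrt_llm model and re-emit each
            #    streamed response together with the intermediate tensor.
            #    (A real call would also pass request_output_len and the
            #    other inputs the backend requires.)
            llm_req = pb_utils.InferenceRequest(
                model_name="tensorrt_llm",
                requested_output_names=["output_ids"],
                inputs=[
                    pb_utils.Tensor("input_ids", input_ids.as_numpy()),
                    pb_utils.Tensor("input_lengths", input_len.as_numpy()),
                ],
            )
            for llm_resp in llm_req.exec(decoupled=True):
                if llm_resp.has_error():
                    raise pb_utils.TritonModelException(
                        llm_resp.error().message())
                out = pb_utils.get_output_tensor_by_name(llm_resp, "output_ids")
                if out is None:
                    continue  # e.g. the final, empty response of the stream
                sender.send(pb_utils.InferenceResponse(output_tensors=[
                    pb_utils.Tensor("output_ids", out.as_numpy()),
                    pb_utils.Tensor("REQUEST_INPUT_LEN", input_len.as_numpy()),
                ]))

            # Signal that no more responses will be sent for this request.
            sender.send(flags=pb_utils.TRITONSERVER_RESPONSE_COMPLETE_FINAL)
        return None
```

With stream = True the client then receives REQUEST_INPUT_LEN with every streamed response instead of only at the end.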
