
Batched inferencing using HuggingfaceLocalGenerator #8770

Open
srsingh24 opened this issue Jan 24, 2025 · 4 comments
Labels
P3 Low priority, leave it in the backlog · type:feature New feature or request

Comments

@srsingh24

Is there a way to perform batched inference using HuggingFaceLocalGenerator? I could not find any information about this in the docs.

@anakin87
Member

Hello!

While it should be possible to configure the batch_size of the Hugging Face pipeline/model under the hood, this component only accepts a single prompt (str) as input, so in practice batching is not possible.
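
For reference, here is a rough sketch of what batching looks like at the transformers level, i.e. calling the underlying pipeline directly instead of going through the component. The model name and parameters below are placeholders, not a recommendation:

```python
from transformers import pipeline

# The plain transformers pipeline accepts a list of prompts plus a batch_size,
# which is what HuggingFaceLocalGenerator.run(prompt: str) cannot expose today.
pipe = pipeline(
    "text-generation",
    model="Qwen/Qwen2.5-0.5B-Instruct",  # placeholder model, use whatever you actually run
    device_map="auto",
)

# Decoder-only models need a pad token for batched generation.
if pipe.tokenizer.pad_token_id is None:
    pipe.tokenizer.pad_token_id = pipe.model.config.eos_token_id

prompts = ["Explain RAG in one sentence.", "What is Haystack?", "Summarize batching."]
outputs = pipe(prompts, batch_size=8, max_new_tokens=64)

for out in outputs:
    # For a list input, each element is a list with one dict per returned sequence.
    print(out[0]["generated_text"])
```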

Could you tell me more about your use case?

@anakin87 anakin87 added the type:feature New feature or request label Jan 24, 2025
@srsingh24
Author

srsingh24 commented Jan 24, 2025

@anakin87 Since HuggingFaceLocalGenerator only accepts a single str input, I have to loop through each prompt sequentially, which makes my code very slow. I would like to perform batched inference so I can make better use of my GPUs and run inference faster.
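
Roughly, my current workaround looks like the sketch below (simplified, assuming the standard Haystack 2.x API; the model and generation settings are placeholders rather than my exact setup):

```python
from haystack.components.generators import HuggingFaceLocalGenerator

# Simplified sketch of the sequential workaround: one run() call per prompt,
# so the GPU only ever sees an effective batch size of 1.
generator = HuggingFaceLocalGenerator(
    model="google/flan-t5-large",            # placeholder model
    task="text2text-generation",
    generation_kwargs={"max_new_tokens": 128},
)
generator.warm_up()

prompts = [f"Summarize document {i}" for i in range(8)]  # hundreds of prompts in practice

replies = []
for prompt in prompts:
    result = generator.run(prompt=prompt)  # single str in, single reply out
    replies.extend(result["replies"])
```

With batched inference, this loop could become a single call over the whole list.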

@anakin87
Member

Got it... I wanted to better understand what you are building.
Are you running evaluation? Performing RAG?

@srsingh24
Author

I am using HuggingFaceLocalGenerator for all sorts of use cases, from basic inference on a list of prompts to evaluation to RAG.

@julian-risch julian-risch added the P3 Low priority, leave it in the backlog label Jan 31, 2025