Skip to content

InferenceClient: provide generator-based bulk inference #68

Open
@mhaas

Description

@mhaas

The current implementation will try to perform inference for all chunks. If an exception occurs, the progress is lost.

By yielding the individual chunks, the caller can be in charge of error handling.

We can either change the existing API or introduce a separate method (and implement do_bulk_inference using the new method).

See also #62

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions