[Feature] Support batches and files as unified API routes #819

Open
3 tasks
narengogi opened this issue Dec 17, 2024 · 1 comment
Labels
enhancement New feature or request triage

Comments

@narengogi
Collaborator

What Would You Like to See with the Gateway?

Requirements:

  • Support /v1/files routes as a unified API
  • Support /v1/batches routes as a unified API
  • Additionally, support a route to fetch a batch's output, something like /v1/batches/:batchId/output
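
The three routes listed above could be recognized with a small matcher like the one below. This is only a sketch: `UnifiedRoute` and `matchUnifiedRoute` are hypothetical names invented for illustration, not part of the gateway, and the output path simply follows the "something like" suggestion in the last bullet.

```typescript
// Hypothetical route matcher for the proposed unified routes.
// None of these names exist in the gateway; they are illustrative only.
type UnifiedRoute =
  | { kind: "files" }
  | { kind: "batches" }
  | { kind: "batchOutput"; batchId: string };

function matchUnifiedRoute(pathname: string): UnifiedRoute | null {
  if (pathname === "/v1/files") return { kind: "files" };
  if (pathname === "/v1/batches") return { kind: "batches" };

  // Assumed shape of the output route: /v1/batches/:batchId/output
  const match = /^\/v1\/batches\/([^/]+)\/output$/.exec(pathname);
  const batchId = match?.[1];
  if (batchId !== undefined) return { kind: "batchOutput", batchId };

  return null; // not one of the unified file/batch routes
}
```

A real router would also dispatch on the HTTP method and on the target provider; both are left out here to keep the sketch short.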

This would require a considerable number of changes in the gateway, so for v1:

  1. Support the following providers: openai, bedrock, cohere.
  2. Bedrock requires uploading to S3 in chunks, because streaming a file to S3 without knowing the content-length is not allowed. Please make provisions for that.
  3. File uploads should not increase memory consumption in worker environments. Even when transforming the files, only 100 KB chunks should be read at a time.
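
One way to keep memory bounded, as items 2 and 3 ask, is to regroup incoming bytes into fixed-size chunks and hand each one off as soon as it fills. The sketch below assumes 100 KB means 100 * 1024 bytes; `createRechunker` and `Rechunker` are hypothetical helpers written for illustration, not existing gateway code, and no provider or S3 calls are shown.

```typescript
// Hypothetical helper, not part of the gateway codebase.
// Assumption: "100kb" in the requirement means 100 * 1024 bytes.
const CHUNK_SIZE = 100 * 1024;

interface Rechunker {
  // Feed bytes in; get back every chunk that became full.
  push(data: Uint8Array): Uint8Array[];
  // Return the final partial chunk, or null if nothing is pending.
  flush(): Uint8Array | null;
}

function createRechunker(size: number = CHUNK_SIZE): Rechunker {
  if (!Number.isInteger(size) || size <= 0) {
    throw new RangeError("size must be a positive integer");
  }
  let buffer = new Uint8Array(size); // the only long-lived buffer
  let filled = 0;

  return {
    push(data) {
      const full: Uint8Array[] = [];
      let offset = 0;
      while (offset < data.length) {
        // Copy as much as fits into the current buffer.
        const n = Math.min(size - filled, data.length - offset);
        buffer.set(data.subarray(offset, offset + n), filled);
        filled += n;
        offset += n;
        if (filled === size) {
          full.push(buffer);
          buffer = new Uint8Array(size); // start a fresh chunk
          filled = 0;
        }
      }
      return full;
    },
    flush() {
      if (filled === 0) return null;
      const tail = buffer.slice(0, filled);
      filled = 0;
      return tail;
    },
  };
}
```

A caller would push each piece read from the request body, transform and forward the returned chunks immediately, and call `flush()` once at the end. For the Bedrock case, note that S3 multipart uploads require every part except the last to be at least 5 MiB, so the same helper with a larger `size` could be used to assemble parts; that would be a design choice for the implementation rather than something this issue specifies.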

Context for your Request

No response

Your Twitter/LinkedIn

No response

narengogi added the enhancement label on Dec 17, 2024
@anterart

anterart commented Dec 22, 2024

Hi @narengogi, when do you think this feature will be available?
I'm really looking forward to it :)
Also, will the batch APIs of Google's Vertex AI and Azure OpenAI be supported?

2 participants