Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Retrieval + generation eval: parse, chunk and embed dataset #3571

Open
jacopo-chevallard opened this issue Jan 28, 2025 — with Linear · 1 comment
Open

Retrieval + generation eval: parse, chunk and embed dataset #3571

jacopo-chevallard opened this issue Jan 28, 2025 — with Linear · 1 comment
Assignees

Comments

Copy link
Collaborator

jacopo-chevallard commented Jan 28, 2025

Given an ingestion workflow/configuration, we should parse, chunk and embed the reference dataset and store the results into a database for successive retrieval.

Open questions:

  • create a temporary (supabase) instance and use it for evaluation purposes, so when the evaluation is finished, the instance can be deleted? Or shall we persist the results for successive inspection, for instance to understand why a certain evaluation run provided bad results?
@jacopo-chevallard jacopo-chevallard self-assigned this Jan 28, 2025
Copy link

linear bot commented Jan 28, 2025

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant