Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement first version of retrieval/generation evaluation #3553

Open
jacopo-chevallard opened this issue Jan 22, 2025 — with Linear · 1 comment
Open

Implement first version of retrieval/generation evaluation #3553

jacopo-chevallard opened this issue Jan 22, 2025 — with Linear · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

Copy link
Collaborator

jacopo-chevallard commented Jan 22, 2025

For a list of potential datasets for retrieval see CORE-325 and Notion

Evaluation steps for CI/CD

  1. Select a subset (1 to 20) .Each subset contains 135 Q&A, with 5 html documents for each question, so a total of 675 documents (in html format)

  2. Retrieve reference dataset --> CORE-357

  3. For each row, parse, chunk and embed the documents contained in search_results.page_result --> CORE-348

  4. For each row, extract the question from the query field and run the chosen RAG workflow to obtain an answer --> CORE-345

  5. Compute evaluation metrics comparing the ground truth answers and the answers produced in 4 --> CORE-340

  6. Push the results to exp. tracker --> CORE-351

  7. Define thresholds for alerting --> CORE-350

Copy link

linear bot commented Jan 22, 2025

@dosubot dosubot bot added the enhancement New feature or request label Jan 22, 2025
@jacopo-chevallard jacopo-chevallard changed the title Implement a first draft of retrieval metrics Implement first version of retrieval metrics Jan 22, 2025
@jacopo-chevallard jacopo-chevallard changed the title Implement first version of retrieval metrics Implement first version of retrieval/generation metrics Jan 23, 2025
@jacopo-chevallard jacopo-chevallard changed the title Implement first version of retrieval/generation metrics Implement first version of retrieval/generation evaluation Jan 28, 2025
@jacopo-chevallard jacopo-chevallard self-assigned this Jan 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant