jacopo-chevallard changed the title from "Implement a first draft of retrieval metrics" to "Implement first version of retrieval metrics" on Jan 22, 2025
jacopo-chevallard changed the title from "Implement first version of retrieval metrics" to "Implement first version of retrieval/generation metrics" on Jan 23, 2025
jacopo-chevallard changed the title from "Implement first version of retrieval/generation metrics" to "Implement first version of retrieval/generation evaluation" on Jan 28, 2025
For a list of potential datasets for retrieval, see CORE-325 and Notion.
Evaluation steps for CI/CD
1. Select a subset (1 to 20). Each subset contains 135 Q&A pairs, with 5 HTML documents for each question, for a total of 675 documents (in HTML format).
2. Retrieve the reference dataset --> CORE-357
3. For each row, parse, chunk and embed the documents contained in `search_results.page_result` --> CORE-348
4. For each row, extract the question from the `query` field and run the chosen RAG workflow to obtain an answer --> CORE-345
5. Compute evaluation metrics comparing the ground truth answers and the answers produced in step 4 --> CORE-340
6. Push the results to the experiment tracker --> CORE-351
7. Define thresholds for alerting --> CORE-350
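
The evaluation steps above could be sketched roughly as follows. This is only an illustration, not the implementation tracked in the linked tickets. It makes several assumptions: each row is a dict whose `search_results` is a list of entries carrying a `page_result` key, the ground truth lives in a field named `answer`, `rag_workflow` is a hypothetical callable taking a question and a list of HTML pages, token-level F1 stands in for whatever metrics CORE-340 settles on, and the `0.5` threshold is a placeholder.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-level F1 between a generated answer and the ground truth."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; only one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def evaluate(rows, rag_workflow, threshold=0.5):
    """Run the (hypothetical) RAG workflow on each row and score the answers.

    Assumed row layout: {"query": str, "answer": str,
                         "search_results": [{"page_result": str}, ...]}
    """
    scores = []
    for row in rows:
        # Step 3 (parse, chunk, embed) is delegated to the workflow here.
        pages = [result["page_result"] for result in row["search_results"]]
        # Step 4: answer the question from the `query` field.
        predicted = rag_workflow(row["query"], pages)
        # Step 5: compare against the ground truth answer.
        scores.append(token_f1(predicted, row["answer"]))
    mean_f1 = sum(scores) / len(scores) if scores else 0.0
    # Step 7: flag the run when the mean score drops below the threshold.
    return {"mean_f1": mean_f1, "n": len(scores), "alert": mean_f1 < threshold}
```

The returned dict is the kind of payload that could be pushed to the experiment tracker in step 6; the tracker call itself is left out because the issue does not name the tool.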