Skip to content

👋 Welcome to Athina AI

Athina is building monitoring and evaluation tools for LLM developers.

Sign Up | Website | Contact

  • Evals SDK: Open-source framework for evaluating LLMs (Python + CLI)
  • Platform: Monitor your production inferences, and automatically run evals

hero

Open-Source SDK for Evals

athina-ai/athina-evals

Documentation | Quick Start | Running Evals

We have a library of preset evaluators, but you can also write custom evaluators within the Athina framework.

Example Preset Evals:

  • Context Contains Enough Information: Detect bad or insufficient retrievals.
  • Does Response Answer Query: Detect incomplete or irrelevant responses.
  • Response Faithfulness: Detect when responses are deviating from the provided context.
  • Summarization Accuracy: Detect hallucinations and mistakes in summaries
  • Grading Criteria: If X, then fail. Otherwise pass.
  • Custom Evals: Custom prompt for LLM-powered evaluation.
  • RAGAS: A set of evaluators that return RAGAS metrics.

Results can also be viewed and tracked on our platform. develop-view

Monitoring & Evaluations Platform for LLM Inferences

Documentation | Demo Video | Sign Up

  • UI for monitoring and visibility into your LLM inferences.
  • Run evals automatically against logged inferences in production.
  • Track cost, token usage, response times, feedback, pass rate and other eval metrics.
  • Analytics segmented by Customer ID, Model, Prompt, Environment, and More.
  • Topic Classification
  • Data Exports
  • ... and more

Contact [email protected] if you have any questions.

Pinned Loading

  1. athina-evals athina-evals Public

    Python SDK for running evaluations on LLM generated responses

    Python 156 11

Repositories

Showing 10 of 14 repositories
  • athina-evals Public

    Python SDK for running evaluations on LLM generated responses

    athina-ai/athina-evals’s past year of commit activity
    Python 156 11 0 3 Updated Jul 3, 2024
  • athina-client Public

    A light weight version of athina SDK

    athina-ai/athina-client’s past year of commit activity
    Python 0 0 0 1 Updated Jul 2, 2024
  • athina-ai/athina-deploy’s past year of commit activity
    Shell 1 0 0 0 Updated Jul 2, 2024
  • ai-research-papers Public

    Summaries of AI Research Papers

    athina-ai/ai-research-papers’s past year of commit activity
    7 0 0 0 Updated Jun 29, 2024
  • athina-logger Public

    SDK to log LLM inference calls to Athina

    athina-ai/athina-logger’s past year of commit activity
    Python 1 1 0 0 Updated Jun 21, 2024
  • athina-ai/athina-docs’s past year of commit activity
    MDX 1 MIT 0 0 2 Updated Mar 29, 2024
  • athina-ai/athina-evals-ci’s past year of commit activity
    Python 2 0 0 0 Updated Feb 23, 2024
  • ragas Public Forked from explodinggradients/ragas

    Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

    athina-ai/ragas’s past year of commit activity
    Python 0 Apache-2.0 526 0 0 Updated Feb 5, 2024
  • athina-sdk Public

    LLM Testing SDK that helps you write and run tests to monitor your LLM app in production

    athina-ai/athina-sdk’s past year of commit activity
    Python 129 1 1 1 Updated Jan 22, 2024
  • ariadne Public

    LLM Evals for Text Summarization and RAG use-cases.

    athina-ai/ariadne’s past year of commit activity
    Python 34 Apache-2.0 0 0 0 Updated Jan 22, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…