Skip to content

Conversation

@c-ehrlich
Copy link
Contributor

No description provided.

}
);

Eval('text-match-eval', {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we encourage users to add capability name to metadata?

Create an eval file `src/evals/ticket-classification.eval.ts`:

```typescript
import { experimental_Eval as Eval, Scorer } from 'axiom/ai/evals';
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
import { experimental_Eval as Eval, Scorer } from 'axiom/ai/evals';
import { Eval, Scorer } from 'axiom/ai/evals';

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@c-ehrlich we need to write something about cli auth: axiom login, maybe @manototh can help us structure this in a good way. There would be 2 ways to authenticate the SDK:

  • Either using CLI, and then you only need to specify a dataset in axiom.config.ts
  • Specifying an API token and dataset in axiom.config.ts

Copy link
Collaborator

@manototh manototh Nov 13, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@thesollyz Thanks for this. I would specify API token only because that's what we use in the rest of the AI eng docs anyway.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@manototh not sure what do you mean? CLI auth is a new feature that we want the users to use.

@manototh manototh changed the title draft: initial eval docs Update Measure in AI eng Nov 13, 2025
title: "Measure"
description: "Learn how to measure the quality of your AI capabilities by running evaluations against ground truth data."
keywords: ["ai engineering", "AI engineering", "measure", "evals", "evaluation", "scoring", "graders"]
keywords: ["ai engineering", "AI engineering", "measure", "evals", "evaluation", "scoring", "scorers"]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it would be also good to have graders as a keyword here for users who would search for graders instead of scorers. Maybe we can add "scores" as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants