Update Measure in AI eng #465

c-ehrlich · 2025-11-11T08:53:46Z

No description provided.

thesollyz · 2025-11-11T11:04:09Z

ai-engineering/measure.mdx

+  }
+);

 Eval('text-match-eval', {


Should we encourage users to add capability name to metadata?

thesollyz · 2025-11-11T11:22:36Z

ai-engineering/measure.mdx

+Create an eval file `src/evals/ticket-classification.eval.ts`:
+
+```typescript
+import { experimental_Eval as Eval, Scorer } from 'axiom/ai/evals';


Suggested change

-import { experimental_Eval as Eval, Scorer } from 'axiom/ai/evals';
+import { Eval, Scorer } from 'axiom/ai/evals';

thesollyz · 2025-11-13T10:47:01Z

ai-engineering/measure.mdx

@c-ehrlich we need to write something about CLI auth (`axiom login`). Maybe @manototh can help us structure this well. There would be two ways to authenticate the SDK:

- Use the CLI, in which case you only need to specify a dataset in `axiom.config.ts`.
- Specify an API token and a dataset in `axiom.config.ts`.

manototh · 2025-11-13 (edited)

@thesollyz Thanks for this. I would document the API token option only, because that's what we use in the rest of the AI eng docs anyway.

thesollyz · 2025-11-14

@manototh Not sure what you mean. CLI auth is a new feature that we want users to use.
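
To make the two options in this thread concrete, here is a minimal sketch of how they differ from the config author's point of view. It does not use the real SDK: the `EvalAuthConfig` interface, its `dataset` and `token` field names, the `resolveAuthMode` helper, and the example values are all hypothetical and only illustrate that the dataset is needed in both cases while the token is needed only when CLI auth is not used.

```typescript
// Hypothetical shape for the auth-related part of axiom.config.ts.
// Field names are assumptions for illustration, not the SDK's actual schema.
interface EvalAuthConfig {
  dataset: string; // required for both options
  token?: string;  // only set when not relying on `axiom login`
}

type AuthMode = 'cli' | 'token';

// Decide which of the two options a given config corresponds to.
function resolveAuthMode(config: EvalAuthConfig): AuthMode {
  if (!config.dataset) {
    throw new Error('A dataset must be specified for both auth options');
  }
  // A token in the config means option 2; otherwise fall back to CLI auth.
  return config.token ? 'token' : 'cli';
}

// Option 1: authenticate with `axiom login`, specify only the dataset.
const cliConfig: EvalAuthConfig = { dataset: 'my-eval-dataset' };

// Option 2: specify both an API token and the dataset (placeholder value).
const tokenConfig: EvalAuthConfig = {
  dataset: 'my-eval-dataset',
  token: 'example-api-token',
};

console.log(resolveAuthMode(cliConfig));
console.log(resolveAuthMode(tokenConfig));
```

If the docs cover both options, the dataset requirement can be stated once and the token presented as the only difference between them.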

thesollyz · 2025-11-14T10:54:03Z

ai-engineering/measure.mdx

 title: "Measure"
 description: "Learn how to measure the quality of your AI capabilities by running evaluations against ground truth data."
-keywords: ["ai engineering", "AI engineering", "measure", "evals", "evaluation", "scoring", "graders"]
+keywords: ["ai engineering", "AI engineering", "measure", "evals", "evaluation", "scoring", "scorers"]


I think it would also be good to have "graders" as a keyword here for users who search for graders instead of scorers. Maybe we can add "scores" as well.

c-ehrlich added 2 commits November 11, 2025 15:50

initial eval docs

2ae1a63

add note about instrumentation fn

a082b90

mintlify bot deployed to staging November 11, 2025 08:54

thesollyz reviewed Nov 11, 2025


thesollyz reviewed Nov 11, 2025


Stylistic fixes

7df0bdb

mintlify bot deployed to staging November 11, 2025 15:13

Quick fixes

0254557

mintlify bot deployed to staging November 13, 2025 09:10

thesollyz reviewed Nov 13, 2025


manototh changed the title from "draft: initial eval docs" to "Update Measure in AI eng" on Nov 13, 2025

Merge branch 'main' into evals-1

686a53e

mintlify bot deployed to staging November 13, 2025 13:06

Fixes

7b8bd25

mintlify bot deployed to staging November 13, 2025 13:50

thesollyz reviewed Nov 14, 2025

