[MLOB-5067] Simplify access to Goal Completeness #33646

tillwf · 2026-01-05T08:39:58Z

Closes [MLOB-5067](https://datadoghq.atlassian.net/browse/MLOB-5067)

github-actions · 2026-01-05T08:45:12Z

Preview links (active after the `build_preview` check completes)

Modified Files

estherk15 · 2026-01-05T20:22:08Z

content/en/llm_observability/evaluations/managed_evaluations/_index.md

+| Evaluated on LLM spans | Evaluated using LLM | Checks whether the agent resolved the user’s intent by analyzing full session spans. Runs only on sessions marked as completed. |
+
+##### How to Use
+<div class="alert alert-info">Goal completeness is only available for OpenAI and Azure OpenAI.</div>


Suggested change

<div class="alert alert-info">Goal completeness is only available for OpenAI and Azure OpenAI.</div>

<div class="alert alert-info">Goal Completeness is only available for OpenAI and Azure OpenAI.</div>

estherk15 · 2026-01-05T20:23:07Z

content/en/llm_observability/evaluations/managed_evaluations/_index.md

+
+The evaluation requires sending a span with a specific tag when the session ends. This signal allows the evaluation to identify session boundaries and trigger the completeness assessment:
+
+For optimal evaluation accuracy and cost control, it is preferable to send a tag when the session is finished and configure the evaluation to run only on session with this tag. The evaluation returns a detailed breakdown including resolved intentions, unresolved intentions, and reasoning for the assessment. A session is considered incomplete if more than 50% of identified intentions remain unresolved.


Suggested change

For optimal evaluation accuracy and cost control, it is preferable to send a tag when the session is finished and configure the evaluation to run only on session with this tag. The evaluation returns a detailed breakdown including resolved intentions, unresolved intentions, and reasoning for the assessment. A session is considered incomplete if more than 50% of identified intentions remain unresolved.

For optimal evaluation accuracy and cost control, it is preferable to send a tag when the session is finished and configure the evaluation to run only on sessions with this tag. The evaluation returns a detailed breakdown including resolved intentions, unresolved intentions, and reasoning for the assessment. A session is considered incomplete if more than 50% of identified intentions remain unresolved.

estherk15 · 2026-01-05T20:23:53Z

content/en/llm_observability/evaluations/managed_evaluations/_index.md

+
+1. Go to the **Goal Completeness** settings
+2. Configure the evaluation data:
+   - Select **spans** as the data type since Goal Completeness runs on LLM spans which contains the full session history.


Suggested change

- Select **spans** as the data type since Goal Completeness runs on LLM spans which contains the full session history.

- Select **spans** as the data type since Goal Completeness runs on LLM spans which contain the full session history.

tillwf requested a review from a team as a code owner January 5, 2026 08:39

[MLOB-5067] Simplify access to Goal Completeness

aa2a7eb

Closes [MLOB-5067](https://datadoghq.atlassian.net/browse/MLOB-5067)

tillwf force-pushed the till.wohlfarth/MLOB-5067/Simplify_gc_access branch from 44971e0 to aa2a7eb Compare January 5, 2026 08:41

estherk15 approved these changes Jan 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MLOB-5067] Simplify access to Goal Completeness #33646

[MLOB-5067] Simplify access to Goal Completeness #33646

tillwf commented Jan 5, 2026 •

edited by atlassian bot

Loading

Uh oh!

github-actions bot commented Jan 5, 2026

Uh oh!

estherk15 Jan 5, 2026

Uh oh!

estherk15 Jan 5, 2026

Uh oh!

estherk15 Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	<div class="alert alert-info">Goal completeness is only available for OpenAI and Azure OpenAI.</div>
	<div class="alert alert-info">Goal Completeness is only available for OpenAI and Azure OpenAI.</div>


		The evaluation requires sending a span with a specific tag when the session ends. This signal allows the evaluation to identify session boundaries and trigger the completeness assessment:

		For optimal evaluation accuracy and cost control, it is preferable to send a tag when the session is finished and configure the evaluation to run only on session with this tag. The evaluation returns a detailed breakdown including resolved intentions, unresolved intentions, and reasoning for the assessment. A session is considered incomplete if more than 50% of identified intentions remain unresolved.

	- Select spans as the data type since Goal Completeness runs on LLM spans which contains the full session history.
	- Select spans as the data type since Goal Completeness runs on LLM spans which contain the full session history.

[MLOB-5067] Simplify access to Goal Completeness #33646

Are you sure you want to change the base?

[MLOB-5067] Simplify access to Goal Completeness #33646

Conversation

tillwf commented Jan 5, 2026 • edited by atlassian bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 5, 2026

Preview links (active after the build_preview check completes)

Modified Files

Uh oh!

estherk15 Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

estherk15 Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

estherk15 Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tillwf commented Jan 5, 2026 •

edited by atlassian bot

Loading

Preview links (active after the `build_preview` check completes)