
[ML] Move to the Cohere V2 API for new inference endpoints #129884


Merged
16 commits merged into elastic:main on Jun 24, 2025

Conversation

davidkyle (Member)

The Cohere V2 API introduces two changes that must be accommodated:

  1. The model parameter is no longer optional.
  2. For embeddings, the input_type parameter is no longer optional.

Creating an endpoint without a model now causes a validation exception. input_type can be declared either in task_settings or in the inference call; if it is not set in either of these places, it defaults to search_query.
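
To make that concrete, here is a minimal sketch (not taken from this PR) of creating a Cohere embedding endpoint and then calling it through the _inference REST API. The host, API key placeholder, model name, and the exact task_settings values are assumptions for illustration; the input_type values shown at the Elasticsearch level (ingest/search) are assumed to map onto Cohere's search_document/search_query.

```python
# Minimal sketch, not from the PR: create a Cohere embedding endpoint and call it.
# Host, API key, model name and the exact input_type values are assumptions.
import requests

ES = "http://localhost:9200"  # assumed local cluster

# With the V2 API the model is mandatory: omitting "model_id" here would now
# fail endpoint validation instead of falling back to a service default.
create = requests.put(
    f"{ES}/_inference/text_embedding/cohere-embeddings",
    json={
        "service": "cohere",
        "service_settings": {
            "api_key": "<COHERE_API_KEY>",     # placeholder
            "model_id": "embed-english-v3.0",  # example model, assumed
        },
        # input_type can be fixed at endpoint creation time via task_settings...
        "task_settings": {"input_type": "ingest"},
    },
)
create.raise_for_status()

# ...or supplied per request in the inference call. If it is set in neither
# place, the PR describes the default sent to Cohere as search_query.
infer = requests.post(
    f"{ES}/_inference/text_embedding/cohere-embeddings",
    json={
        "input": ["How do I reset my password?"],
        "task_settings": {"input_type": "search"},
    },
)
infer.raise_for_status()
print(infer.json())
```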

New inference endpoints will use the V2 API; existing endpoints will continue to use the V1 API. The user does not have the option of picking the V1 API for new endpoints. One possibly controversial aspect is that the API version is not surfaced to the user: the version is persisted with the model config but is not included in the GET _inference response. I implemented this behaviour because the user cannot choose the API version, but in retrospect hiding the version seems confusing.

The existing request classes have been moved to org.elasticsearch.xpack.inference.services.cohere.request.v1 and renamed. The new V2 request classes (which are very similar) are in org.elasticsearch.xpack.inference.services.cohere.request.v2.

The upgrade test CohereServiceUpgradeIT verifies that existing V1 endpoints continue to work after an upgrade.

davidkyle added the >enhancement, :ml, auto-backport, v8.19.0 and v9.1.0 labels on Jun 23, 2025
elasticsearchmachine added the Team:ML label on Jun 23, 2025
elasticsearchmachine (Collaborator)

Pinging @elastic/ml-core (Team:ML)

elasticsearchmachine (Collaborator)

Hi @davidkyle, I've created a changelog YAML for you.

davidkyle enabled auto-merge (squash) on June 24, 2025 15:36
davidkyle (Member, Author)

Test this please

davidkyle merged commit 3a1551e into elastic:main on Jun 24, 2025
32 checks passed
elasticsearchmachine (Collaborator)

💔 Backport failed

Branch: 8.19
Result: Commit could not be cherry-picked due to conflicts

You can use sqren/backport to backport manually by running: backport --upstream elastic/elasticsearch --pr 129884
