[ML] Move to the Cohere V2 API for new inference endpoints #129884

davidkyle · 2025-06-23T21:45:23Z

The Cohere V2 API contains 2 changes that must be adapted for

The model parameter is no longer optional
For embeddings the input_type parameter is no longer optional

Creating an endpoint without a model now causes a validation exception. input_type is declared either in task_settings or in the inference call, if not set in either or these places input_type defaults to search_query.

New inference endpoints will use the V2 API, existing endpoints will continue to use the V1 API. The user does not have the option of picking the V1 API in new endpoints. One possibly controversial aspect is that the API version is not surfaced to the user, the version is persisted with the model config but not included in the GET _inference response. I implemented this behaviour because the user does not have the ability to pick the API, in retrospect hiding the version now seems confusing.

The request classes have been moved to org.elasticsearch.xpack.inference.services.cohere.request.v1 and renamed. The new V2 request classes are in org.elasticsearch.xpack.inference.services.cohere.request.v2 (they are very similar).

The upgrade test CohereServiceUpgradeIT tests that the old v1 endpoints still work after upgrading.

elasticsearchmachine · 2025-06-23T21:45:48Z

Pinging @elastic/ml-core (Team:ML)

elasticsearchmachine · 2025-06-23T21:45:48Z

Hi @davidkyle, I've created a changelog YAML for you.

davidkyle · 2025-06-23T21:47:06Z

...inference/src/main/java/org/elasticsearch/xpack/inference/services/cohere/CohereService.java

@@ -166,24 +166,14 @@ private static CohereModel createModel(
        return switch (taskType) {
            case TEXT_EMBEDDING -> new CohereEmbeddingsModel(
                inferenceEntityId,
-                taskType,


Just removing the task type and service name parameters as both of these are known from model type.

davidkyle · 2025-06-23T21:49:06Z

...e/src/main/java/org/elasticsearch/xpack/inference/services/cohere/CohereServiceSettings.java

        builder.endObject();
        return builder;
    }

    public XContentBuilder toXContentFragment(XContentBuilder builder, Params params) throws IOException {
-        return toXContentFragmentOfExposedFields(builder, params);
+        toXContentFragmentOfExposedFields(builder, params);
+        return builder.field(API_VERSION, apiVersion); // API version is persisted but not exposed to the user


Moving this to the toXContent will expose it to the user

davidkyle · 2025-06-23T21:52:53Z

...e/src/main/java/org/elasticsearch/xpack/inference/services/cohere/request/CohereRequest.java

+    }
+
+    @Override
+    public HttpRequest createHttpRequest() {


Moving shared code to the base class

davidkyle · 2025-06-23T21:54:38Z

.../org/elasticsearch/xpack/inference/services/cohere/request/v1/CohereV1CompletionRequest.java

+import java.util.List;
+import java.util.Objects;
+
+public class CohereV1CompletionRequest extends CohereRequest {


The CohereCompletionRequest and CohereCompletionRequestEntity classes into a single class here. Same for the other request types, this is a refactoring not new code

davidkyle · 2025-06-23T21:56:32Z

.../org/elasticsearch/xpack/inference/services/cohere/request/v2/CohereV2CompletionRequest.java

+
+    @Override
+    protected List<String> pathSegments() {
+        return List.of(CohereUtils.VERSION_2, CohereUtils.CHAT_PATH);


Use the V2 API path.

davidkyle added 6 commits June 23, 2025 15:31

Basic v2 classes

936150c

Add v2 classes

14cc105

fix the tests

d5c80bb

start upgrade test

fcbbfa0

Upgrade test

d37dd1f

Fix the tests

92a373b

davidkyle added >enhancement :ml Machine learning auto-backport Automatically create backport pull requests when merged v8.19.0 v9.1.0 labels Jun 23, 2025

elasticsearchmachine added the Team:ML Meta label for the ML team label Jun 23, 2025

Update docs/changelog/129884.yaml

8860a61

github-actions bot deployed to docs-preview June 23, 2025 21:46 View deployment

[CI] Auto commit changes from spotless

fe3fed6

github-actions bot deployed to docs-preview June 23, 2025 21:55 View deployment

davidkyle commented Jun 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML] Move to the Cohere V2 API for new inference endpoints #129884

[ML] Move to the Cohere V2 API for new inference endpoints #129884

davidkyle commented Jun 23, 2025

Uh oh!

elasticsearchmachine commented Jun 23, 2025

Uh oh!

elasticsearchmachine commented Jun 23, 2025

Uh oh!

davidkyle Jun 23, 2025

Uh oh!

davidkyle Jun 23, 2025

Uh oh!

davidkyle Jun 23, 2025

Uh oh!

davidkyle Jun 23, 2025

Uh oh!

davidkyle Jun 23, 2025

Uh oh!

Uh oh!

[ML] Move to the Cohere V2 API for new inference endpoints #129884

Are you sure you want to change the base?

[ML] Move to the Cohere V2 API for new inference endpoints #129884

Conversation

davidkyle commented Jun 23, 2025

Uh oh!

elasticsearchmachine commented Jun 23, 2025

Uh oh!

elasticsearchmachine commented Jun 23, 2025

Uh oh!

davidkyle Jun 23, 2025

Choose a reason for hiding this comment

Uh oh!

davidkyle Jun 23, 2025

Choose a reason for hiding this comment

Uh oh!

davidkyle Jun 23, 2025

Choose a reason for hiding this comment

Uh oh!

davidkyle Jun 23, 2025

Choose a reason for hiding this comment

Uh oh!

davidkyle Jun 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!