Fixes descriptions in the Inference APIs #5566

kosabogi · 2025-10-29T09:38:47Z

This PR fixes description errors discovered during the https://github.com/elastic/docs-content-internal/issues/280 of the 8.x and 9.x API documentation.

Closes #5508, #5509, #5510, #5511, #5512

github-actions · 2025-10-29T09:42:47Z

Following you can find the validation changes against the target branch for the APIs.

API	Status	Request	Response
`bulk`	🟢	558/558 → 578/578	576/576 → 596/596
`esql.query`	🟢 → 🔴	359/359 → 361/363	0/0
`indices.create`	🔴	1414/1440 → 1434/1460	1440/1440 → 1460/1460
`indices.create_data_stream`	🟢	131/131 → 132/132	131/131 → 132/132
`indices.downsample`	🟢 → 🔴	9/9 → 40/42	9/9 → 42/42
`indices.get`	🟢	66/66 → 69/69	66/66 → 69/69
`indices.get_data_stream`	🔴	124/124 → 125/125	77/124 → 77/125
`indices.get_mapping`	🔴	213/213 → 225/225	202/213 → 214/225
`indices.get_settings`	🔴	86/86 → 93/93	66/86 → 73/93
`indices.put_index_template`	🔴	140/164 → 142/166	164/164 → 166/166
`indices.put_settings`	🔴	56/58 → 78/80	58/58 → 80/80
`indices.segments`	🟢 → 🔴	5/5 → 6/6	5/5 → 5/6
`ingest.put_pipeline`	🟢	78/78 → 79/79	78/78 → 79/79
`search`	🔴	2593/2612 → 2640/2659	2612/2612 → 2659/2659

You can validate these APIs yourself by using the make validate target.

specification/inference/delete/DeleteRequest.ts

leemthompo · 2025-10-29T16:40:50Z

specification/inference/put_cohere/PutCohereRequest.ts

+     * Applies only to the `sparse_embedding` and `text_embedding` task types.
+     * Not applicable to the `rerank`, `completion`, or `chat_completion` task types.


These generic messages might be confusing, because for example Cohere only supports the following task types:

completion

rerank

text_embedding

So consider deleting sparse_embedding and chat_completion here. I'd inspect all of these services to avoid confusing users by mention applicabilities that aren't available

You’re absolutely right. Thinking it through, that also means chunking_settings isn’t applicable for certain inference endpoints, even though now it appears for all of them. Could this be a bug, @davidkyle?
For example, the Create an Anthropic inference endpoint API supports only one task type (completion) so chunking doesn’t apply there.

Yes chunking_settings only applies to the text_embedding and sparse_embedding task types

Sorry was on auto-pilot and I thought this was a backport 🙈

I noticed a small nit that might confuse users

Co-authored-by: Liam Thompson <leemthompo@gmail.com>

davidkyle · 2025-10-31T09:34:16Z

specification/inference/_types/CommonTypes.ts

  completion,
  rerank,
-  space_embedding,
+  sparse_embedding,


davidkyle · 2025-10-31T09:36:47Z

specification/inference/put/PutRequest.ts

+ *
+ * NOTE: When creating an inference endpoint, the associated machine learning model is automatically deployed if it is not
+ * already running. After creating the endpoint, wait for the model deployment to complete before using it. You can verify
+ * the deployment status by using the Get trained model statistics API. In the response, look for "state": "fully_allocated"
+ * and ensure the "allocation_count" matches the "target_allocation_count". Avoid creating multiple endpoints for the same
+ * model unless required, as each endpoint consumes significant resources.
+ *


This text is in PutElasticsearchRequest.ts and PutElserRequest.ts and specific to those inference services we don't need it here too

Suggested change

*

* NOTE: When creating an inference endpoint, the associated machine learning model is automatically deployed if it is not

* already running. After creating the endpoint, wait for the model deployment to complete before using it. You can verify

* the deployment status by using the Get trained model statistics API. In the response, look for "state": "fully_allocated"

* and ensure the "allocation_count" matches the "target_allocation_count". Avoid creating multiple endpoints for the same

* model unless required, as each endpoint consumes significant resources.

*

davidkyle · 2025-10-31T09:37:18Z

specification/inference/put_cohere/PutCohereRequest.ts

+     * Applies only to the `sparse_embedding` and `text_embedding` task types.
+     * Not applicable to the `rerank`, `completion`, or `chat_completion` task types.


Yes chunking_settings only applies to the text_embedding and sparse_embedding task types

Improves descriptions in the Inference APIs

aafd99f

kosabogi requested review from leemthompo and pquentin October 29, 2025 09:38

kosabogi added specification backport 9.1 backport 9.2 labels Oct 29, 2025

Merge branch 'main' into inference-fixes

c8491cb

leemthompo previously approved these changes Oct 29, 2025

View reviewed changes

leemthompo reviewed Oct 29, 2025

View reviewed changes

specification/inference/delete/DeleteRequest.ts Outdated Show resolved Hide resolved

leemthompo reviewed Oct 29, 2025

View reviewed changes

Update specification/inference/delete/DeleteRequest.ts

b89aee3

Co-authored-by: Liam Thompson <leemthompo@gmail.com>

davidkyle reviewed Oct 31, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fixes descriptions in the Inference APIs #5566

Fixes descriptions in the Inference APIs #5566

kosabogi commented Oct 29, 2025

Uh oh!

github-actions bot commented Oct 29, 2025 •

edited

Loading

Uh oh!

Uh oh!

leemthompo Oct 29, 2025

Uh oh!

kosabogi Oct 30, 2025

Uh oh!

davidkyle Oct 31, 2025

Uh oh!

davidkyle Oct 31, 2025

Uh oh!

davidkyle Oct 31, 2025

Uh oh!

davidkyle Oct 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		* Applies only to the `sparse_embedding` and `text_embedding` task types.
		* Not applicable to the `rerank`, `completion`, or `chat_completion` task types.

Fixes descriptions in the Inference APIs #5566

Are you sure you want to change the base?

Fixes descriptions in the Inference APIs #5566

Conversation

kosabogi commented Oct 29, 2025

Uh oh!

github-actions bot commented Oct 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

leemthompo Oct 29, 2025

Choose a reason for hiding this comment

Uh oh!

kosabogi Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

davidkyle Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

davidkyle Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

davidkyle Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

davidkyle Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

github-actions bot commented Oct 29, 2025 •

edited

Loading