Skip to content

Conversation

@kosabogi
Copy link
Contributor

This PR fixes description errors discovered during the https://github.com/elastic/docs-content-internal/issues/280 of the 8.x and 9.x API documentation.

Closes #5508, #5509, #5510, #5511, #5512

@github-actions
Copy link
Contributor

github-actions bot commented Oct 29, 2025

Following you can find the validation changes against the target branch for the APIs.

API Status Request Response
bulk 🟢 558/558 → 578/578 576/576 → 596/596
esql.query 🟢 → 🔴 359/359 → 361/363 0/0
indices.create 🔴 1414/1440 → 1434/1460 1440/1440 → 1460/1460
indices.create_data_stream 🟢 131/131 → 132/132 131/131 → 132/132
indices.downsample 🟢 → 🔴 9/9 → 40/42 9/9 → 42/42
indices.get 🟢 66/66 → 69/69 66/66 → 69/69
indices.get_data_stream 🔴 124/124 → 125/125 77/124 → 77/125
indices.get_mapping 🔴 213/213 → 225/225 202/213 → 214/225
indices.get_settings 🔴 86/86 → 93/93 66/86 → 73/93
indices.put_index_template 🔴 140/164 → 142/166 164/164 → 166/166
indices.put_settings 🔴 56/58 → 78/80 58/58 → 80/80
indices.segments 🟢 → 🔴 5/5 → 6/6 5/5 → 5/6
ingest.put_pipeline 🟢 78/78 → 79/79 78/78 → 79/79
search 🔴 2593/2612 → 2640/2659 2612/2612 → 2659/2659

You can validate these APIs yourself by using the make validate target.

leemthompo
leemthompo previously approved these changes Oct 29, 2025
Comment on lines +68 to +69
* Applies only to the `sparse_embedding` and `text_embedding` task types.
* Not applicable to the `rerank`, `completion`, or `chat_completion` task types.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These generic messages might be confusing, because for example Cohere only supports the following task types:

  • completion
  • rerank
  • text_embedding

So consider deleting sparse_embedding and chat_completion here. I'd inspect all of these services to avoid confusing users by mention applicabilities that aren't available

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You’re absolutely right. Thinking it through, that also means chunking_settings isn’t applicable for certain inference endpoints, even though now it appears for all of them. Could this be a bug, @davidkyle?
For example, the Create an Anthropic inference endpoint API supports only one task type (completion) so chunking doesn’t apply there.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes chunking_settings only applies to the text_embedding and sparse_embedding task types

@leemthompo leemthompo dismissed their stale review October 29, 2025 16:41

Sorry was on auto-pilot and I thought this was a backport 🙈

I noticed a small nit that might confuse users

Co-authored-by: Liam Thompson <leemthompo@gmail.com>
completion,
rerank,
space_embedding,
sparse_embedding,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

😁

Comment on lines +54 to +60
*
* NOTE: When creating an inference endpoint, the associated machine learning model is automatically deployed if it is not
* already running. After creating the endpoint, wait for the model deployment to complete before using it. You can verify
* the deployment status by using the Get trained model statistics API. In the response, look for "state": "fully_allocated"
* and ensure the "allocation_count" matches the "target_allocation_count". Avoid creating multiple endpoints for the same
* model unless required, as each endpoint consumes significant resources.
*
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This text is in PutElasticsearchRequest.ts and PutElserRequest.ts and specific to those inference services we don't need it here too

Suggested change
*
* NOTE: When creating an inference endpoint, the associated machine learning model is automatically deployed if it is not
* already running. After creating the endpoint, wait for the model deployment to complete before using it. You can verify
* the deployment status by using the Get trained model statistics API. In the response, look for "state": "fully_allocated"
* and ensure the "allocation_count" matches the "target_allocation_count". Avoid creating multiple endpoints for the same
* model unless required, as each endpoint consumes significant resources.
*

Comment on lines +68 to +69
* Applies only to the `sparse_embedding` and `text_embedding` task types.
* Not applicable to the `rerank`, `completion`, or `chat_completion` task types.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes chunking_settings only applies to the text_embedding and sparse_embedding task types

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Missing prerequisites in the Inference API documentation

4 participants