Skip to content

Conversation

@yinggeh
Copy link
Contributor

@yinggeh yinggeh commented Oct 30, 2025

What does the PR do?

  • Enable /v1/embeddings inference request for OpenAI API frontend

Checklist

  • PR title reflects the change and is of format <commit_type>: <Title>
  • Changes are described in the pull request.
  • Related issues are referenced.
  • Populated github labels field
  • Added test plan and verified test passes.
  • Verified that the PR passes existing CI.
  • Verified copyright is correct on all changed files.
  • Added succinct git squash message before merging ref.
  • All template sections are filled out.
  • Optional: Additional screenshots for behavior/output changes with before/after.

Commit Type:

Check the conventional commit type
box here and add the label to the github PR.

  • feat

Related PRs:

triton-inference-server/vllm_backend#104

Where should the reviewer start?

Test plan:

  • CI Pipeline ID:

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

@yinggeh yinggeh self-assigned this Oct 30, 2025
@yinggeh yinggeh added the enhancement New feature or request label Oct 30, 2025
whoisj
whoisj previously approved these changes Oct 30, 2025
whoisj
whoisj previously approved these changes Oct 30, 2025
…to yinggeh/tri-49-request-for-openai-compatible-api-endpoints-for-triton
@yinggeh yinggeh changed the base branch from main to r25.10 October 30, 2025 22:59
@yinggeh
Copy link
Contributor Author

yinggeh commented Oct 30, 2025

Rebase to r25.10 to run pipeline.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Development

Successfully merging this pull request may close these issues.

4 participants