forked from vllm-project/vllm
Pull requests: HabanaAI/vllm-fork
#2170: Delay prefix cache calculation to find longest common prefix
  opened Dec 8, 2025 by ikurtchen (3 tasks done)

#2169: Bump actions/stale from 9.1.0 to 10.1.1
  labels: dependencies (updates a dependency file), github_actions (updates GitHub Actions code)
  opened Dec 8, 2025 by dependabot[bot]

#2166: [DeepSeek R1] Gracefully shutdown mooncake store when exception for kill
  opened Dec 4, 2025 by jerrychenhf

#2164: Enable delayed sampling for warmup also to remove graph compilation i…
  opened Dec 4, 2025 by yeonsily (3 tasks)

#2143: add VLLM_ENGINE_PROFILER_SKIP_STEPS to the engine profiler
  opened Nov 19, 2025 by yangulei

#2135: [DeepSeek R1] chunked prefill warmup with chunk size
  opened Nov 14, 2025 by jerrychenhf

#2093: Workaround for Assertion error when embedding with bge-m3 in lazy mode
  opened Oct 28, 2025 by slokesha

#2036: fix bug that VLLM_SKIP_WARMUP=1 is not recognized in vision_bucket
  opened Oct 15, 2025 by yingjie-han
Tip: filter pull requests by the default branch with base:habana_main.