-
Notifications
You must be signed in to change notification settings - Fork 36
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Skip size calculation during async copy wait=True
#1126
opened Nov 19, 2025 by
rupengliu-meta
Loading…
[TPU Offload][WIP] Separate offload manager and cpu-cache backend, and code structure refactor
#1122
opened Nov 18, 2025 by
juncgu-google
Loading…
[Misc] Fix model dtype not being configured correctly
#1093
opened Nov 13, 2025 by
kyuyeunk
Loading…
[Llama4 Guard] Add JAX Llama-Guard-4-12B Text Portion
#1090
opened Nov 13, 2025 by
JiriesKaileh
Loading…
Enable Pipeline Parallelism on Jax models
#1077
opened Nov 12, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Enable Pipeline Parallelism on Jax runner
#1053
opened Nov 8, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
Enable Pipeline Parallelism on jax worker
#1043
opened Nov 7, 2025 by
Chenyaaang
Loading…
1 of 8 tasks
[Docs] fix dead links in multiple documentation pages
#1027
opened Nov 6, 2025 by
mattheliu
Loading…
3 tasks done
[FIX] Add dummy get_input_embeddings to fix vLLM model type check
#971
opened Oct 29, 2025 by
kuafou
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.