-
Notifications
You must be signed in to change notification settings - Fork 311
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
update sglang to 0.5.6
run-ci-precision
run-ci-short
#1051
opened Dec 8, 2025 by
lilei199908
Loading…
Update sglang patch: add update_weights_from_tensor to EagleWorkerV2
#1044
opened Dec 6, 2025 by
zhihengy
Loading…
[Draft] Update Megatron patch to work for Megatron v0.15.0
#1042
opened Dec 6, 2025 by
Birch-san
Loading…
feat: Support
list-of-dicts format for multimodal message content
#1037
opened Dec 5, 2025 by
ppraneth
Loading…
RDMA Support for the weight transferring from Megatron to SGL
#932
opened Nov 25, 2025 by
JensenFire
•
Draft
1 of 2 tasks
token-in-token-out and rollout log prob for fully async rl
#867
opened Nov 23, 2025 by
rbao2018
Loading…
[Feature/Fix] Support IPv6 host resolution and robust URI formatting
#859
opened Nov 21, 2025 by
Chen-GX
Loading…
feat(rollout): support evaluation parameter in custom generate function
#832
opened Nov 19, 2025 by
bcol23
Loading…
perf: Add pipelined weight update via --pipeline-update flag
#689
opened Nov 5, 2025 by
GeLee-Q
Loading…
feat: Add CISPO (Clipped IS-weight Policy Optimization)
#681
opened Nov 3, 2025 by
kekmodel
Loading…
[FSDP] Add an example script to run 4B demo with fsdp
#672
opened Nov 2, 2025 by
Zhuohao-Li
Loading…
Avoiding multiple ranks write into debug train data file concurrently
#642
opened Oct 30, 2025 by
YuchenFan48
Loading…
[Feature] Add sglang server metrics in wandb logging
#604
opened Oct 27, 2025 by
yitianlian
Loading…
Support advantage normalization and loss aggregation options
#545
opened Oct 21, 2025 by
sam571128
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.