Skip to content

Conversation

@sumingZero
Copy link
Contributor

Purpose

Toy proxy now supports PD-mixed round-robin scheduling

Test

PD disaggregation:
python3 /vllm-workspace/unified-cache-management/ucm/pd/toy_proxy_server.py \ --pd-disaggregation \ --prefiller-hosts localhost \ --prefiller-ports 7884 \ --decoder-hosts localhost \ --decoder-ports 7885 \ --port 9300
image

PD mix:
python3 /vllm-workspace/unified-cache-management/ucm/pd/toy_proxy_server.py \ --worker-hosts localhost localhost \ --worker-ports 7884 7885 \ --port 9300
image

flesher0813
flesher0813 previously approved these changes Oct 30, 2025
@flesher0813 flesher0813 merged commit b01501a into ModelEngine-Group:develop Oct 31, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants