simple-loadbalancer

Usage

start a vllm server on platform onthingai.com then get the endpoint url

then edit the endpoints_config.yaml

Qwen/Qwen2.5-7B-Instruct: 
  - http://your-endpoint-url-here

then run the load_balancer.py

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
endpoints_config.yaml		endpoints_config.yaml
load_balancer.py		load_balancer.py
requirements.txt		requirements.txt