Fix: torch.AcceleratorError: CUDA error: an illegal memory access was encountered #89

Open
itechbear wants to merge 1 commit into sgl-project:main from itechbear:patch-1

Conversation

@itechbear
itechbear commented Mar 1, 2026

Summary

Replaces copy_done.synchronize() with torch.cuda.synchronize(self.device) in the scheduler. This fixes the error torch.AcceleratorError: CUDA error: an illegal memory access was encountered, which is raised while the server is being benchmarked.

I'm not sure whether the patch causes a performance degradation, since it might effectively disable the CPU/GPU overlapped scheduling feature, but I think it is a good starting point.

The Problem

When I benchmark the server, it consistently raises the error shown in the server log below.

Hardware

$ nvidia-smi
Sun Mar  1 15:02:56 2026       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 590.48.01              Driver Version: 590.48.01      CUDA Version: 13.1     |
+-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 3060        On  |   00000000:01:00.0  On |                  N/A |
|  0%   56C    P8             20W /  170W |   11036MiB /  12288MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

Server log

$ python -m minisgl --model Qwen/Qwen3-1.7B --memory-ratio 0.8 --max-running-req 128 --port 1919
[2026-03-01|10:49:34] INFO     Parsed arguments:
ServerArgs(model_path='Qwen/Qwen3-1.7B', tp_info=DistributedInfo(rank=0, size=1), dtype=torch.bfloat16, max_running_req=128, attention_backend='auto', moe_backend='auto', cuda_graph_bs=None, cuda_graph_max_bs=None, page_size=1, memory_ratio=0.8, distributed_timeout=60.0, use_dummy_weight=False, use_pynccl=True, max_seq_len_override=None, num_page_override=None, max_extend_tokens=8192, cache_type='radix', offline_mode=False, _unique_suffix='.pid=12795', server_host='127.0.0.1', server_port=1919, num_tokenizer=0, silent_output=False)
[2026-03-01|10:49:35|core|rank=0] INFO     Auto-selected attention backend: fi
[Gloo] Rank 0 is connected to 0 peer ranks. Expected number of connected peer ranks is : 0
[2026-03-01|10:49:36|core|rank=0] INFO     Free memory before loading model: 10.74 GiB
Loading weights:  50%|███████████████████████████████████████████▌                                           | 1/2 [00:00<00:00,  2.79it/s]
[2026-03-01|10:49:37|initializer] INFO     Tokenize server 0 is ready
Loading weights: 100%|███████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00,  4.82it/s]
[2026-03-01|10:49:38|core|rank=0] INFO     Allocating 45228 tokens for KV cache, K + V = 4.83 GiB
[2026-03-01|10:49:38|core|rank=0] INFO     Free memory after initialization: 2.00 GiB
[2026-03-01|10:49:38|core|rank=0] INFO     Start capturing CUDA graphs with sizes: [1, 2, 4, 8, 16, 24, 32, 40, 48, 56, 64, 72, 80, 88, 96, 104, 112, 120, 128, 136, 144, 152, 160]
[2026-03-01|10:49:38|core|rank=0] INFO     Free GPU memory before capturing CUDA graphs: 1.97 GiB
Capturing graphs: bs = 1   | avail_mem = 1.71 GiB: 100%|████████████████████████████████████████████████| 23/23 [00:00<00:00, 28.81batch/s]
[2026-03-01|10:49:38|core|rank=0] INFO     Free GPU memory after capturing CUDA graphs: 1.70 GiB
[2026-03-01|10:49:40|core|rank=0] INFO     Scheduler is idle, waiting for new reqs...
[2026-03-01|10:49:40|initializer] INFO     Scheduler is ready
[2026-03-01|10:49:40|FrontendAPI] INFO     API server is ready to serve on 127.0.0.1:1919
INFO:     Started server process [12795]
INFO:     Waiting for application startup.
INFO:     Application startup complete.
INFO:     Uvicorn running on http://127.0.0.1:1919 (Press CTRL+C to quit)
INFO:     127.0.0.1:59220 - "GET /v1/models HTTP/1.1" 200 OK
INFO:     127.0.0.1:49772 - "POST /v1/chat/completions HTTP/1.1" 200 OK
[2026-03-01|10:50:44|core|rank=0] INFO     Scheduler is idle, waiting for new reqs...
INFO:     127.0.0.1:49778 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49792 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49800 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49816 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49828 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49836 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49852 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49856 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49858 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49872 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49882 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49888 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49896 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49902 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49916 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49918 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49932 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49936 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49950 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49966 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49970 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49980 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:49986 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50000 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50002 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50010 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50024 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50032 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50038 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50046 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50050 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50058 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50070 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50076 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50092 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50102 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50110 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50120 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50126 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50138 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50146 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50150 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50156 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50170 - "POST /v1/chat/completions HTTP/1.1" 200 OK
Process minisgl-TP0-scheduler:
Traceback (most recent call last):
  File "/usr/lib/python3.12/multiprocessing/process.py", line 314, in _bootstrap
    self.run()
  File "/usr/lib/python3.12/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "mini-sglang/main/python/minisgl/server/launch.py", line 31, in _run_scheduler
    scheduler.run_forever()
  File "mini-sglang/main/.venv/lib/python3.12/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "mini-sglang/main/python/minisgl/scheduler/scheduler.py", line 131, in run_forever
    data = self.overlap_loop(data)
           ^^^^^^^^^^^^^^^^^^^^^^^
  File "mini-sglang/main/python/minisgl/scheduler/scheduler.py", line 105, in overlap_loop
    self._process_last_data(last_data)
  File "mini-sglang/main/python/minisgl/scheduler/scheduler.py", line 143, in _process_last_data
    copy_done.synchronize()
  File "mini-sglang/main/.venv/lib/python3.12/site-packages/torch/cuda/streams.py", line 231, in synchronize
    super().synchronize()
torch.AcceleratorError: CUDA error: an illegal memory access was encountered
Search for `cudaErrorIllegalAddress' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information.
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.

terminate called after throwing an instance of 'c10::AcceleratorError'
  what():  CUDA error: an illegal memory access was encountered
Search for `cudaErrorIllegalAddress' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information.
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.

Exception raised from c10_cuda_check_implementation at /pytorch/c10/cuda/CUDAException.cpp:44 (most recent call first):
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x80 (0x7622a2b7cb80 in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libc10.so)
frame #1: <unknown function> + 0x11fb7 (0x7622a2f44fb7 in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libc10_cuda.so)
frame #2: <unknown function> + 0xc77b28 (0x762243e77b28 in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #3: <unknown function> + 0xc72b53 (0x762243e72b53 in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #4: <unknown function> + 0xc7a685 (0x762243e7a685 in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #5: <unknown function> + 0x4827af (0x7622944827af in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libtorch_python.so)
frame #6: c10::TensorImpl::~TensorImpl() + 0x9 (0x7622a2b59d69 in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libc10.so)
frame #7: <unknown function> + 0x7cb658 (0x7622947cb658 in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libtorch_python.so)
frame #8: <unknown function> + 0x7cb9c5 (0x7622947cb9c5 in mini-sglang/main/.venv/lib/python3.12/site-packages/torch/lib/libtorch_python.so)
frame #9: mini-sglang/main/.venv/bin/python() [0x59159a]
frame #10: mini-sglang/main/.venv/bin/python() [0x59f2a3]
frame #11: mini-sglang/main/.venv/bin/python() [0x591572]
frame #12: mini-sglang/main/.venv/bin/python() [0x558d21]
frame #13: mini-sglang/main/.venv/bin/python() [0x6107d5]
frame #14: mini-sglang/main/.venv/bin/python() [0x6107e5]
frame #15: mini-sglang/main/.venv/bin/python() [0x6107e5]
frame #16: mini-sglang/main/.venv/bin/python() [0x6107e5]
frame #17: mini-sglang/main/.venv/bin/python() [0x6107e5]
frame #18: mini-sglang/main/.venv/bin/python() [0x6107e5]
frame #19: mini-sglang/main/.venv/bin/python() [0x5532db]
frame #20: mini-sglang/main/.venv/bin/python() [0x59f2a3]
frame #21: _PyEval_EvalFrameDefault + 0x566f (0x5dbecf in mini-sglang/main/.venv/bin/python)
frame #22: PyEval_EvalCode + 0x15b (0x5d582b in mini-sglang/main/.venv/bin/python)
frame #23: PyRun_StringFlags + 0xd3 (0x6087b3 in mini-sglang/main/.venv/bin/python)
frame #24: PyRun_SimpleStringFlags + 0x3e (0x6b392e in mini-sglang/main/.venv/bin/python)
frame #25: Py_RunMain + 0x481 (0x6bc5f1 in mini-sglang/main/.venv/bin/python)
frame #26: Py_BytesMain + 0x2d (0x6bc00d in mini-sglang/main/.venv/bin/python)
frame #27: <unknown function> + 0x2a1ca (0x7622a3c2a1ca in /lib/x86_64-linux-gnu/libc.so.6)
frame #28: __libc_start_main + 0x8b (0x7622a3c2a28b in /lib/x86_64-linux-gnu/libc.so.6)
frame #29: _start + 0x25 (0x657445 in mini-sglang/main/.venv/bin/python)

INFO:     127.0.0.1:50180 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50188 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50196 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50212 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50220 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50234 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50248 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50264 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50276 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50284 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50300 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50302 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50316 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50324 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50328 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50344 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50356 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50372 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50376 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50380 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50382 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50388 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50390 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50392 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50394 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50404 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50406 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50416 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50420 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50436 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50450 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50466 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50478 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50488 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50492 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50494 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50500 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50514 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50522 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50536 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50542 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50548 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50560 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50564 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50578 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50590 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50598 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50610 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50622 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50630 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50636 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50644 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50646 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50650 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50654 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:50664 - "POST /v1/chat/completions HTTP/1.1" 200 OK

Benchmark command

The bench_serving.py script was taken from https://github.com/sgl-project/sglang/blob/main/python/sglang/bench_serving.py without any modification.

python bench_serving.py --backend sglang-oai-chat --dataset-name random --random-input 128 --random-output 128 --num-prompts 500 --request-rate 128 --random-range-ratio 1.0 --base-url http://127.0.0.1:1919

Rationale of the fix

The following content was mostly generated by AI. I reviewed it before adding it here.

Root Cause: Resource Race in Overlap Scheduling
In the overlap_loop of Scheduler, the next batch ($N$) is scheduled and launched on the GPU before the results of the previous batch ($N-1$) are processed:

  1. Scheduling Batch $N$: _schedule_next_batch is called. Because the results of batch $N-1$ haven't been processed yet, any requests that finished in batch $N-1$ (e.g., due to an EOS token) are still marked as "running." Consequently, they are scheduled again for batch $N$—this is "speculative execution."
  2. Launching Batch $N$: Batch $N$ starts executing on the engine.stream (GPU).
  3. Processing Batch $N-1$: _process_last_data is called for batch $N-1$. It identifies requests that finished and calls _free_req_resources, which returns their KV cache pages and table slots to the free pool.
  4. The IMA (Illegal Memory Access): Because batch $N$ is already running on the GPU and is using those same resources (speculatively), freeing them on the CPU allows the next iteration ($N+1$) to reallocate and overwrite them. This leads to the batch $N$ kernels reading corrupted or out-of-bounds indices from the page_table, resulting in an illegal memory access.
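
The four steps above can be modeled without a GPU. The following is a minimal toy sketch, assuming a LIFO free list for pages. PagePool and simulate are hypothetical names for illustration, not mini-sglang code. It only illustrates the hazard the rationale claims, not whether the real scheduler has it.

```python
class PagePool:
    """Toy free-list allocator standing in for the KV-cache page pool (hypothetical)."""

    def __init__(self, num_pages: int) -> None:
        self.free = list(range(num_pages))

    def alloc(self) -> int:
        # LIFO: the most recently released page is handed out first.
        return self.free.pop()

    def release(self, page: int) -> None:
        self.free.append(page)


def simulate(wait_for_inflight_batch: bool) -> bool:
    """Return True if in-flight batch N ends up sharing a page with batch N+1."""
    pool = PagePool(num_pages=2)
    page = pool.alloc()          # page of a request that finished in batch N-1
    inflight_pages = [page]      # batch N was launched speculatively with that page
    if wait_for_inflight_batch:
        inflight_pages.clear()   # models a sync: batch N completes before any free
    pool.release(page)           # processing batch N-1 frees the finished request
    new_page = pool.alloc()      # batch N+1 allocates and gets the same page back
    return new_page in inflight_pages
```

Calling simulate(False) reports a page shared between the in-flight batch and the next allocation, while simulate(True) does not, which is the effect the rationale attributes to waiting for batch $N$.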

Why the Fix Works
The original code used copy_done.synchronize(), which only blocked the CPU until batch $N-1$ was finished. This allowed _process_last_data to proceed while batch $N$ was still running.

By replacing it with torch.cuda.synchronize(self.device), the scheduler now waits for all ongoing GPU operations on the device to complete. This includes the speculatively launched batch $N$. As a result:

  • Batch $N$ is guaranteed to finish its execution before _process_last_data begins freeing any resources.
  • The race condition is eliminated because no resource is returned to the free pool while a GPU kernel is still potentially accessing it.

The fix restores stability by effectively serializing parts of the execution: speculatively scheduled tokens no longer conflict with resource management, which prevents the CUDA illegal memory access reported in the logs.
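
The difference in scope between the two calls can be sketched in isolation. This is a minimal example, assuming copy_done is a torch.cuda.Event (the traceback only shows that its synchronize() lives in torch/cuda/streams.py). The side stream is a hypothetical stand-in for the engine stream, and compare_sync_scopes is not part of mini-sglang.

```python
try:
    import torch
except ImportError:  # keep the sketch importable on machines without PyTorch
    torch = None


def compare_sync_scopes():
    """Contrast an event wait with a device-wide wait; returns None without CUDA."""
    if torch is None or not torch.cuda.is_available():
        return None
    device = torch.device("cuda", 0)
    main = torch.cuda.current_stream(device)
    side = torch.cuda.Stream(device=device)   # hypothetical second stream

    x = torch.randn(1024, 1024, device=device)
    y = x @ x                                 # stands in for batch N-1 on the current stream
    copy_done = torch.cuda.Event()
    copy_done.record()                        # covers only work queued on the current stream so far

    side.wait_stream(main)                    # make x safely visible to the side stream
    with torch.cuda.stream(side):
        z = x @ x                             # stands in for batch N on another stream

    copy_done.synchronize()                   # returns once the recorded work is done
    side_pending = not side.query()           # the side stream may still be busy here
    torch.cuda.synchronize(device)            # waits for every stream on the device
    return side_pending, side.query()
```

The first element of the result depends on timing, so it is not predictable. The second is always True, because a device-wide synchronize returns only after all queued work on every stream has completed. That broader scope is also why the patch may cost the overlap between CPU scheduling and GPU execution.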

… encountered

Update synchronization method in scheduler.py. Replaced copy_done.synchronize() with torch.cuda.synchronize(self.device) for better synchronization handling.
@DarkSharpness
Collaborator

DarkSharpness commented Mar 2, 2026

Hi. Could you share your benchmark code and which GPU you are using? A correctly implemented overlap scheduler should contain no race condition. The issue you describe should not be the root cause, because all loads from and writes to the global page table are performed on the scheduler's stream. During forward execution, the batch only reads the indices from the metadata in the attention backend, which is safe.

BTW, could you refer to #58 and try export MINISGL_OVERLAP_EXTRA_SYNC=1 first? I've never run tests on pre-Hopper GPUs, and there seem to be recurring IMAs on Ampere GPUs (see #67, #58). If this flag resolves your issue, we may enable it by default on Ampere GPUs.
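
A per-architecture default like the one suggested here could look roughly as follows. This is only a sketch: default_extra_sync and resolve_extra_sync are hypothetical helpers, and the accepted truthy strings are an assumption, since the thread does not show how mini-sglang parses MINISGL_OVERLAP_EXTRA_SYNC. It relies on Ampere and Ada GPUs reporting compute capability 8.x and Hopper reporting 9.x.

```python
import os

HOPPER_MAJOR = 9  # compute capability 9.x is Hopper; Ampere and Ada are 8.x


def default_extra_sync(capability_major: int) -> bool:
    """Hypothetical policy: enable the extra sync on pre-Hopper GPUs."""
    return capability_major < HOPPER_MAJOR


def resolve_extra_sync(capability_major: int, environ=os.environ) -> bool:
    """An explicit environment value wins; otherwise fall back to the per-GPU default."""
    raw = environ.get("MINISGL_OVERLAP_EXTRA_SYNC")
    if raw is None:
        return default_extra_sync(capability_major)
    # Assumed parsing rule; the real flag handling may differ.
    return raw.strip().lower() in {"1", "true", "yes"}
```

The major version can be obtained at runtime with torch.cuda.get_device_capability(0)[0]. An RTX 3060 reports 8.6, so this policy would turn the extra sync on for it unless the variable is explicitly set to a falsy value.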

@itechbear
Author

The information about benchmark scripts is documented in the "Benchmark command" section.

I'll try setting export MINISGL_OVERLAP_EXTRA_SYNC=1 and get back to you when I have updates.

@itechbear
Author

itechbear commented Mar 2, 2026

The problem still exists. An interesting new finding is that, with the debugging statement shown below added, the server hangs and no requests complete. It might just be because of the exception.

I added a debugging statement to scheduler.py:

$ git log -n 1
commit c7f800d5e8654703683136e87d4fd42cfba13852 (HEAD -> origin-main, origin/main, origin/HEAD)
Author: DarkSharpness <76582120+DarkSharpness@users.noreply.github.com>
Date:   Thu Feb 26 23:26:37 2026 +0800

    [Minor] Style cleanup (#86)
    
    * [feature] support cache unfinished request
    
    * [minor] clean up code
    
    * [minor] more code clean up & renaming
    
    * [minor] make pre-commit happy
$ git diff
diff --git a/python/minisgl/scheduler/scheduler.py b/python/minisgl/scheduler/scheduler.py
index 574bdeb..1dba6e6 100644
--- a/python/minisgl/scheduler/scheduler.py
+++ b/python/minisgl/scheduler/scheduler.py
@@ -228,6 +228,7 @@ class Scheduler(SchedulerIOMixin):
         batch, sample_args, input_mapping, output_mapping = forward_input
         batch.input_ids = self.token_pool[input_mapping]
         if ENV.OVERLAP_EXTRA_SYNC:  # NOTE: https://github.com/sgl-project/mini-sglang/issues/58
+            print('extra sync!')
             self.stream.synchronize()
         forward_output = self.engine.forward_batch(batch, sample_args)
         self.token_pool[output_mapping] = forward_output.next_tokens_gpu

The server hung and failed to finish any request. I had to abort the client.

$ export MINISGL_OVERLAP_EXTRA_SYNC=1
$ python -m minisgl --model Qwen/Qwen3-1.7B --memory-ratio 0.8 --max-running-req 128 --port 1919
[2026-03-02|21:46:33] INFO     Parsed arguments:
ServerArgs(model_path='Qwen/Qwen3-1.7B', tp_info=DistributedInfo(rank=0, size=1), dtype=torch.bfloat16, max_running_req=128, attention_backend='auto', moe_backend='auto', cuda_graph_bs=None, cuda_graph_max_bs=None, page_size=1, memory_ratio=0.8, distributed_timeout=60.0, use_dummy_weight=False, use_pynccl=True, max_seq_len_override=None, num_page_override=None, max_extend_tokens=8192, cache_type='radix', offline_mode=False, _unique_suffix='.pid=8748', server_host='127.0.0.1', server_port=1919, num_tokenizer=0, silent_output=False)
[2026-03-02|21:46:34|core|rank=0] INFO     Auto-selected attention backend: fi
[Gloo] Rank 0 is connected to 0 peer ranks. Expected number of connected peer ranks is : 0
[2026-03-02|21:46:36|core|rank=0] INFO     Free memory before loading model: 10.90 GiB
Loading weights: 100%|███████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00,  5.50it/s]
[2026-03-02|21:46:37|core|rank=0] INFO     Allocating 46490 tokens for KV cache, K + V = 4.97 GiB
[2026-03-02|21:46:37|core|rank=0] INFO     Free memory after initialization: 2.03 GiB
[2026-03-02|21:46:37|core|rank=0] INFO     Start capturing CUDA graphs with sizes: [1, 2, 4, 8, 16, 24, 32, 40, 48, 56, 64, 72, 80, 88, 96, 104, 112, 120, 128, 136, 144, 152, 160]
[2026-03-02|21:46:37|core|rank=0] INFO     Free GPU memory before capturing CUDA graphs: 2.01 GiB
Capturing graphs: bs = 80  | avail_mem = 1.79 GiB:  35%|█████████████████                                | 8/23 [00:00<00:00, 21.97batch/s]
[2026-03-02|21:46:37|initializer] INFO     Tokenize server 0 is ready
Capturing graphs: bs = 1   | avail_mem = 1.74 GiB: 100%|████████████████████████████████████████████████| 23/23 [00:00<00:00, 30.45batch/s]
[2026-03-02|21:46:37|core|rank=0] INFO     Free GPU memory after capturing CUDA graphs: 1.73 GiB
[2026-03-02|21:46:41|core|rank=0] INFO     Scheduler is idle, waiting for new reqs...
[2026-03-02|21:46:41|initializer] INFO     Scheduler is ready
[2026-03-02|21:46:41|FrontendAPI] INFO     API server is ready to serve on 127.0.0.1:1919
INFO:     Started server process [8748]
INFO:     Waiting for application startup.
INFO:     Application startup complete.
INFO:     Uvicorn running on http://127.0.0.1:1919 (Press CTRL+C to quit)
INFO:     127.0.0.1:39710 - "GET /v1/models HTTP/1.1" 200 OK
INFO:     127.0.0.1:60960 - "POST /v1/chat/completions HTTP/1.1" 200 OK
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
extra sync!
[2026-03-02|21:46:52|core|rank=0] INFO     Scheduler is idle, waiting for new reqs...
INFO:     127.0.0.1:60968 - "POST /v1/chat/completions HTTP/1.1" 200 OK
extra sync!
INFO:     127.0.0.1:60984 - "POST /v1/chat/completions HTTP/1.1" 200 OK
extra sync!
extra sync!
INFO:     127.0.0.1:60996 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32780 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32786 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32796 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32808 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32818 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32826 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32832 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32840 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32846 - "POST /v1/chat/completions HTTP/1.1" 200 OK
extra sync!
INFO:     127.0.0.1:32856 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32870 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32874 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32878 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32880 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32884 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32898 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32910 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32924 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32928 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32940 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32948 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32964 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32978 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32988 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:32992 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33002 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33016 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33032 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33042 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33050 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33060 - "POST /v1/chat/completions HTTP/1.1" 200 OK
extra sync!
INFO:     127.0.0.1:33074 - "POST /v1/chat/completions HTTP/1.1" 200 OK
extra sync!
INFO:     127.0.0.1:33076 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33084 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33090 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33092 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33100 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33116 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33124 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33128 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33134 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33148 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33150 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33158 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33172 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33188 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33202 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33218 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33230 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33236 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33252 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33260 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33270 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33284 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33298 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33300 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33308 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33322 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33338 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33346 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33348 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33352 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33354 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33368 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33374 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33380 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33392 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33406 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33414 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33416 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33420 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33434 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33442 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33450 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33464 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33472 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33476 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33488 - "POST /v1/chat/completions HTTP/1.1" 200 OK
extra sync!
INFO:     127.0.0.1:33502 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33516 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33528 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33540 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33550 - "POST /v1/chat/completions HTTP/1.1" 200 OK
extra sync!
Process minisgl-TP0-scheduler:
Traceback (most recent call last):
  File "/usr/lib/python3.12/multiprocessing/process.py", line 314, in _bootstrap
    self.run()
  File "/usr/lib/python3.12/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "python/minisgl/server/launch.py", line 31, in _run_scheduler
    scheduler.run_forever()
  File ".venv/lib/python3.12/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "python/minisgl/scheduler/scheduler.py", line 131, in run_forever
    data = self.overlap_loop(data)
           ^^^^^^^^^^^^^^^^^^^^^^^
  File "python/minisgl/scheduler/scheduler.py", line 105, in overlap_loop
    self._process_last_data(last_data)
  File "python/minisgl/scheduler/scheduler.py", line 143, in _process_last_data
    copy_done.synchronize()
  File ".venv/lib/python3.12/site-packages/torch/cuda/streams.py", line 231, in synchronize
    super().synchronize()
torch.AcceleratorError: CUDA error: an illegal memory access was encountered
Search for `cudaErrorIllegalAddress' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information.
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.

terminate called after throwing an instance of 'c10::AcceleratorError'
  what():  CUDA error: an illegal memory access was encountered
Search for `cudaErrorIllegalAddress' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information.
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.

Exception raised from c10_cuda_check_implementation at /pytorch/c10/cuda/CUDAException.cpp:44 (most recent call first):
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x80 (0x71a256f7cb80 in .venv/lib/python3.12/site-packages/torch/lib/libc10.so)
frame #1: <unknown function> + 0x11fb7 (0x71a25db66fb7 in .venv/lib/python3.12/site-packages/torch/lib/libc10_cuda.so)
frame #2: <unknown function> + 0xc77b28 (0x71a1fee77b28 in .venv/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #3: <unknown function> + 0xc72b53 (0x71a1fee72b53 in .venv/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #4: <unknown function> + 0xc7a685 (0x71a1fee7a685 in .venv/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #5: <unknown function> + 0x4827af (0x71a24f4827af in .venv/lib/python3.12/site-packages/torch/lib/libtorch_python.so)
frame #6: c10::TensorImpl::~TensorImpl() + 0x9 (0x71a256f59d69 in .venv/lib/python3.12/site-packages/torch/lib/libc10.so)
frame #7: <unknown function> + 0x7cb658 (0x71a24f7cb658 in .venv/lib/python3.12/site-packages/torch/lib/libtorch_python.so)
frame #8: <unknown function> + 0x7cb9c5 (0x71a24f7cb9c5 in .venv/lib/python3.12/site-packages/torch/lib/libtorch_python.so)
frame #9: .venv/bin/python() [0x59159a]
frame #10: .venv/bin/python() [0x59f2a3]
frame #11: .venv/bin/python() [0x591572]
frame #12: .venv/bin/python() [0x558d21]
frame #13: .venv/bin/python() [0x6107d5]
frame #14: .venv/bin/python() [0x6107e5]
frame #15: .venv/bin/python() [0x6107e5]
frame #16: .venv/bin/python() [0x6107e5]
frame #17: .venv/bin/python() [0x6107e5]
frame #18: .venv/bin/python() [0x6107e5]
frame #19: .venv/bin/python() [0x5532db]
frame #20: .venv/bin/python() [0x59f2a3]
frame #21: _PyEval_EvalFrameDefault + 0x566f (0x5dbecf in .venv/bin/python)
frame #22: PyEval_EvalCode + 0x15b (0x5d582b in .venv/bin/python)
frame #23: PyRun_StringFlags + 0xd3 (0x6087b3 in .venv/bin/python)
frame #24: PyRun_SimpleStringFlags + 0x3e (0x6b392e in .venv/bin/python)
frame #25: Py_RunMain + 0x481 (0x6bc5f1 in .venv/bin/python)
frame #26: Py_BytesMain + 0x2d (0x6bc00d in .venv/bin/python)
frame #27: <unknown function> + 0x2a1ca (0x71a25ec2a1ca in /lib/x86_64-linux-gnu/libc.so.6)
frame #28: __libc_start_main + 0x8b (0x71a25ec2a28b in /lib/x86_64-linux-gnu/libc.so.6)
frame #29: _start + 0x25 (0x657445 in .venv/bin/python)

INFO:     127.0.0.1:33558 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33570 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33580 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33582 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33598 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33608 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33622 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33634 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33642 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33652 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33664 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33680 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33682 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33686 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33692 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33694 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33706 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33718 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33724 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33730 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33742 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33750 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33758 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33760 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33776 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33792 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33808 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33814 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33816 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33826 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33828 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33844 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33850 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33854 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33856 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33868 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33876 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33884 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33900 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33904 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33906 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33914 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33920 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33936 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33948 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33956 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33958 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33970 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33978 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33994 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:33996 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34008 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34024 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34038 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34052 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34054 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34056 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34072 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34084 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34098 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34100 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34110 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34114 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34120 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34122 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34128 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34144 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34152 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34154 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34162 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34170 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34182 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34196 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34206 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34214 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34222 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34230 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34246 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34248 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34256 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34264 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34280 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34294 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34296 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34302 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34312 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34328 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34344 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34352 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34366 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34382 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34394 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34400 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34406 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34412 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34424 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34428 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34440 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34450 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34466 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34482 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34486 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34502 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34518 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34526 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34536 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34540 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34554 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34566 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34580 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34594 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34598 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34604 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34618 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34624 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34626 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34632 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34640 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34652 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34664 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34676 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34692 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34696 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34698 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34702 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34706 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34722 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34736 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34746 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34748 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34760 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34770 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34776 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34782 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34784 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34794 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34796 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34806 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34814 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34830 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34834 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34838 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34846 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34860 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34864 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34866 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34874 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34876 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34890 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34900 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34914 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34928 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34938 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34948 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34952 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34962 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34978 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34986 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:34998 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35002 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35006 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35008 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35018 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35022 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35030 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35040 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35042 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35048 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35064 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35078 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35092 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35108 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35118 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35124 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35126 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35138 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35146 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35158 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35162 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35168 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35184 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35190 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35198 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35210 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35212 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35222 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35236 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35248 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35254 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35268 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35270 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35286 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35290 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35298 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35304 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35316 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35324 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35328 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35338 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35342 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35346 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35354 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35370 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35378 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35392 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35396 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35404 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35418 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35428 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35432 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35436 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35450 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35462 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35474 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35482 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35498 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35508 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35518 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35530 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35544 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35552 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35556 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35564 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35580 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35594 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35600 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35616 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35630 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35644 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35650 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35664 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35678 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35682 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35684 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35694 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35698 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35714 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35716 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35728 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35740 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35750 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35766 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35772 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35774 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35784 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35796 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35812 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35814 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35830 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35838 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35844 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35848 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35850 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35856 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35858 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35874 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35882 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35894 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35900 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35902 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35904 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35914 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35928 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35934 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35946 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35952 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35962 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35968 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35974 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35982 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:35990 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36004 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36010 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36022 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36036 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36040 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36050 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36056 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36060 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36066 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36074 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36086 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36090 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36100 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36108 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36116 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36118 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36132 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36142 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36152 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36156 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36158 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36166 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36168 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36176 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36184 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36198 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36204 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36214 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36230 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36236 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36252 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36260 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36264 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36274 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36288 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36298 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36306 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36314 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36316 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36332 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36340 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36344 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36360 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36364 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36366 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36368 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36376 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36378 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36394 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36398 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36404 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36416 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36426 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36440 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36450 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36464 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36468 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36482 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36488 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36502 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36518 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36530 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36536 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36540 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36544 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36550 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36554 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36560 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36570 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36580 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36592 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36598 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36604 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36618 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36624 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36638 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36644 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36648 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36656 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36672 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36686 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36692 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36708 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36724 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36734 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36748 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36752 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36764 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36768 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36784 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36794 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36802 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36812 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36820 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36836 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36840 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36854 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36862 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36872 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36878 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36888 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36902 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36908 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36912 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36924 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36940 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36942 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36946 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36948 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36958 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36974 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36976 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36990 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:36992 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37006 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37012 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37026 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37042 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37046 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37062 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37076 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37088 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37090 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37096 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37110 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37120 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37122 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37136 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37146 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37152 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37158 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37164 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37174 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37190 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37196 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37208 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37210 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37222 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37232 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37242 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37258 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37274 - "POST /v1/chat/completions HTTP/1.1" 200 OK
INFO:     127.0.0.1:37284 - "POST /v1/chat/completions HTTP/1.1" 200 OK
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 1
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 2
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 3
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 4
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 5
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 6
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 7
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 8
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 9
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 10
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 11
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 12
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 13
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 14
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 15
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 16
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 17
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 18
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 19
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 20
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 21
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 22
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 23
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 24
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 25
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 26
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 27
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 28
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 29
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 30
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 31
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 32
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 33
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 34
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 35
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 36
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 37
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 38
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 39
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 40
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 41
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 42
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 43
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 44
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 45
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 46
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 47
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 48
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 49
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 50
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 51
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 52
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 53
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 54
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 55
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 56
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 57
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 58
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 59
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 60
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 61
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 62
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 63
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 64
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 65
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 66
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 67
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 68
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 69
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 70
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 71
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 72
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 73
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 74
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 75
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 76
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 77
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 78
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 79
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 80
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 81
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 82
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 83
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 84
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 85
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 86
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 87
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 88
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 89
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 90
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 91
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 92
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 93
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 94
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 95
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 96
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 97
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 98
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 99
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 100
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 101
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 102
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 103
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 104
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 105
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 106
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 107
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 108
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 109
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 110
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 111
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 112
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 113
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 114
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 115
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 116
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 117
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 118
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 119
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 120
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 121
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 122
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 123
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 124
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 125
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 126
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 127
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 128
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 129
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 130
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 131
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 132
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 133
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 134
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 135
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 136
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 137
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 138
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 139
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 140
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 141
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 142
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 143
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 144
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 145
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 146
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 147
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 148
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 149
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 150
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 151
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 152
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 153
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 154
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 155
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 156
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 157
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 158
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 159
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 160
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 161
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 162
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 163
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 164
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 165
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 166
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 167
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 168
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 169
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 170
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 171
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 172
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 173
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 174
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 175
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 176
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 177
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 178
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 179
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 180
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 181
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 182
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 183
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 184
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 185
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 186
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 187
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 188
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 189
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 190
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 191
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 192
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 193
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 194
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 195
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 196
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 197
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 198
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 199
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 200
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 201
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 202
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 203
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 204
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 205
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 206
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 207
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 208
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 209
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 210
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 211
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 212
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 213
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 214
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 215
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 216
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 217
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 218
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 219
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 220
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 221
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 222
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 223
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 224
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 225
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 226
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 227
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 228
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 229
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 230
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 231
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 232
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 233
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 234
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 235
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 236
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 237
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 238
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 239
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 240
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 241
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 242
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 243
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 244
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 245
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 246
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 247
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 248
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 249
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 250
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 251
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 252
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 253
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 254
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 255
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 256
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 257
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 258
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 259
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 260
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 261
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 262
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 263
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 264
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 265
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 266
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 267
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 268
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 269
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 270
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 271
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 272
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 273
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 274
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 275
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 276
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 277
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 278
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 279
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 280
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 281
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 282
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 283
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 284
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 285
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 286
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 287
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 288
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 289
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 290
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 291
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 292
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 293
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 294
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 295
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 296
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 297
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 298
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 299
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 300
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 301
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 302
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 303
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 304
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 305
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 306
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 307
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 308
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 309
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 310
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 311
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 312
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 313
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 314
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 315
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 316
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 317
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 318
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 319
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 320
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 321
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 322
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 323
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 324
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 325
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 326
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 327
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 328
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 329
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 330
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 331
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 332
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 333
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 334
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 335
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 336
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 337
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 338
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 339
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 340
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 341
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 342
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 343
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 344
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 345
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 346
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 347
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 348
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 349
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 350
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 351
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 352
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 353
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 354
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 355
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 356
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 357
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 358
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 359
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 360
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 361
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 362
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 363
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 364
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 365
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 366
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 367
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 368
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 369
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 370
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 371
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 372
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 373
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 374
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 375
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 376
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 377
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 378
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 379
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 380
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 381
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 382
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 383
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 384
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 385
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 386
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 387
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 388
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 389
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 390
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 391
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 392
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 393
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 394
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 395
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 396
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 397
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 398
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 399
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 400
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 401
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 402
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 403
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 404
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 405
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 406
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 407
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 408
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 409
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 410
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 411
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 412
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 413
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 414
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 415
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 416
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 417
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 418
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 419
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 420
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 421
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 422
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 423
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 424
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 425
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 426
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 427
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 428
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 429
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 430
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 431
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 432
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 433
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 434
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 435
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 436
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 437
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 438
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 439
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 440
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 441
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 442
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 443
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 444
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 445
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 446
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 447
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 448
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 449
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 450
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 451
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 452
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 453
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 454
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 455
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 456
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 457
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 458
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 459
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 460
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 461
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 462
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 463
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 464
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 465
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 466
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 467
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 468
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 469
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 470
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 471
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 472
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 473
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 474
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 475
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 476
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 477
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 478
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 479
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 480
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 481
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 482
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 483
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 484
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 485
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 486
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 487
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 488
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 489
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 490
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 491
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 492
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 493
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 494
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 495
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 496
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 497
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 498
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 499
[2026-03-02|21:47:00|FrontendAPI] WARNING  Aborting request for user 500

@DarkSharpness
Collaborator

DarkSharpness commented Mar 3, 2026

This looks really tricky. I don't have access to Ampere GPUs, but on H200 and B200, under the same config (without MINISGL_OVERLAP_EXTRA_SYNC), there is no illegal memory access.

# server
python -m minisgl --model Qwen/Qwen3-1.7B --num-pages 45228 --attn "fi" --max-running-req 128 --port 1919
# client
python -m sglang.bench_serving --backend sglang-oai-chat --dataset-name random --random-input 128 --random-output 128 --num-prompts 500 --request-rate 128 --random-range-ratio 1.0 --base-url http://127.0.0.1:1919

It seems that the issue only exists for Ampere GPUs... Could you try some older commits, like 34fe3f31fd12f26b0a8f7ed5044b292f493df5a0?

If the issue still exists, could you please try:

  1. set CUDA_LAUNCH_BLOCKING=1 and rerun
  2. set CUDA_LAUNCH_BLOCKING=0 and MINISGL_DISABLE_OVERLAP_SCHEDULING=1 and rerun
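
The two runs requested above differ only in environment variables, so they can be scripted. This is a minimal sketch, assuming the server command quoted earlier in this comment; `EXPERIMENTS`, `build_env`, and `render` are hypothetical helper names and not part of minisgl.

```python
import os
import shlex

# Server command quoted from the comment above.
SERVER_CMD = (
    'python -m minisgl --model Qwen/Qwen3-1.7B --num-pages 45228 '
    '--attn "fi" --max-running-req 128 --port 1919'
)

# Hypothetical labels for the two requested runs.
EXPERIMENTS = {
    "launch_blocking": {"CUDA_LAUNCH_BLOCKING": "1"},
    "no_overlap": {
        "CUDA_LAUNCH_BLOCKING": "0",
        "MINISGL_DISABLE_OVERLAP_SCHEDULING": "1",
    },
}


def build_env(name, base=None):
    """Return a copy of the base environment with one experiment's variables set."""
    env = dict(os.environ if base is None else base)
    env.update(EXPERIMENTS[name])
    return env


def render(name):
    """Render the experiment as a shell one-liner for copy and paste."""
    prefix = " ".join(f"{k}={shlex.quote(v)}" for k, v in EXPERIMENTS[name].items())
    return f"{prefix} {SERVER_CMD}"


if __name__ == "__main__":
    for name in EXPERIMENTS:
        print(render(name))
```

To launch a run from Python instead of printing it, `subprocess.run(shlex.split(SERVER_CMD), env=build_env("no_overlap"))` would start the server with the chosen variables.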

@itechbear
Author

Setting MINISGL_DISABLE_OVERLAP_SCHEDULING=1 alone, without CUDA_LAUNCH_BLOCKING, does mitigate the issue. It is the workaround I used to resume my benchmarks.

I'll try setting CUDA_LAUNCH_BLOCKING=1 and post updates here when I have data points.

@DarkSharpness
Collaborator

Thanks. Unfortunately I don't have access to a 3060 or A100. The fastest way to find out which kernel hits the IMA (illegal memory access) is a CUDA coredump. This bug is puzzling to me because I have never reproduced it on Hopper/Blackwell GPUs. In any case, I will do another static pass over the codebase.
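
For reference, a CUDA coredump is enabled through driver environment variables set before the server process starts. This is a minimal sketch, assuming the variables documented for CUDA-GDB core dumps; the helper name `coredump_env` and the file name pattern are illustrative choices.

```python
import os


def coredump_env(dump_dir, base=None):
    """Return an environment that asks the CUDA driver to write a GPU coredump
    when a kernel raises an exception such as an illegal memory access."""
    env = dict(os.environ if base is None else base)
    # Generate a coredump on GPU exceptions.
    env["CUDA_ENABLE_COREDUMP_ON_EXCEPTION"] = "1"
    # %p expands to the process ID and %t to a timestamp (pattern is an assumption).
    env["CUDA_COREDUMP_FILE"] = f"{dump_dir}/cuda_coredump.%p.%t"
    # Print progress while the dump is written, which can take a while.
    env["CUDA_COREDUMP_SHOW_PROGRESS"] = "1"
    return env


if __name__ == "__main__":
    for key, value in coredump_env("/tmp", base={}).items():
        print(f"export {key}={value}")
```

The resulting file can then be opened in `cuda-gdb` with `target cudacore <file>` to see the faulting kernel.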

@itechbear
Author

Setting CUDA_LAUNCH_BLOCKING=1 would also resolve the problem.

I created a CUDA coredump file following this article: https://blog.vllm.ai/2025/08/11/cuda-debugging.html. I hope it helps you debug, since I don't have deep knowledge of CUDA.

cuda_coredump.8643.1772547581.gz

@DarkSharpness
Collaborator

Thanks a lot. I will look into it tonight👍🏿

@DarkSharpness
Collaborator

The bug is in flashinfer prefill attention. I will locate the potential race condition later.

@DarkSharpness
Collaborator

I managed to reproduce this bug on my H200 by adding torch.cuda._sleep(1_000_000) before a forward batch. It only happens with the flashinfer attention backend (switching the backend to fa or trtllm mitigates it), so there must be a race condition in the flashinfer attention backend. I'm still looking into it.

cc @itechbear @gbdjxgp
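
The reproduction trick above can be sketched as a small wrapper. It assumes `torch.cuda._sleep`, a private PyTorch helper that, as far as I know, enqueues a kernel spinning for roughly the given number of GPU cycles on the current stream, so the GPU falls behind the CPU-side scheduler. `run_with_gpu_delay` and `forward` are hypothetical names, not minisgl's actual code.

```python
def _gpu_spin(cycles):
    """Enqueue a busy-wait kernel on the current CUDA stream (needs a GPU)."""
    import torch  # imported lazily so this module loads on machines without CUDA

    torch.cuda._sleep(cycles)  # private API, may change between PyTorch releases


def run_with_gpu_delay(forward, cycles=1_000_000, spin=_gpu_spin):
    """Delay the GPU before launching a forward batch.

    The CPU returns from `spin` immediately and keeps scheduling, while the
    kernels launched by `forward` wait behind the spin kernel. Any buffer the
    CPU reuses too early then has a much wider window in which to be read
    in a stale or overwritten state.
    """
    spin(cycles)
    return forward()


if __name__ == "__main__":
    # Dry run without a GPU: replace the spin with a no-op.
    print(run_with_gpu_delay(lambda: "forward done", spin=lambda cycles: None))
```

This matches the earlier observation that CUDA_LAUNCH_BLOCKING=1 hides the error: with blocking launches the CPU can no longer run ahead of the GPU.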

@itechbear
Author

I just benchmarked the latest main branch. While the previous problem seems to be gone, a new error arises:

Process minisgl-TP0-scheduler:
Traceback (most recent call last):
  File "/usr/lib/python3.12/multiprocessing/process.py", line 314, in _bootstrap
    self.run()
  File "/usr/lib/python3.12/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "./sgl-project/mini-sglang/main/python/minisgl/server/launch.py", line 31, in _run_scheduler
    scheduler.run_forever()
  File "./sgl-project/mini-sglang/main/.venv/lib/python3.12/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "./sgl-project/mini-sglang/main/python/minisgl/scheduler/scheduler.py", line 131, in run_forever
    data = self.overlap_loop(data)
           ^^^^^^^^^^^^^^^^^^^^^^^
  File "./sgl-project/mini-sglang/main/python/minisgl/scheduler/scheduler.py", line 105, in overlap_loop
    self._process_last_data(last_data)
  File "./sgl-project/mini-sglang/main/python/minisgl/scheduler/scheduler.py", line 164, in _process_last_data
    self.cache_manager.cache_req(req, finished=False)
  File "./sgl-project/mini-sglang/main/python/minisgl/scheduler/cache.py", line 72, in cache_req
    self.unlock(old_handle)
  File "./sgl-project/mini-sglang/main/python/minisgl/scheduler/cache.py", line 40, in unlock
    self.prefix_cache.lock_handle(handle, unlock=True)
  File "./sgl-project/mini-sglang/main/python/minisgl/kvcache/radix_cache.py", line 119, in lock_handle
    assert node.ref_count >= 0
           ^^^^^^^^^^^^^^^^^^^
AssertionError

Benchmark script:

$ python bench_serving.py --backend sglang-oai-chat --dataset-name random --random-input 128 --random-output 128 --num-prompts 1000 --request-rate 128 --random-range-ratio 1.0 --base-url http://127.0.0.1:1919

@DarkSharpness
Collaborator

Thanks. This looks like a separate issue, and I will fix it later. About two weeks ago we made some changes to the radix cache implementation, and that possibly broke something.
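
For readers unfamiliar with this kind of assertion: it typically fires when a handle is unlocked more often than it was locked. The following is a toy model of reference counting along a radix-tree path, written for illustration only. `Node`, `lock`, and `unlock` are hypothetical and are not the code in radix_cache.py.

```python
class Node:
    """Toy radix-tree node that only tracks its parent and a lock count."""

    def __init__(self, parent=None):
        self.parent = parent
        self.ref_count = 0


def lock(node):
    """Pin every node from `node` up to the root so none can be evicted."""
    while node is not None:
        node.ref_count += 1
        node = node.parent


def unlock(node):
    """Release one pin on every node from `node` up to the root."""
    while node is not None:
        node.ref_count -= 1
        # Same invariant as the failing assertion in the traceback.
        assert node.ref_count >= 0, "unlock without a matching lock"
        node = node.parent


if __name__ == "__main__":
    root = Node()
    leaf = Node(parent=root)
    lock(leaf)
    unlock(leaf)      # balanced: counts return to zero
    try:
        unlock(leaf)  # unbalanced: second unlock of the same handle
    except AssertionError as exc:
        print("AssertionError:", exc)
```

In this model, unlocking a stale handle twice is enough to trip the check, which would be one plausible way for a change in handle bookkeeping to surface as this error.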
