Intel GPU is not used by qwen-asr #8850
-
The log line where GPU vendor and VRAM are both empty stands out, because it suggests the problem may begin before qwen-asr inference and before model-specific code decides where to run. If the runtime cannot identify the Intel device at its own capability layer, backend selection may fall back in ways that still look superficially correct from the outside.
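One way to narrow this down is to check what the container itself can see at the kernel level, independently of any backend. The following is a minimal sketch, assuming a Linux container where DRM render nodes are exposed under `/sys/class/drm` and `/dev/dri`. The function name `find_intel_render_nodes` is hypothetical, and this is not how LocalAI performs its own detection; it only reports whether an Intel render node (PCI vendor ID `0x8086`) is visible and whether the matching device file exists.

```python
from pathlib import Path

# PCI vendor ID assigned to Intel.
INTEL_PCI_VENDOR_ID = "0x8086"


def find_intel_render_nodes(sysfs_drm=Path("/sys/class/drm"),
                            dev_dri=Path("/dev/dri")):
    """Return (node_name, device_file_exists) for each Intel render node.

    Hypothetical helper for troubleshooting; the default paths are the
    usual Linux locations and may differ inside a restricted container.
    """
    results = []
    for node in sorted(sysfs_drm.glob("renderD*")):
        vendor_file = node / "device" / "vendor"
        try:
            vendor = vendor_file.read_text().strip().lower()
        except OSError:
            # Node without a readable vendor file: skip it.
            continue
        if vendor == INTEL_PCI_VENDOR_ID:
            results.append((node.name, (dev_dri / node.name).exists()))
    return results


if __name__ == "__main__":
    nodes = find_intel_render_nodes()
    if not nodes:
        print("No Intel render node visible in sysfs")
    for name, present in nodes:
        print(f"{name}: /dev/dri entry present = {present}")
```

If this prints no Intel node when run inside the container, the device mapping is a more likely culprit than the backend. If the node is listed and present, the empty vendor and VRAM fields point further up the stack.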
-
I did some testing with an LLM (qwen3 running on the llama-cpp backend), and the Intel GPU is actually used there.
-
Hi, I'm running LocalAI on TrueNAS 25.10 with the following kernel: `6.12.33-production+truenas`. And my GPU is an `Intel ARC A380 DG2`.

When I'm trying to use the `qwen3-asr-0.6b` STT model with the `intel-qwen-asr` backend, CPU is used instead of the GPU.

I've determined this by observing GPU usage with `intel_gpu_top` (it doesn't spike when STT is doing the processing) and CPU usage with `htop` (CPU usage spikes on multiple cores when STT processing is done).

During startup, no GPU and no VRAM are detected by the image:

However, `sycl-ls` does identify the GPU:

I would appreciate any input on why the GPU is seemingly not used and what troubleshooting steps I can take to determine why.
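As one possible troubleshooting step, the `sycl-ls` check can be scripted so it is easy to rerun inside the container after each configuration change. This is a rough sketch, assuming `sycl-ls` prints one device per line starting with a bracketed tag such as `[backend:gpu]` or `[backend:gpu:0]`; that line shape is an assumption and may vary between oneAPI releases. The helper names `gpu_backends` and `run_sycl_ls` are hypothetical.

```python
import re
import shutil
import subprocess

# Assumed tag shape at the start of each device line:
# "[<backend>:gpu]" or "[<backend>:gpu:<index>]".
GPU_TAG = re.compile(r"^\[([A-Za-z0-9_]+):gpu[:\]]")


def gpu_backends(sycl_ls_output):
    """Return the backend names of all lines that advertise a GPU device."""
    backends = []
    for line in sycl_ls_output.splitlines():
        match = GPU_TAG.match(line.strip())
        if match:
            backends.append(match.group(1))
    return backends


def run_sycl_ls():
    """Run sycl-ls if it is on PATH; return its stdout, or None if missing."""
    if shutil.which("sycl-ls") is None:
        return None
    result = subprocess.run(["sycl-ls"], capture_output=True,
                            text=True, check=False)
    return result.stdout


if __name__ == "__main__":
    output = run_sycl_ls()
    if output is None:
        print("sycl-ls not found on PATH")
    else:
        print("GPU backends reported by sycl-ls:", gpu_backends(output))
```

An empty list here while `sycl-ls` on the host shows the GPU would suggest the container environment differs from the host. A non-empty list alongside the empty startup detection would suggest the image's detection step and the SYCL runtime disagree.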
For information, I'm using the LocalAI Intel GPU image with the following compose:
Full log: