Skip to content

fix: cuda availability checker issue over web docker service with no …#89

Merged
IliasHad merged 1 commit intomainfrom
fix/cuda-issues-on-web-docker-service
Feb 25, 2026
Merged

fix: cuda availability checker issue over web docker service with no …#89
IliasHad merged 1 commit intomainfrom
fix/cuda-issues-on-web-docker-service

Conversation

@IliasHad
Copy link
Owner

Overview

isGPUAvailable() returns true on machines where the NVIDIA driver is installed but the CUDA toolkit libraries (e.g. libcublasLt.so.12) are missing or on the wrong version. This causes ONNX Runtime to throw at model load time and not at GPU detection time, making the failure hard to trace. #85

Expected behavior

isGPUAvailable() should return true only when the full CUDA stack is confirmed to be functional.

Actual behavior

isGPUAvailable() returns true based on driver presence alone, then ONNX Runtime crashes at model initialization with a missing shared library error.

Changes:

Replace shell-based detection with an actual ONNX Runtime CUDA session creation using a minimal valid ONNX model. This validates the full CUDA stack end-to-end

@IliasHad IliasHad merged commit e909c72 into main Feb 25, 2026
26 of 32 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant