feat: TensorRT support by dodamih · Pull Request #1205 · ZettaAI/zetta_utils

dodamih · 2026-03-24T23:06:26Z

Workers occasionally appear to fail to find TensorRT, but the failure is handled gracefully.

Add ldconfig for nvidia/tensorrt .so paths, prune large site-packages dirs from ldd scan to speed up build. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Switch from torchscript-based TRT compilation to torch_tensorrt.compile with ExportedProgram (.ep) format. Add GPU memory cleanup, improve error handling with fallback to eager mode. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
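
The fallback behaviour described in this commit can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: `load_optional`, `compile_with_fallback`, and the `compile_fn` callback are hypothetical names, and in the real code the callback would presumably wrap the call to `torch_tensorrt.compile`.

```python
import importlib
import logging
from typing import Any, Callable, Optional

logger = logging.getLogger(__name__)


def load_optional(module_name: str) -> Optional[Any]:
    """Import a module if possible; return None when it cannot be loaded.

    Hypothetical helper: OSError is caught as well as ImportError because a
    missing shared library can surface as either, depending on the package.
    """
    try:
        return importlib.import_module(module_name)
    except (ImportError, OSError) as exc:
        logger.warning("%s is unavailable: %s", module_name, exc)
        return None


def compile_with_fallback(model: Any, compile_fn: Callable[[Any], Any]) -> Any:
    """Return compile_fn(model), or the unmodified (eager) model on failure.

    Hypothetical helper: the broad except is deliberate, since any compilation
    error should degrade to eager execution rather than fail the worker.
    """
    try:
        return compile_fn(model)
    except Exception as exc:  # pylint: disable=broad-except
        logger.warning("Compilation failed, falling back to eager mode: %s", exc)
        return model
```

A worker would call `load_optional("torch_tensorrt")` once and skip compilation entirely when it returns `None`, so a missing install and a failed compile both end in the same eager path.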

Make tensorrt semaphore optional with a default count, so existing specs without it don't break. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
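
One way to make a semaphore optional with a default count is to fill in the missing key when the spec is resolved. The sketch below is an assumption about the general shape, not the PR's code: `resolve_semaphore_counts`, `DEFAULT_TENSORRT_COUNT`, and its value of 1 are hypothetical, and the `"cuda"` key in the usage note is only illustrative.

```python
from typing import Dict, Mapping

# Assumed default; the actual value chosen in the PR is not shown here.
DEFAULT_TENSORRT_COUNT = 1


def resolve_semaphore_counts(spec: Mapping[str, int]) -> Dict[str, int]:
    """Copy the spec and add a "tensorrt" entry only if the spec omits it."""
    counts = dict(spec)
    counts.setdefault("tensorrt", DEFAULT_TENSORRT_COUNT)
    return counts
```

With this shape, `resolve_semaphore_counts({"cuda": 2})` gains a `"tensorrt"` entry with the default count, while a spec that already sets `"tensorrt"` keeps its own value.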

trivoldus28 · 2026-03-25T05:24:45Z

zetta_utils/convnet/utils.py

+            assert input_shape is not None  # mypy
+
+            trt_fname = (
+                str(xxhash.xxh128(str((path, tuple(input_shape))).encode("utf-8")).hexdigest())


I think you also need to add the GPU model or compute capability to the hash. I'm not sure whether a T4 can run a model compiled on an L4, or vice versa.
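
A minimal sketch of the suggestion, assuming the compute capability is passed in as a `(major, minor)` tuple. `trt_cache_key` is a hypothetical name, and `hashlib.blake2b` with a 16-byte digest stands in for `xxhash.xxh128` only so the example runs on the standard library; it does not produce the same digests as the PR's code.

```python
import hashlib
from typing import Sequence, Tuple


def trt_cache_key(
    path: str,
    input_shape: Sequence[int],
    compute_capability: Tuple[int, int],
) -> str:
    """Build a cache key that changes when the GPU compute capability changes."""
    payload = str((path, tuple(input_shape), tuple(compute_capability)))
    # 16-byte digest gives a 32-character hex string, like a 128-bit hash.
    return hashlib.blake2b(payload.encode("utf-8"), digest_size=16).hexdigest()
```

At runtime the tuple could come from `torch.cuda.get_device_capability()`, which reports `(7, 5)` on a T4 and `(8, 9)` on an L4, so engines built on the two cards would get different file names. Whether compute capability alone is enough, or the device name should be hashed too, is left open in this thread.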

codecov · 2026-03-25T05:34:15Z

Codecov Report

❌ Patch coverage is 91.66667% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 99.97%. Comparing base (8c2437b) to head (6991a3e).
⚠️ Report is 79 commits behind head on main.

Files with missing lines	Patch %	Lines
zetta_utils/convnet/utils.py	88.88%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1205      +/-   ##
==========================================
- Coverage   99.98%   99.97%   -0.01%     
==========================================
  Files         192      192              
  Lines       10199    10214      +15     
==========================================
+ Hits        10197    10211      +14     
- Misses          2        3       +1


trivoldus28 and others added 5 commits March 24, 2026 16:02

add tensorrt support

79873a2

feat: TensorRT shared library discovery in Docker

f0c2cec

feat: update TensorRT compilation to new API

3454c39

refactor: rename trt_compilation semaphore to tensorrt

7ee2c19

fix: extract modules for type checks

8db5547

dodamih force-pushed the dodam/tensorrt branch from d6574f8 to 8db5547 Compare March 24, 2026 23:19

supersergiy approved these changes Mar 25, 2026

dodamih added 2 commits March 24, 2026 22:18

chore: mypy / pylint fix

fc76972

chore: add tensorrt cleanup to test fixture

6991a3e

trivoldus28 reviewed Mar 25, 2026

dodamih wants to merge 7 commits into main from dodam/tensorrt
