Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
60 commits
Select commit Hold shift + click to select a range
5e98d64
feat: Add TraceLens integration for trace analysis with MLflow upload
gphuang Dec 18, 2025
bbfa9d3
docs: Fix TraceLens CSV format description (multiple files per rank)
gphuang Dec 18, 2025
0759122
fix: Remove unsupported HTML format option from TraceLens
gphuang Dec 18, 2025
4c908e5
fix: Use specific trace file patterns to avoid matching unrelated JSO…
gphuang Dec 18, 2025
2861bdf
docs: Clarify MLflow upload defaults are opt-out when MLflow enabled
gphuang Dec 18, 2025
44d479f
fix: normalize tracelens_ranks parameter to handle string input from …
gphuang Dec 18, 2025
f343046
perf: optimize TraceLens report generation to parse trace file only once
gphuang Dec 18, 2025
0ed33db
feat: cleanup tracelens_reports directory after upload to MLflow
gphuang Dec 18, 2025
deda294
feat: make tracelens_reports cleanup configurable
gphuang Dec 18, 2025
45d384f
refactor: change default to keep tracelens reports locally
gphuang Dec 18, 2025
290d5d3
feat: decouple TraceLens generation from MLflow upload
gphuang Dec 18, 2025
370360c
refactor: remove confusing tracelens_max_reports parameter
gphuang Dec 18, 2025
6366eac
fix: Escape glob paths to handle [] characters in experiment names
gphuang Dec 18, 2025
cb1584b
fix: Install openpyxl for XLSX generation and call TraceLens twice fo…
gphuang Dec 18, 2025
8dc3126
feat: Enable TraceLens by default with one report per node
gphuang Dec 19, 2025
c9967c6
fix: Upload TraceLens CSV directories to preserve rank grouping
gphuang Dec 19, 2025
c383ae8
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Dec 19, 2025
36e4b70
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 14, 2026
eb4da13
minor fix: lint format
gphuang Jan 15, 2026
bddc80f
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 15, 2026
fdc8f51
minor fix
gphuang Jan 15, 2026
6140b2b
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 16, 2026
ada9c01
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 19, 2026
11ffb51
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 20, 2026
77e03d1
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 20, 2026
2494dd5
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 22, 2026
6eebca4
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 23, 2026
d931053
Merge branch 'main' into feat/12-enable-tracelens-analysis
wenxie-amd Jan 26, 2026
968ee8b
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Jan 30, 2026
f79efc0
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Feb 2, 2026
3cfa407
Refactor TraceLens/MLflow artifact features to separate module
gphuang Feb 2, 2026
559ea52
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Feb 3, 2026
99f0fa6
Address Copilot review comments for TraceLens functionality
gphuang Feb 3, 2026
ab6323e
feat: auto-enable mlflow and profiling for tracelens upload
gphuang Feb 5, 2026
5b4e43c
fix: auto-enable tensorboard when profiling is enabled
gphuang Feb 5, 2026
74ad879
chore: set TraceLens defaults to false (opt-in)
gphuang Feb 5, 2026
a399f31
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Feb 9, 2026
7d40ca4
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Feb 9, 2026
75ec055
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Feb 9, 2026
7a3856f
Pin TraceLens install to v0.4.0; default mlflow upload flags to false
gphuang Feb 9, 2026
8098e53
TraceLens/MLflow fixes: tests, docs, local-only generation, cleanup s…
gphuang Feb 9, 2026
21106e3
Fix Copilot review issues: supply-chain safety, UnboundLocalError, ra…
gphuang Feb 9, 2026
8a93b63
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Feb 10, 2026
63fb4aa
Harden TraceLens rank handling and docs
gphuang Feb 10, 2026
9fcd412
Clarify MLflow memory metric name
gphuang Feb 10, 2026
2ab4a48
Tighten GPU util parsing and TraceLens log wording
gphuang Feb 10, 2026
535c816
Defer openpyxl install until TraceLens import
gphuang Feb 10, 2026
1ed2954
Harden TraceLens install behavior and docs
gphuang Feb 11, 2026
7088a0c
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Feb 11, 2026
9d59aa5
Address remaining TraceLens and ROCm comment feedback
gphuang Feb 11, 2026
8581091
Clarify TraceLens defaults and openpyxl fallback
gphuang Feb 11, 2026
6198960
Clarify MLflow writer rank in TraceLens upload
gphuang Feb 11, 2026
c05c743
Fix TraceLens output_format docstring default
gphuang Feb 11, 2026
f86556a
Improve TraceLens install diagnostics and metrics safety.
gphuang Feb 11, 2026
8891434
Remove unused openpyxl install result.
gphuang Feb 11, 2026
3a1c3c5
Harden TraceLens install checks and ROCm parsing.
gphuang Feb 11, 2026
195f28b
Handle TraceLens SHA verification.
gphuang Feb 11, 2026
74fccb9
Merge branch 'main' into feat/12-enable-tracelens-analysis
gphuang Feb 20, 2026
0b4b491
Add TraceLens normalization coverage and import.
gphuang Feb 20, 2026
d2c88d7
Remove redundant re import.
gphuang Feb 23, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading
Loading