Skip to content

test(eval): add flashinfer regression and benchmark coverage#157

Open
lesj0610 wants to merge 1 commit intoturboderp-org:masterfrom
lesj0610:feat/regression-suite-upstream
Open

test(eval): add flashinfer regression and benchmark coverage#157
lesj0610 wants to merge 1 commit intoturboderp-org:masterfrom
lesj0610:feat/regression-suite-upstream

Conversation

@lesj0610
Copy link
Contributor

@lesj0610 lesj0610 commented Mar 2, 2026

Summary

  • refresh the eval/regression branch onto v0.0.24
  • update eval/perf.py for the current runtime/backend flow
  • add gateway smoke, multilingual quality smoke, Qwen3.5 architecture smoke, and MLA support matrix documentation
  • switch compare_q_exllamav3.py and ppl.py to the backend-auto path used by the refreshed runtime

Notes

  • This PR is evaluation/documentation only
  • No runtime, kernel, or conversion logic is modified here

Validation

  • python -m py_compile eval/compare_q_exllamav3.py eval/perf.py eval/ppl.py eval/gateway_regression_smoke.py eval/quality_regression_en_zh.py eval/quality_smoke_multilingual.py eval/smoke_qwen3_5_arch.py

@lesj0610 lesj0610 force-pushed the feat/regression-suite-upstream branch from d05b29e to a0f939c Compare March 11, 2026 15:52
@lesj0610
Copy link
Contributor Author

Refreshed onto v0.0.24. This branch is now limited to eval/regression/documentation coverage for the current runtime/backend flow.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant