Tracking ROCm CI parity with the NVIDIA PR Test workflow (.github/workflows/pr-test.yml): the goal is for the same per-commit tests to be green on AMD Instinct MI300/MI355X (ROCm) as on NVIDIA.
Snapshot of the currently active (enabled) per-commit tests against origin/main @ c8e85df24. A ticked box = confirmed passing on MI355X (trailing #NNNN is the PR carrying the ROCm fix, where one was needed); a blank box = still to run (newly added/split files, or not yet green). Suites are bucketed by GPU type/count and selected at runtime by domain labels. List a suite's contents with:
python3 -m tests.ci.run_suite --hw cpu --suite stage-a-cpu --list-only
python3 -m tests.ci.run_suite --hw cuda --suite <suite-name> --list-only
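To list every suite in one pass, the two commands above can be generated per suite. This is a minimal sketch, not part of the repository: the `SUITES` mapping and both helper functions are hypothetical names, the suite names are copied from the headings in this issue, and pairing `stage-b-cpu` with `--hw cpu` is an assumption based on its "(CPU)" label. The script only builds and prints the command lines; it does not run them.

```python
from __future__ import annotations

import shlex

# Suite name -> value passed to --hw. Names come from the headings in this
# issue; the mapping itself is an illustrative assumption.
SUITES = {
    "stage-a-cpu": "cpu",
    "stage-b-cpu": "cpu",
    "stage-b-2-gpu-h200": "cuda",
    "stage-c-2-gpu-h200": "cuda",
    "stage-c-4-gpu-h200": "cuda",
    "stage-c-8-gpu-h100": "cuda",
}


def list_only_command(suite: str, hw: str) -> list[str]:
    """Build the argv that lists one suite's tests without running them."""
    return [
        "python3", "-m", "tests.ci.run_suite",
        "--hw", hw,
        "--suite", suite,
        "--list-only",
    ]


def all_list_commands() -> list[str]:
    """Return one shell-quoted command line per suite, in heading order."""
    return [shlex.join(list_only_command(s, hw)) for s, hw in SUITES.items()]


if __name__ == "__main__":
    for line in all_list_commands():
        print(line)
```

The printed lines can be pasted into a shell from the repository root, or each argv list can be handed to `subprocess.run` if the listing should be captured programmatically.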
77 / 83 passing.
stage-a-cpu (CPU) — 44 active
stage-b-cpu (CPU)
No tests registered yet.
stage-b-2-gpu-h200 (CUDA, 2-GPU) — 15 active
stage-c-2-gpu-h200 (CUDA, 2-GPU) — 2 active
stage-c-4-gpu-h200 (CUDA, 4-GPU) — 12 active
stage-c-8-gpu-h100 (CUDA, 8-GPU) — 10 active
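As a quick consistency check on the headline figure, the per-suite active counts above can be summed and compared with the 77 / 83 total. This is a small sketch using only the numbers stated in this issue; `ACTIVE`, `PASSING` and `summarize` are hypothetical names, and `stage-b-cpu` is entered as 0 because no tests are registered for it yet.

```python
# Active per-commit tests per suite, copied from the headings in this issue.
ACTIVE = {
    "stage-a-cpu": 44,
    "stage-b-cpu": 0,  # no tests registered yet
    "stage-b-2-gpu-h200": 15,
    "stage-c-2-gpu-h200": 2,
    "stage-c-4-gpu-h200": 12,
    "stage-c-8-gpu-h100": 10,
}

# Ticked boxes, taken from the "77 / 83 passing" line.
PASSING = 77


def summarize(active: dict, passing: int) -> str:
    """Return a one-line parity summary for the tracked suites."""
    total = sum(active.values())
    remaining = total - passing
    rate = passing / total
    return f"{passing} / {total} passing ({rate:.1%}), {remaining} still open"


if __name__ == "__main__":
    print(summarize(ACTIVE, PASSING))  # 77 / 83 passing (92.8%), 6 still open
```

The suite counts add up to 83, which matches the stated total and leaves 6 tests still to be ticked.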
Notes recorded against individual tests, by suite:
stage-b-2-gpu-h200: … NVFP4QuantizerRef)
stage-c-4-gpu-h200: INT4 quant kernel is CUDA-only (no ROCm/HIP build of fake_int_quant_cuda); modelopt_fp8 quantization is currently rejected by the SGLang ROCm guard.
stage-c-8-gpu-h100: Deep-EP is WIP on ROCm (noted on two tests).