[AMD] Add MiniMax-M3-FP8 MI355X ATOMESH update 0623 by seungrokj · Pull Request #1930 · SemiAnalysisAI/InferenceX

seungrokj · 2026-06-25T05:44:27Z

Summary

- Eliminate all hardcoded `MODEL_NAME == "DeepSeek-V4-Pro"` / per-model checks from `server_atom.sh`
- All model-specific configuration (env vars, parallel flags, MTP flags, KV cache flags, HF overrides) is now driven from `models_atom.yaml` using the same `python3` `yaml.safe_load` pattern as `server_vllm.sh`
- Add MiniMax-M3-MXFP4 and MiniMax-M3-MXFP8 entries to `models_atom.yaml` with EAGLE3 MTP flags
- Image bump for minimaxm3-fp8-mi355x-atom-disagg: `rocm/atom-dev:MiniMax-M3-20260622` → `rocm/atom-dev:MiniMax-M3-20260623`
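To illustrate the refactor described in the summary, here is a minimal sketch of a data-driven lookup that could replace a hardcoded per-model branch. It is not the code from this PR. The `MODELS` dict stands in for whatever `yaml.safe_load` would return for `models_atom.yaml`, and the helper name `get_field` and all field values are hypothetical.

```python
# Stand-in for the mapping that yaml.safe_load would return for
# models_atom.yaml. Keys mirror the model names mentioned in the PR;
# every value below is a made-up placeholder, not taken from the PR.
MODELS = {
    "DeepSeek-V4-Pro": {
        "env": "EXAMPLE_FLAG=1",
        "kv_cache_flags": "--kv_cache_dtype fp8",
    },
    "MiniMax-M3-MXFP8": {
        "mtp_flags": "--example-mtp-flag",
    },
}


def get_field(models, model_name, field, default=""):
    """Return one config field for a model, or `default` when the
    model or the field is missing (hypothetical helper)."""
    return models.get(model_name, {}).get(field, default)


# Instead of `if MODEL_NAME == "DeepSeek-V4-Pro": ...`, the caller asks
# the table and treats an empty string as "nothing to add".
kv_flags = get_field(MODELS, "DeepSeek-V4-Pro", "kv_cache_flags")
mtp_flags = get_field(MODELS, "DeepSeek-V4-Pro", "mtp_flags")
```

With this shape, adding a new model means adding a YAML entry rather than another branch in the launch script.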

Fields added to `models_atom.yaml`

| Field | Purpose |
| --- | --- |
| `env` | Space-separated `KEY=VALUE` pairs exported unconditionally |
| `tp_dp_flags` | Parallel flags for TP+DPA mode |
| `tp_dp_env` | Env vars exported only in TP+DPA mode |
| `ep_dp_flags` | Parallel flags for EP+DPA mode |
| `ep_dp_env` | Env vars exported only in EP+DPA mode |
| `mtp_flags` | Flags prepended to `SPEC_ARGS` before `$DECODE_MTP_SIZE` |
| `kv_cache_flags` | Full `--kv_cache_dtype` flag string |
| `hf_overrides` | JSON string passed to `--hf-overrides` |
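As a rough sketch of how the fields above might combine at launch time, the following resolves one model entry into environment variables, server arguments, and speculative-decoding arguments. This is an assumption-laden illustration, not the logic in `server_atom.sh`: the function names `parse_env` and `resolve`, the mode strings `"tp_dp"` / `"ep_dp"`, the ordering of the arguments, and all sample values are hypothetical.

```python
import shlex


def parse_env(pairs):
    """Split a space-separated 'KEY=VALUE ...' string into a dict."""
    env = {}
    for item in shlex.split(pairs):
        key, sep, value = item.partition("=")
        if not sep or not key:
            raise ValueError(f"expected KEY=VALUE, got {item!r}")
        env[key] = value
    return env


def resolve(entry, mode, decode_mtp_size=None):
    """Resolve one model entry for a parallel mode ('tp_dp' or 'ep_dp').

    Returns (env, args, spec_args). The assembly order is an assumption
    made for this sketch.
    """
    # `env` always applies; the mode-specific env is layered on top.
    env = parse_env(entry.get("env", ""))
    env.update(parse_env(entry.get(f"{mode}_env", "")))

    # Mode-specific parallel flags, then the KV cache flag string.
    args = shlex.split(entry.get(f"{mode}_flags", ""))
    args += shlex.split(entry.get("kv_cache_flags", ""))

    # The JSON string is passed through as a single argument.
    if entry.get("hf_overrides"):
        args += ["--hf-overrides", entry["hf_overrides"]]

    # mtp_flags come first, followed by the MTP size, when MTP is enabled.
    spec_args = []
    if decode_mtp_size is not None:
        spec_args = shlex.split(entry.get("mtp_flags", "")) + [str(decode_mtp_size)]
    return env, args, spec_args


# Hypothetical entry; none of these values come from the PR.
example = {
    "env": "A=1 B=two",
    "tp_dp_env": "C=3",
    "tp_dp_flags": "--x 8",
    "ep_dp_flags": "--y",
    "mtp_flags": "--m eagle3 --n",
    "kv_cache_flags": "--kv_cache_dtype fp8",
    "hf_overrides": '{"k": 1}',
}
env, args, spec_args = resolve(example, "tp_dp", decode_mtp_size=3)
```

Keeping `hf_overrides` as one unsplit argument avoids breaking the JSON on its internal spaces.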

PR Review Checklist

- Verified that as of the moment of typing this, this is the latest version of PR_REVIEW_CHECKLIST.md
- Verified that the general code quality meets the InferenceX standard and does not make the code quality any worse.
- Verified that this PR has passed PR validation. Please link to GitHub Action workflow that shows this.
- Verified that this PR passes evals. Please link to GitHub Action workflow that shows this.
- Verified that speculative decoding PRs use chat templates to align the AL distribution with real-world usage
- If a company claims that they support vLLM/SGLang as first class LLM inference engines on their hardware, I have verified that the respective vLLM/SGLang submission has been made before additional frameworks (TRT-LLM, ATOM, etc.). The only exceptions are for new hardware, such as MI455X UALoE72, Vera Rubin NVL72, Rubin NVL8, etc., and for new model architectures where there is an actual reason why vLLM/SGLang does not fundamentally support them yet.
- Verified that the single-node recipes are similar to the official vLLM recipes and/or the SGLang cookbook:
  - If they are not, I have verified that a PR has been opened in vLLM recipe repo or SGLang repo and linked it below in the additional detail section:
- If any of the above criteria cannot reasonably be satisfied, I have provided additional reasoning below.

🤖 Generated with Claude Code

github-actions · 2026-06-25T10:57:31Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=28149790315
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=28149790315

functionstackx · 2026-06-25T23:07:15Z

@Oseltamivir can u review this? tho it seems like evals r potentially failing

seungrokj · 2026-06-26T03:12:14Z

@functionstackx @Oseltamivir let me first check something and will ping when it is ready!

seungrokj and others added 2 commits June 25, 2026 14:39

[AMD] refactor server_atom.sh to drive model-specific config from mod…

a07ef93

…els_atom.yaml Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

[AMD] add perf-changelog entry for server_atom.sh refactor (models_at…

ecda65b

…om.yaml-driven) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

seungrokj requested a review from a team June 25, 2026 05:44

seungrokj requested review from 1am9trash, billishyahao, chunfangamd and yctseng0211 as code owners June 25, 2026 05:44

github-project-automation Bot added this to InferenceMAX Board Jun 25, 2026

[AMD] fix perf-changelog pr-link for PR #1930

872f3ff

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

seungrokj changed the title ~~[AMD] refactor server_atom.sh: drive model-specific config from models_atom.yaml~~ [AMD] Add MiniMax-M3-FP4 MI355X ATOMESH update 0623 Jun 25, 2026

Merge branch 'main' into amd/m3_atom_pd_fp8_0623

3d714a9

seungrokj added the AMD, all-evals (Expand eval selection to every fixed-sequence config), evals-only (Suppress throughput and run only eval jobs; combine with all-evals to expand selection), and full-sweep-enabled labels Jun 25, 2026

claude Bot reviewed Jun 25, 2026

Comment thread benchmarks/multi_node/amd_utils/models_atom.yaml

Comment thread benchmarks/multi_node/amd_utils/server_atom.sh

Comment thread benchmarks/multi_node/amd_utils/server_atom.sh

seungrokj removed the full-sweep-enabled label Jun 26, 2026

[AMD] fix model name for minimaxm3-fp8-mi355x-atom-disagg to use Hugg…

b24371a

…ingFace path Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

seungrokj changed the title ~~[AMD] Add MiniMax-M3-FP4 MI355X ATOMESH update 0623~~ [AMD] Add MiniMax-M3-FP8 MI355X ATOMESH update 0623 Jun 26, 2026

seungrokj removed the all-evals (Expand eval selection to every fixed-sequence config) label Jun 26, 2026

seungrokj wants to merge 5 commits into main from amd/m3_atom_pd_fp8_0623

2 participants