[AMD] Add MiniMax-M3-FP8 MI355X ATOMESH update 0623#1930
Open
seungrokj wants to merge 5 commits into
…els_atom.yaml Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…om.yaml-driven) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Contributor
See the unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=28149790315
Collaborator
@Oseltamivir can you review this? Though it seems like the evals are potentially failing.
…ingFace path Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Collaborator
Author
@functionstackx @Oseltamivir let me first check something and will ping when it is ready!
Summary
- `MODEL_NAME == "DeepSeek-V4-Pro"` / per-model checks from `server_atom.sh`
- `models_atom.yaml` using the same `python3 yaml.safe_load` pattern as `server_vllm.sh`
- `MiniMax-M3-MXFP4` and `MiniMax-M3-MXFP8` entries to `models_atom.yaml` with EAGLE3 MTP flags
- `minimaxm3-fp8-mi355x-atom-disagg`: `rocm/atom-dev:MiniMax-M3-20260622` → `rocm/atom-dev:MiniMax-M3-20260623`

Fields added to `models_atom.yaml`

- `env`: `KEY=VALUE` pairs exported unconditionally
- `tp_dp_flags`
- `tp_dp_env`
- `ep_dp_flags`
- `ep_dp_env`
- `mtp_flags`: `SPEC_ARGS` before `$DECODE_MTP_SIZE`
- `kv_cache_flags`: `--kv_cache_dtype` flag string
- `hf_overrides`: `--hf-overrides`

PR Review Checklist
🤖 Generated with Claude Code
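For reviewers unfamiliar with the `yaml.safe_load` lookup mentioned in the Summary, here is a minimal sketch of how a per-model entry might be turned into shell `export` lines. It is not the code in this PR. The function names, the top-level layout of the YAML file (model name mapped to its entry), and the assumption that `env` is a list of `KEY=VALUE` strings are all hypothetical; only the `env` field name and the use of `yaml.safe_load` come from the description above.

```python
import shlex


def load_model_entry(path, model_name):
    """Read one model's entry from a YAML file (hypothetical layout:
    a top-level mapping of model name -> entry). Requires PyYAML."""
    import yaml  # imported lazily so env_exports() works without PyYAML

    with open(path) as fh:
        data = yaml.safe_load(fh) or {}
    return data.get(model_name, {})


def env_exports(entry):
    """Turn the entry's `env` items into shell `export` lines.

    Assumes `env` is a list of "KEY=VALUE" strings; values are quoted
    so that spaces and shell metacharacters survive `eval`.
    """
    lines = []
    for pair in entry.get("env", []):
        key, _, value = pair.partition("=")
        lines.append(f"export {key}={shlex.quote(value)}")
    return lines


if __name__ == "__main__":
    # Hypothetical entry, standing in for what load_model_entry() returns.
    sample = {"env": ["EXAMPLE_FLAG=1", "EXAMPLE_PATH=/tmp/my dir"]}
    print("\n".join(env_exports(sample)))
```

A shell script could consume this with something like `eval "$(python3 helper.py)"`, where `helper.py` is a placeholder name. Keeping the per-model values in YAML means a new model needs only a new entry rather than another `MODEL_NAME` branch in the script.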