-
Notifications
You must be signed in to change notification settings - Fork 208
[AMD] Add MiniMax-M3-FP4 MI355X ATOMMESH #1856
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+153
−1
Merged
Changes from 46 commits
Commits
Show all changes
47 commits
Select commit
Hold shift + click to select a range
512504d
[AMD] server_atom: improve config print and cleanup
seungrokj 7ffb3e3
update perf-changelog for dsv4-fp4-mi355x-atom-disagg-mtp
seungrokj 50634be
[AMD] fix DECODE_MTP_SIZE and BENCH_REQUEST_RATE propagation in atom-…
seungrokj 2ccacee
[AMD] server_atom: pass SPEC_ARGS to prefill server
seungrokj dd1e8ac
[AMD] amd-master: fix comment for 1P1D TP8+DPA+TBO+MTP1 config
seungrokj 1f854e4
[AMD] dsv4_atom-disagg: remove DECODE_MTP_SIZE from check_env_vars
seungrokj 1cf914d
[AMD] bench: use --dsv4 flag for DeepSeek-V4-Pro MTP benchmarks
seungrokj 1e7f3da
[AMD] server_atom: export IS_MTP=true when SPEC_DECODING=mtp for benc…
seungrokj 638b837
[AMD] server_atom: fix hf-overrides JSON quoting
seungrokj 3c89eae
fix: inline --hf-overrides to avoid eval word-splitting, remove OPT_ARGS
seungrokj 23808cf
refactor: extract --hf-overrides into HF_OVERRIDES_ARG variable
seungrokj 78806c3
fix: enable --hf-overrides only for DeepSeek-V4-Pro
seungrokj 72734b0
fix: add HF_OVERRIDES_ARG to INFO config print block
seungrokj 688eb03
fix: replace broken-quote array splice with ${ARRAY[*]} in CMD strings
seungrokj f804274
fix: remove ${CUDAGRAPH_OPT} from decode CMD
seungrokj 931727a
feat: add MiniMax-M3 ATOM disagg CI script and server_atom.sh support
seungrokj 501a8cc
feat: add minimaxm3-fp4-mi355x-atom-disagg recipe and AITER_QUICK_RED…
seungrokj b430d91
feat: export AITER_QUICK_REDUCE_QUANTIZATION=INT4 for non-DSv4 models
seungrokj 7b80ea7
fix: server_atom.sh and minimaxm3 disagg cleanup
seungrokj 4ea680d
fix: dsv4_fp4_mi355x_atom-disagg cleanup
seungrokj 74aa3e0
fix: set BLOCK_SIZE=128 for MiniMax-M3 in minimaxm3_fp4_mi355x_atom-d…
seungrokj 26ba108
fix: use KV_CACHE_DTYPE=fp8 for MiniMax-M3 disagg (matches atom serve…
seungrokj b76105f
feat: update minimaxm3-fp4-mi355x-atom-disagg search space and disabl…
seungrokj de6ddc6
feat: add MiniMax-M3-MXFP4/MXFP8 to models_atom.yaml; set KV_CACHE_DT…
seungrokj ac99718
fix: set mi355x-disagg runner and add dynamic cudagraph sizes for dec…
seungrokj aa67d5e
fix: gate ATOM_MOE_GU_ITLV and AITER_BF16_FP8_MOE_BOUND on DeepSeek-V…
seungrokj 2de59c3
fix: preserve empty KV_CACHE_DTYPE to skip --kv-cache-dtype flag
seungrokj d19ea61
feat: update minimaxm3-fp4-mi355x-atom-disagg search space in amd-mas…
seungrokj 198e2c5
fix: use KV_CACHE_DTYPE=auto for minimaxm3 disagg to skip --kv-cache-…
seungrokj bf0538d
fix: align minimaxm3 disagg settings with slurm reference script
seungrokj 9ff735c
benchmarks: add MI355X FP4 atom disaggregated multi-node benchmark sc…
seungrokj d0de9f3
perf-changelog: append PR 1856 entry after rebase
functionstackx 190b055
chore: trim atom mesh whitespace
functionstackx b60d4ca
Merge branch 'main' into amd/atom_mesh_0619_m3
seungrokj 90d526d
fix: patch custom_all_reduce and pin MAX_MODEL_LEN for minimaxm3 atom…
seungrokj 392a286
Merge branch 'main' into amd/atom_mesh_0619_m3
seungrokj cea10d2
Merge branch 'main' into amd/atom_mesh_0619_m3
seungrokj f550a11
fix: patch custom_all_reduce.py via git sparse-checkout at pinned commit
seungrokj 2f73986
fix: bump atom image and clean up server_atom.sh
seungrokj 132f240
Merge branch 'main' into amd/atom_mesh_0619_m3
seungrokj f9545e5
fix: bump minimaxm3-fp4-mi355x-atom-disagg image to nightly_202606221656
seungrokj 3a7ab53
fix: bump minimaxm3-fp4-mi355x-atom-disagg image to MiniMax-M3-20260622
seungrokj 5f0310c
Merge branch 'main' into amd/atom_mesh_0619_m3
seungrokj 70a1a6f
Merge branch 'main' into amd/atom_mesh_0619_m3
seungrokj 1280704
Merge branch 'main' into amd/atom_mesh_0619_m3
seungrokj 7b27b66
Merge branch 'main' into amd/atom_mesh_0619_m3
seungrokj 64885fc
Merge branch 'main' into amd/atom_mesh_0619_m3
functionstackx File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.