[ROCm]: fix: reduce MoE temp memory — embedding cap, weight sum default, skip trivial specs (PR3)#4193

Open
cj401-amd wants to merge 4 commits into AI-Hypercomputer:main from cj401-amd:cj/tmem-fixes-clean-3-moe-tmem