Skip to content

Fix Gemma4 MoE Triton flex stage config#742

Merged
FurtherAI merged 2 commits into
mainfrom
austin/gemma4_moe_num_stages2_fix
Jun 29, 2026
Merged

Fix Gemma4 MoE Triton flex stage config#742
FurtherAI merged 2 commits into
mainfrom
austin/gemma4_moe_num_stages2_fix

Conversation

@FurtherAI

Copy link
Copy Markdown
Collaborator

Summary

  • add the Gemma4 MoE compile-crash signature for 512-wide global attention
  • carry Gemma4 flex compile crash config through the attention wrapper into copied layer configs
  • make signature-selected Triton num_stages=2 override forced-Triton backend selection

Testing

  • direct compile-flag probes passed
  • scratch CP worker passed for tp1/ep2/cp2/no-sp; generated 512-head kernels used num_stages=2
  • ruff check and ruff format passed
  • commit hooks passed

No new tests are included.

@FurtherAI FurtherAI merged commit b2347d2 into main Jun 29, 2026
5 checks passed
@FurtherAI FurtherAI deleted the austin/gemma4_moe_num_stages2_fix branch June 29, 2026 01:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant