Fix Gemma4 MoE Triton flex stage config by FurtherAI · Pull Request #742 · OpenPipe/ART

FurtherAI · 2026-06-29T00:45:23Z

Summary

Add the Gemma4 MoE compile-crash signature for 512-wide global attention.
Carry the Gemma4 flex compile-crash config through the attention wrapper into the copied layer configs.
Make the signature-selected Triton num_stages=2 take precedence when the Triton backend is forced.
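
For readers unfamiliar with the pattern, here is a minimal sketch of how a signature-selected override could be layered on top of a forced backend and then preserved when layer configs are copied. It is not the code from this PR: `AttentionSignature`, `LayerConfig`, `CRASH_SIGNATURES`, `resolve_kernel_options`, `copy_layer_config`, and the `"backend"` option key are all hypothetical names chosen for illustration. Only the Gemma4 MoE / 512-wide global attention signature and `num_stages=2` come from the summary above.

```python
from dataclasses import dataclass, replace
from typing import Dict, Optional, Union


@dataclass(frozen=True)
class AttentionSignature:
    """Hypothetical lookup key describing an attention layer."""
    model_family: str
    is_moe: bool
    is_global: bool
    width: int


@dataclass(frozen=True)
class LayerConfig:
    """Hypothetical per-layer config; the overrides must survive copies."""
    layer_idx: int
    flex_overrides: Optional[Dict[str, Union[str, int]]] = None


# Hypothetical registry: known compile-crash signatures and their workarounds.
CRASH_SIGNATURES = {
    AttentionSignature("gemma4", is_moe=True, is_global=True, width=512): {
        "num_stages": 2,
    },
}


def resolve_kernel_options(sig, forced_backend=None):
    """Build kernel options, applying the signature override last so it wins."""
    options = {}
    if forced_backend is not None:
        # Forcing a backend must not discard signature-specific settings.
        options["backend"] = forced_backend
    options.update(CRASH_SIGNATURES.get(sig, {}))
    return options


def copy_layer_config(cfg, layer_idx):
    """Copy a layer config for another layer, keeping the flex overrides."""
    overrides = dict(cfg.flex_overrides) if cfg.flex_overrides else None
    return replace(cfg, layer_idx=layer_idx, flex_overrides=overrides)


sig = AttentionSignature("gemma4", is_moe=True, is_global=True, width=512)
opts = resolve_kernel_options(sig, forced_backend="TRITON")
base = LayerConfig(layer_idx=0, flex_overrides=opts)
copied = copy_layer_config(base, layer_idx=1)
```

Applying the signature lookup after the forced-backend branch is the design choice that matters here: if the forced path returned early, the `num_stages` workaround would be dropped in exactly the configuration where it is needed.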

Direct compile-flag probes passed.
Scratch CP worker passed for tp1/ep2/cp2/no-sp; the generated 512-head kernels used num_stages=2.
ruff check and ruff format passed.
Commit hooks passed.

No new tests are included.

FurtherAI added 2 commits June 28, 2026 22:15

Fix Gemma4 MoE Triton flex stage config

2a8104d

Remove Gemma4 compile flag tests

77d2b04

FurtherAI merged commit b2347d2 into main Jun 29, 2026
5 checks passed

FurtherAI deleted the austin/gemma4_moe_num_stages2_fix branch June 29, 2026 01:04