Add broadcast-compatible scale support for RmsNorm by rsuderman · Pull Request #332 · iree-org/fusilli

rsuderman · 2026-04-10T23:48:15Z

Allow RmsNorm scale tensors to have broadcast-compatible shapes (e.g. {1,c,1,1} instead of {1,c,h,w}), letting torch.aten.rms_norm handle the broadcasting internally. Also update getScalarConstantAsm to emit the actual tensor shape instead of hardcoding tensor<1x...>.
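As a rough illustration of what "broadcast-compatible" could mean here, the sketch below checks a scale shape against an input shape. The helper name `is_broadcast_compatible` is hypothetical. The rule it applies (same rank, each scale dim either 1 or equal to the input dim) is an assumption, not necessarily the exact validation the PR adds.

```python
from typing import Sequence


def is_broadcast_compatible(scale_dims: Sequence[int], input_dims: Sequence[int]) -> bool:
    """Hypothetical check: the scale may be broadcast to the input shape.

    Assumed rule (not taken from the PR diff): ranks must match, and every
    scale dim must be 1 or equal to the corresponding input dim.
    """
    if len(scale_dims) != len(input_dims):
        return False
    return all(s == 1 or s == d for s, d in zip(scale_dims, input_dims))


# Input of shape {n,c,h,w} = {2,8,4,4}.
input_dims = (2, 8, 4, 4)
per_channel_ok = is_broadcast_compatible((1, 8, 1, 1), input_dims)  # {1,c,1,1}
per_element_ok = is_broadcast_compatible((1, 8, 4, 4), input_dims)  # {1,c,h,w}
mismatch_ok = is_broadcast_compatible((1, 3, 1, 1), input_dims)     # wrong channel count
```

Under this assumed rule, both the per-channel and the per-element scale shapes are accepted, while a scale whose channel count differs from the input is rejected.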

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: Rob Suderman <rob.suderman@gmail.com>

sjain-stanford

In isolation this change is fine, but I'm not sure this is a level of flexibility we should allow. Both PyTorch and cuDNN consistently use a per-element scale for RMS norm (i.e. of the normalized shape, matching the input shape modulo batch dims). Why would we want to differ from, or be more general than, that? If this change was prompted by bridging hipDNN -> fusilli, then it's a placebo, and the real issue lies in the hipDNN scale not matching the normalized shape.
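To make the two scale layouts under discussion concrete, here is a minimal pure-Python sketch of RMS norm over one sample whose normalized shape {c,h,w} is flattened row-major. All names (`rms_norm`, `expand_channel_scale`) and the `eps` value are illustrative assumptions, not fusilli, PyTorch, or cuDNN APIs. It shows a {c,1,1} scale being materialized into the per-element {c,h,w} form before normalization.

```python
import math


def rms_norm(x, scale, eps=1e-5):
    """RMS norm over one flattened sample; `scale` has one entry per element."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [v / rms * s for v, s in zip(x, scale)]


def expand_channel_scale(channel_scale, h, w):
    """Materialize a {c,1,1} scale as a flat, row-major {c,h,w} scale."""
    return [s for s in channel_scale for _ in range(h * w)]


c, h, w = 2, 2, 2
x = [float(i + 1) for i in range(c * h * w)]  # values 1.0 .. 8.0
per_channel = [0.5, 2.0]                      # shape {c,1,1}
per_element = expand_channel_scale(per_channel, h, w)  # shape {c,h,w}
y = rms_norm(x, per_element)
```

In this sketch the expansion happens on the caller's side, which corresponds to keeping the per-element contract. Accepting the {c,1,1} shape directly would instead move that expansion into the op.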

rsuderman requested a review from sjain-stanford April 10, 2026 23:48

sjain-stanford requested changes Apr 11, 2026

rsuderman wants to merge 1 commit into iree-org:main from rsuderman:rmsnorm_broadcast
