[Pallas] Attention perf: further reduce spillage from pre-loading Q, by loading Q in-loop and not pipelining it #2397
Draft
AmesingFlank wants to merge 1 commit into