Fix LayerNorm, Multi-Query Attention, and Weight-Tying in OLMo to HF Conversion Script #820

Draft
ved1beta wants to merge 2 commits into allenai:main from ved1beta:conv_to_HF_TODO

Commits

Commits on Apr 3, 2025