Skip to content

Fix LayerNorm, Multi-Query Attention, and Weight-Tying in OLMo to HF Conversion Script#820

Draft
ved1beta wants to merge 2 commits into
allenai:mainfrom
ved1beta:conv_to_HF_TODO
Draft

Fix LayerNorm, Multi-Query Attention, and Weight-Tying in OLMo to HF Conversion Script#820
ved1beta wants to merge 2 commits into
allenai:mainfrom
ved1beta:conv_to_HF_TODO

Conversation

@ved1beta
Copy link
Copy Markdown
Contributor

@ved1beta ved1beta commented Apr 3, 2025

No description provided.

@ved1beta ved1beta changed the title layer norm , multiquery , weight trying TODO Fix LayerNorm, Multi-Query Attention, and Weight-Tying in OLMo to HF Conversion Script Apr 3, 2025
@ved1beta ved1beta marked this pull request as draft April 3, 2025 19:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant