Add FP32 operators and tiling support for MicroLlama on Snitch #153
Draft: lee2716 wants to merge 59 commits into pulp-platform:devel from lee2716:devel
59 commits
1a6308e lee2716 Add FP32 operators for MicroLlama on Snitch (untiled)
1bdf9c9 lee2716 Add tiling support for MicroLlama on Snitch
35d51ef lee2716 Fix Snitch tiled platform by unifying mapping to use TilingReadyBindings
76a4678 lee2716 add full microllama model in ci test
c222810 lee2716 add comments with information about operations
497e9c1 lee2716 generalize RMSNorm to support full ONNX spec
bf5ddb7 lee2716 Fix broadcast stride calculation for inputs with different ranks in A…
7e3659d lee2716 delete unused function
28280fb lee2716 delete the comment
ac5d541 lee2716 update year to 2026
c90b35c lee2716 Fix: Revert batch_size type to uint32_t based on review
5669c28 lee2716 update year to 2026
cf4d9bd lee2716 update year to 2026
c04bd6a lee2716 remove code duplication
bdc550e lee2716 remove code duplication
b53ff75 lee2716 update year to 2026
b355624 lee2716 update year to 2026
5306134 lee2716 recover the Gemm_fp32
32d88c0 lee2716 improve multicore transpose
7092e35 lee2716 format: run make format on Snitch platform code
b2199cb lee2716 pytest: add microLlama model to Snitch test configurations
7ad03a3 lee2716 style: consolidate imports in Snitch platform
d76f6f1 lee2716 refactor: restore Snitch framework code to origin/devel
1c62b68 lee2716 fix: Reshape operator for Snitch platform
32d4bfa lee2716 fix: Add broadcasting support and compatible type inference
66c4b4f lee2716 make format update
be96413 lee2716 update test paths for reorganized RMSNorm and microLlama directories,…
89d382a lee2716 refactor: general ONNX broadcasting for Div/Mul/Add
e55c7bc lee2716 fix: enable tiled deployment for NOP operations and L2 memory management
13a4e64 lee2716 fix: restore NOPTileConstraint compatibility with Siracusa/Neureka ti…
55b6750 lee2716 fix: correct integer type inference for all-zero input arrays
06010e4 lee2716 fix: preserve original dtype for all-zero input type inference
85a68fd lee2716 make format update
182a2c3 lee2716 update rmsnorm test
03125c0 lee2716 feat: replace trivial all-1.0 weights with true FP32 random initializ…
027ccab lee2716 merge HardSwishChecker, rename parser, fix Softmax types, yapf fix
064981a lee2716 fix: multi-core safe memory allocation for Snitch platform
9be8768 lee2716 feat: multi-core MatMul, Softmax kernels and fix Mul template
fdc0c82 lee2716 refactor: slim Snitch parsers, add MatMul_fp32.c, remove unsupported …
7813684 lee2716 remove if (snrt_is_compute_core())
4865516 lee2716 fix:correct RMSNorm op count from 5*inputSize to 6*inputSize
fc8ea3f lee2716 refactor: use SkipTransformer with pointer assignment for Reshape, av…
b6b6eb5 lee2716 simplify: remove unused broadcasting logic from FloatDiv/Mul TileCons…
4e8448b lee2716 cleanup: remove unused BasicTransformer and Basic*Bindings dead code
7003801 lee2716 fix CI test of snitch
e693be7 lee2716 fix: add int8→int32 MatMul binding to fix Snitch Integer MatMul CI test
1633a71 lee2716 fix CI test of snitch
0a66cf4 lee2716 refactor: reuse Generic GatherTemplate and revert NOPTileConstraint
6b357cf lee2716 refactor: simplify Snitch parsers, templates, and bindings
c7b9771 lee2716 refactor: address reviewer comments and reduce code duplication
6e736f4 lee2716 Merge remote-tracking branch 'origin/devel' into update_lee2716_devel
1062ec2 lee2716 ci: split snitch workflows into separate kernels/models jobs
f7d62b0 lee2716 fix(snitch): remove singleton mutation in FloatDiv/FloatMulTemplate, …
ebf9009 lee2716 refactor(snitch): restore Gemm/RqGemm TileConstraint imports to origi…
ebe3e46 lee2716 Merge remote-tracking branch 'origin/devel' into update_lee2716_devel
94aea9e lee2716 docs(snitch): clarify TransposeTemplate parallelization heuristic
9fa6fd7 lee2716 refactor(tiling): move scalar broadcast handling into Generic BOPTile…
f87f98b lee2716 docs(snitch): explain why InitNetwork is called from all cores
1924e5e lee2716 refactor(snitch): merge Add_fp32 into Add_fp32_broadcast with element…
diaconuccalin marked this conversation as resolved.
@@ -31,3 +31,5 @@ ignore:
 - "**/toolchain/"
 # Ignore all files in .git
 - "**/.git/**"
+# Ignore all files in .venv
+- "**/.venv/"