Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
34 commits
Select commit Hold shift + click to select a range
a2414ab
feat: unify shared-prefix megatron execution
bradhilton Jun 25, 2026
0e85b0b
Merge main into gdn tree core
bradhilton Jun 25, 2026
4afa197
fix sparse block mask metadata parity
bradhilton Jun 26, 2026
48bdd15
use native wandb sdk types
bradhilton Jun 26, 2026
46a9d61
fix wandb boundary test mock
bradhilton Jun 26, 2026
6c05e09
test: cover tree gdn trainability
bradhilton Jun 26, 2026
498bcd8
test: keep tree trainability smoke small
bradhilton Jun 26, 2026
6f5c379
test: relax tree trainability planner shape
bradhilton Jun 26, 2026
d5b9088
test: use production dtype for tree trainability
bradhilton Jun 27, 2026
4c2b3df
test: check tree trainability by parameter update
bradhilton Jun 27, 2026
e944f81
fix: allow partial lora target coverage
bradhilton Jun 27, 2026
1c79ace
fix: avoid qwen35 dense te compile crash
bradhilton Jun 27, 2026
b039a28
fix: disable te layernorm linear compile path
bradhilton Jun 27, 2026
0527d76
fix: graph break qwen35 dense lora fc1
bradhilton Jun 27, 2026
2acd4cb
fix: handle empty lora fc1 slices
bradhilton Jun 27, 2026
efe01ba
fix: normalize gemma4 shared expert lora keys
bradhilton Jun 27, 2026
32b14a1
style: apply megatron formatting
bradhilton Jun 27, 2026
61e4d2b
test: cover fused fc1 lora sensitivity
bradhilton Jun 27, 2026
6089a2a
test: normalize chat template tool calls
bradhilton Jun 27, 2026
d0d1681
test: strengthen flash lse sensitivity topology
bradhilton Jun 27, 2026
5474538
fix: preserve gdn tree segment order
bradhilton Jun 27, 2026
9470337
Revert "fix: preserve gdn tree segment order"
bradhilton Jun 27, 2026
eccdefb
fix: chunk-align local tree gdn forks
bradhilton Jun 27, 2026
61dba04
test: cover chunk-aligned tree gdn planning
bradhilton Jun 27, 2026
e7497a4
test: run flash lse sensitivity under flash backend
bradhilton Jun 27, 2026
6053920
test: use bf16 for flash lse sensitivity
bradhilton Jun 27, 2026
c72552d
test: stabilize qwen35 length trainability
bradhilton Jun 27, 2026
c104a2c
test: tune qwen35 length learning rate
bradhilton Jun 27, 2026
4ebb865
test: lower train-inf vllm memory reservation
bradhilton Jun 27, 2026
c39a379
fix: generalize gemma4 flex attention workaround
bradhilton Jun 27, 2026
c8cfaf1
fix: remove gemma4 shared expert lora rescale
bradhilton Jun 27, 2026
31a7954
fix: align gemma4 shared expert lora conversion
bradhilton Jun 27, 2026
e8d17f2
test: expose gdn prefill backend for train-inf validation
bradhilton Jun 27, 2026
8326658
test: stabilize model support workflow probes
bradhilton Jun 27, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
14 changes: 14 additions & 0 deletions .github/workflows/build-gpu-image.yml
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,11 @@ on:
required: true
default: true
type: boolean
prewarm_modal:
description: "Prebuild the pushed image in Modal when auth is configured"
required: true
default: true
type: boolean
prewarm_timeout:
description: "Timeout for GPU node prewarm rollout"
required: true
Expand Down Expand Up @@ -155,11 +160,16 @@ jobs:
PULL_IMAGE_REPO: ${{ inputs.pull_image_repo || 'docker.io/bradhiltonnw/art-gpu' }}
IMAGE_TAG: ${{ inputs.tag }}
NO_CACHE: ${{ inputs.no_cache }}
MODAL_TOKEN_ID: ${{ secrets.MODAL_TOKEN_ID }}
MODAL_TOKEN_SECRET: ${{ secrets.MODAL_TOKEN_SECRET }}
PREWARM_MODAL_INPUT: ${{ inputs.prewarm_modal }}
PREWARM_NODES: ${{ inputs.prewarm_nodes }}
PREWARM_TIMEOUT: ${{ inputs.prewarm_timeout }}
run: |
IMAGE_TAG="${IMAGE_TAG:-latest}"
NO_CACHE="${NO_CACHE:-false}"
export PREWARM_MODAL="${PREWARM_MODAL:-auto}"
PREWARM_MODAL_INPUT="${PREWARM_MODAL_INPUT:-true}"
PREWARM_NODES="${PREWARM_NODES:-true}"
PREWARM_TIMEOUT="${PREWARM_TIMEOUT:-30m}"

Expand All @@ -175,6 +185,10 @@ jobs:
args+=(--no-cache)
fi

if [ "${PREWARM_MODAL_INPUT}" = "false" ]; then
args+=(--no-prewarm-modal)
fi

if [ "${PREWARM_NODES}" != "true" ]; then
args+=(--no-prewarm-nodes)
fi
Expand Down
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,8 @@ data/cache.db
streaming-chat-completions/
unsloth_compiled_cache/
wandb/
!/typings/wandb/
!/typings/wandb/**
docs/node_modules/
dist/
replays/
Expand Down
Loading
Loading