fix(conv1d): prevent gradient error by removing in-place ops by ricardocardn · Pull Request #82 · NX-AI/xlstm

ricardocardn · 2025-03-31T20:25:01Z

🛠️ Fix: Avoid in-place modification in conv1d_step to preserve autograd graph
This PR addresses a critical issue in the conv1d_step function where the in-place update of conv_state (.copy_() and [..., -1:, :] = x) was interfering with PyTorch's autograd system, breaking gradient computation during backpropagation. This change ensures that conv_state is safely updated by creating a new tensor, preserving the computational graph and enabling correct gradient flow through time.

…omputation

Ricardo added 2 commits March 31, 2025 21:14

fix(conv1d): avoid in-place ops in conv1d_step to preserve gradient c…

c98181e

…omputation

fix(conv1d): avoid in-place ops in conv1d_step to preserve gradient c…

d732255

…omputation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(conv1d): prevent gradient error by removing in-place ops#82

fix(conv1d): prevent gradient error by removing in-place ops#82
ricardocardn wants to merge 2 commits into
NX-AI:mainfrom
ricardocardn:main

ricardocardn commented Mar 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ricardocardn commented Mar 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant