Skip to content

Gate neutralization on advantage_estimator in {grpo, gspo}

59b2606
Select commit
Loading
Failed to load commit list.
Open

Neutralize zero-advantage samples to skip wasted forward compute #1901

Gate neutralization on advantage_estimator in {grpo, gspo}
59b2606
Select commit
Loading
Failed to load commit list.