Opt-in support for contact matching - simplifies solver warm starting by nvtw · Pull Request #2446 · newton-physics/newton

nvtw · 2026-04-15T07:23:29Z

Description

DON'T MERGE YET. Something seems to be broken.

Add frame-to-frame contact matching to CollisionPipeline. Each frame, contacts are matched against the previous frame's sorted contacts using binary search on deterministic sort keys, then verified against world-space position distance and surface normal dot-product thresholds. The result is a per-contact match_index on Contacts (>= 0 matched, -1 new, -2 broken). Optional contact reports list newly formed and broken contact indices.

Enabled via CollisionPipeline(contact_matching=True), which implies deterministic=True because they use the same underlying sort and the same contact fingerprints.

See #2248

Checklist

New or existing tests cover these changes
The documentation is up to date with these changes
CHANGELOG.md has been updated (if user-facing change)

Test plan

uv run --extra dev -m newton.tests -k test_contact_matching

14 tests covering: first-frame sentinel values, stable-scene identity matching across multiple frames, new contact detection, position and normal threshold breaking, contact report index correctness, deterministic flag implication, and disabled-matching allocation.

New feature / API change

import newton

pipeline = newton.CollisionPipeline(
    model,
    contact_matching=True,
    contact_matching_pos_threshold=0.02,       # meters
    contact_matching_normal_dot_threshold=0.9,  # cos(~25°)
    contact_report=True,                        # optional new/broken lists
)
contacts = pipeline.contacts()

pipeline.collide(state, contacts)

# Per-contact match index: >= 0 matched, -1 new, -2 broken
match_idx = contacts.rigid_contact_match_index.numpy()

# Optional: indices of new and broken contacts
matcher = pipeline.contact_matcher
new_indices = matcher.new_contact_indices.numpy()[:matcher.new_contact_count.numpy()[0]]
broken_indices = matcher.broken_contact_indices.numpy()[:matcher.broken_contact_count.numpy()[0]]

Summary by CodeRabbit

New Features
- Frame-to-frame contact matching with configurable position and normal thresholds; per-contact match indices exposed (matched, new, broken).
- Optional contact reporting that emits compact lists of newly formed and broken contacts.
- Enabling contact matching forces deterministic contact ordering.
Documentation
- Added guidance and examples for enabling contact matching/reporting and interpreting match/report outputs.
Tests
- New test suite validating matching behavior, thresholds, reporting, and deterministic behavior.

…vs mesh collisions

adenzler-nvidia

Nice feature — well-scoped opt-in with strong invariant tests. The scratch-buffer reuse (point0/normal between sort_full and next-frame match) is genuinely clever and well-documented, and the deterministic binary-search-plus-verify shape is the right choice (O(n log n), graph-capture-safe). Graph-capture discipline throughout (preallocated buffers, dim=self._capacity launches, writing MATCH_NOT_FOUND to inactive slots so the gather-permuted result is well-defined on trailing capacity).

A few questions inline — mostly API-shape / doc-clarity, nothing blocking. One reminder: mergeStateStatus: DIRTY (40 ahead / 22 behind upstream/main), so you'll want to rebase before merge.

Independent of the inline threads, also worth engaging with:

camevor's API-shape question: should rigid_contact_match_index be an extended contact attribute (EXTENDED_ATTRIBUTES + requested_attributes={"match_index"} like "force"), and should new/broken_contact_indices live on Contacts when contact_report=True so contact_matcher stays internal? The current split (match_index on Contacts, index lists on ContactMatcher) is asymmetric, and surfacing pipeline.contact_matcher.new_contact_indices leaks a lot of internal state (_prev_sorted_keys, _prev_was_matched, _prev_count are implementation details). Worth a discussion before locking the public shape.
jumyungc's save_sorted_state vs build_report ordering question (line 266): the current order is correct — save_sorted_state overwrites _prev_count with the new count, and _collect_broken_contacts_kernel needs the old _prev_count to know how many _prev_was_matched slots to scan. There's a good comment at collide.py:1020-1021 explaining this. Could you reply with that rationale and lift a shorter form into the ContactMatcher class docstring (under "typical per-frame call sequence"), so the ordering constraint is visible from the class API?
Key-uniqueness invariant: the matcher relies on sort_key = (shape_a, shape_b, sub_key) being unique per active contact. make_contact_sort_key silently masks overflow (mesh-triangle contacts drop to 19 effective bits → ~524K triangles after the <<3 multi-contact expansion). In scenes exceeding those bit budgets two new contacts could map to the same key, the binary search lands on the same old range, both pick the same best_idx → duplicate match_index values plus an atomic_max on prev_was_matched conflating them. The invariant is inherited from the sorter, fine to treat as out of scope — but worth a one-liner in the module docstring so the next reader knows why match_index is assumed unique.
Broken-on-both-sides test missing: test_broken_pos_threshold_all_contacts verifies the new side gets -2 but not that the same old contact appears in broken_contact_indices. Combined assertion would close the most interesting semantic gap.
has_report: int is the codebase's int-as-bool convention; no change needed — noted for future cleanup when Warp gains native struct-field bools.

adenzler-nvidia · 2026-04-17T12:23:11Z

+        contact_report: bool = False,
+        device: Devicelike = None,
+    ):
+        with wp.ScopedDevice(device):


Q on CPU support: all tests use get_cuda_test_devices(), and the sorter depends on radix_sort_pairs (CUDA-only in practice), but the matcher's own kernels don't depend on anything CUDA-specific. Is CPU intentionally unsupported, or just inherited from the sorter? If intentional, a runtime check here (when device is CPU) would be friendlier than a later kernel crash — something like:

if not wp.get_device(device).is_cuda: raise RuntimeError("ContactMatcher requires a CUDA device (radix_sort_pairs is CUDA-only).")

(Or keep silent and document it as a known limitation — either works; the current state is the least kind.)

adenzler-nvidia · 2026-04-17T12:23:11Z

+            # Only buffer we must own: sorted keys survive across frames
+            # (_sort_keys_copy is overwritten by _prepare_sort each frame).
+            self._prev_sorted_keys = wp.zeros(capacity, dtype=wp.int64)
+            self._prev_count = wp.zeros(1, dtype=wp.int32)


Cross-episode reset semantics question: _prev_count persists across collide() calls (the whole point). But in RL-style workflows where a user wants to "start fresh" after state.reset() or teleporting all bodies, there's no way to zero _prev_count without rebuilding the pipeline. Contacts.clear() resets match_index=-1 but the matcher still has old previous-frame data, so the next frame produces spurious matches against bodies in their new poses.

Would you consider either (a) a ContactMatcher.reset() that zeros _prev_count (+ optionally _prev_was_matched), or (b) having Contacts.clear() coordinate with the matcher when contact_matching=True? If out of scope, a .. note:: in the class docstring flagging the persistence behavior would help users notice.

adenzler-nvidia · 2026-04-17T12:23:11Z

+
+            # Only buffer we must own: sorted keys survive across frames
+            # (_sort_keys_copy is overwritten by _prepare_sort each frame).
+            self._prev_sorted_keys = wp.zeros(capacity, dtype=wp.int64)


Nit: wp.zeros(capacity, dtype=wp.int64) — harmless but prev-keys are always overwritten by the first save_sorted_state before being read, so the initial zero values are never consumed. Initializing with SORT_KEY_SENTINEL from contact_sort.py would make a debug inspection of the buffer before the first save_sorted_state less confusing (zeros look like valid keys for shape_a=0, shape_b=0). Very minor.

adenzler-nvidia · 2026-04-17T12:23:11Z

+        data = _MatchData()
+        data.prev_keys = self._prev_sorted_keys
+        # Reuse sorter scratch buffers for prev-frame world-space data.
+        data.prev_pos_world = self._sorter._full_point0_buf


This (plus lines 382, 440, 441) reaches into ContactSorter's private _full_point0_buf / _full_normal_buf. The scratch-reuse optimization itself is great and well-motivated in the module docstring — the leaky abstraction is the only concern. Would a narrow pair of ContactSorter properties (e.g. scratch_pos_world, scratch_normal) — public names, maybe with a short "for use by ContactMatcher; do not write outside the documented window" comment — make the coupling explicit and refactor-safe? Feel free to push back if the underscore convention already communicates enough.

adenzler-nvidia · 2026-04-17T12:23:11Z

+MATCH_NOT_FOUND = wp.constant(wp.int32(-1))
+"""Sentinel: no matching key found in last frame's contacts."""
+
+MATCH_BROKEN = wp.constant(wp.int32(-2))


Nit: MATCH_NOT_FOUND / MATCH_BROKEN are module-level wp.constants but aren't re-exported through newton.geometry or the top-level. Users who'd rather write match_idx == MATCH_NOT_FOUND than match_idx == -1 can't import them. Documenting the numeric values on the public rigid_contact_match_index docstring is already done — re-export is optional, but would make user code self-documenting.

adenzler-nvidia · 2026-04-17T12:23:11Z

+            model,
+            broad_phase="nxn",
+            contact_matching=True,
+            contact_matching_pos_threshold=0.001,  # 1 mm — very tight


Using the tight 0.001 threshold here (vs the default 0.02) means the test silently becomes a pass if the default ever gets re-tuned. Could this be rewritten to exercise the default? The scene uses an infinite ground plane, so shifting spheres by e.g. 0.2 m (10× the default 0.02) still keeps them in contact while exceeding the position threshold — same invariant, and if someone later changes the default the test follows along:

pipeline = newton.CollisionPipeline( model, broad_phase="nxn", contact_matching=True, # rely on default contact_matching_pos_threshold (0.02 m) ) ... # Shift all dynamic bodies by 0.2 m (10x default threshold). for i in range(len(q)): q[i][0] += 0.2

Same story for the # 10x the threshold comment below at line 178.

# Conflicts: # newton/_src/sim/collide.py

… way - slightly more expensive

This reverts commit ac5c049.

This reverts commit bfb4dc3.

nvtw and others added 30 commits April 10, 2026 15:01

Deterministic contacts start to work for convex vs convex and convex …

3e19e78

…vs mesh collisions

Determinism starts to work for meshes

e67a7a9

Fix issues in contact pre-pruning

f2d7647

Add more unit tests

73849b1

Ran ruff

50858c7

Remove debug printf that slipped in

ad5c0f7

Forgot to add new file

7623a82

Ran ruff

4f15e42

Update the docs

f9ef6c4

Fix sort issue in contact determinism

c38b83e

Fix hte docs

92d9399

Fix the performance regression

587be69

Implement more improvements

ed6ee76

Implement CodeRabbit comments

41ed6c8

Update the changelog and the docs

d2f7a09

Implement MR comments

c7bc73c

Merge branch 'main' into dev/tw3/deterministic_contacts

5df8e2c

Merge branch 'main' into dev/tw3/deterministic_contacts

5e89322

Implement more MR comments

ac66d07

Merge branch 'main' into dev/tw3/deterministic_contacts

4d56f4f

Implement more CodeRabbit comments

45cfbf0

Improve sorting

77b5331

Improve the unit tests

bb3ec2a

Attempt to fix some remaining non-determinism

c37a749

First attempt towards contact matching

f5f78e4

Reduce memory footprint

a367f23

Bugfix

f6c61eb

Improve the test coverage

975d973

Add more diagnostics to the tests.

77882a0

Disable prepruning for determinism experiments.

3a8ed92

adenzler-nvidia reviewed Apr 17, 2026

View reviewed changes

nvtw and others added 4 commits April 17, 2026 14:53

Implement more MR comments

cb52409

Update docs

1fc6ea7

Fix CI

4379204

Merge branch 'main' into dev/tw3/contact_matching

1295271

nvtw requested review from adenzler-nvidia, camevor and jumyungc April 17, 2026 14:26

nvtw marked this pull request as draft April 17, 2026 17:32

nvtw removed request for adenzler-nvidia, camevor, eric-heiden and jumyungc April 17, 2026 17:44

nvtw and others added 16 commits April 17, 2026 20:05

Test better matching - not done yet

6691056

Ensure that only one contact can claim a contact from the last frame

d8f89c7

Small performance improvement

2c82a1d

Merge branch 'main' into dev/tw3/contact_matching

287ce43

# Conflicts: # newton/_src/sim/collide.py

Add sticky contact matching mode

cf4e985

Merge branch 'main' into dev/tw3/contact_matching

99b0e2c

Handle contact matching break distance evaluation in a more symmetric…

bd6ed7a

… way - slightly more expensive

Improve the docs

f1dca6e

Attempt to fix CI docs

4fec7f5

Merge branch 'main' into dev/tw3/contact_matching

dcd693b

Contact matching experiments

bfb4dc3

Add more diagnostic output

ac5c049

Revert "Add more diagnostic output"

49ce8bd

This reverts commit ac5c049.

Revert "Contact matching experiments"

743bac9

This reverts commit bfb4dc3.

Fix normal misalignment issue

011d75c

Unit test fixes

e6f0542

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Opt-in support for contact matching - simplifies solver warm starting#2446

Opt-in support for contact matching - simplifies solver warm starting#2446
nvtw wants to merge 63 commits intonewton-physics:mainfrom
nvtw:dev/tw3/contact_matching

nvtw commented Apr 15, 2026 •

edited

Loading

Uh oh!

adenzler-nvidia left a comment

Uh oh!

adenzler-nvidia Apr 17, 2026

Uh oh!

adenzler-nvidia Apr 17, 2026

Uh oh!

adenzler-nvidia Apr 17, 2026

Uh oh!

adenzler-nvidia Apr 17, 2026

Uh oh!

adenzler-nvidia Apr 17, 2026

Uh oh!

adenzler-nvidia Apr 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

nvtw commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Test plan

New feature / API change

Summary by CodeRabbit

Uh oh!

adenzler-nvidia left a comment

Choose a reason for hiding this comment

Uh oh!

adenzler-nvidia Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

adenzler-nvidia Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

adenzler-nvidia Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

adenzler-nvidia Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

adenzler-nvidia Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

adenzler-nvidia Apr 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

nvtw commented Apr 15, 2026 •

edited

Loading