perf(p2p): add flat connection pool decoupled from Kademlia routing table by azteca1998 · Pull Request #6504 · lambdaclass/ethrex

azteca1998 · 2026-04-20T11:42:35Z

Summary

Add a separate flat connection pool (50K capacity) for RLPx connection initiation, decoupled from the k-bucket routing table
Randomize contact selection from the pool to avoid bucket traversal bias

Builds on #6497 (peer pruning fix). Alternative/complementary to #6503 (randomization-only approach).

Problem

The Kademlia k-bucket routing table (#6458) limits stored contacts to 256 × 16 = 4,096 by design. The old flat IndexMap held up to 100K. This 25x reduction in candidate pool size caused snap sync regressions of 39-75% across all networks because:

Fewer candidates for connection initiation → slower ramp to TARGET_PEERS (100)
Faster exhaustion of candidates → more frequent retries of failed contacts
XOR distance distribution means ~87% of contacts cluster in buckets 253-255, but each bucket only holds 16

Changing k-bucket sizes would break Kademlia protocol semantics, so the routing table structure can't be modified.

Approach

Decouple "routing table" from "connection candidate pool":

K-buckets (unchanged): used for all Kademlia protocol operations (get_closest_nodes, get_nodes_at_distances, get_contact_for_lookup, etc.)
Connection pool (new): flat IndexMap<H256, Node> capped at 50K, used exclusively by get_contact_to_initiate for RLPx connection initiation

All discovered contacts are inserted into both structures. The connection pool is cleaned during prune() and uses k-bucket state for filtering when available (unwanted, fork ID validity). Contacts only in the pool (not in k-buckets) are assumed eligible — the RLPx handshake rejects incompatible peers.

Changes

Add connection_pool: IndexMap<H256, Node> field to PeerTableServer
Insert into pool on every discovery path (new_contacts, new_contact_records, insert_if_new)
Rewrite do_get_contact_to_initiate to draw from pool with random selection
Clean pool entries during prune() when contacts are discarded

Test plan

All 33 p2p unit tests pass
Run daily snapsync test and compare sync times against baseline (pre-kademlia) and perf(p2p): randomize contact selection for RLPx connection initiation #6503 (randomization-only)
Monitor peer count during sync — should show higher diversity than k-bucket-only approach

…able Add a separate IndexMap<H256, Node> connection pool (capacity 50K) for RLPx connection initiation, decoupled from the k-bucket routing table (which is limited to 256 × 16 = 4,096 contacts by Kademlia design). All discovered contacts are inserted into both the k-buckets (for Kademlia protocol operations like FindNode/GetClosestNodes) and the connection pool (for peer connection initiation). This restores the large candidate pool that existed before the k-bucket migration while preserving correct Kademlia routing semantics. The connection pool is: - Populated on every contact discovery (discv4, discv5, insert_if_new) - Cleaned during prune() when contacts are marked disposable - Capped at 50K entries with oldest-first eviction - Used with random selection and k-bucket state filtering

github-actions · 2026-04-20T11:45:11Z

Lines of code report

Total lines added: 53
Total lines removed: 0
Total lines changed: 53

Detailed view

+------------------------------------------------+-------+------+
| File                                           | Lines | Diff |
+------------------------------------------------+-------+------+
| ethrex/crates/networking/p2p/peer_handler.rs   | 555   | +4   |
+------------------------------------------------+-------+------+
| ethrex/crates/networking/p2p/peer_table.rs     | 1277  | +48  |
+------------------------------------------------+-------+------+
| ethrex/crates/networking/p2p/sync/snap_sync.rs | 1020  | +1   |
+------------------------------------------------+-------+------+

Matches the candidate pool size used by Reth and Nethermind.

github-actions · 2026-04-20T12:37:44Z

Benchmark Block Execution Results Comparison Against Main

Command	Mean [s]	Min [s]	Max [s]	Relative
`base`	62.433 ± 0.163	62.234	62.719	1.00
`head`	62.482 ± 0.189	62.260	62.948	1.00 ± 0.00

Merge PR #6503 (randomized contact selection) into PR #6504 (flat connection pool). The connection pool approach already includes randomization, so we keep its version of do_get_contact_to_initiate.

azteca1998 · 2026-04-21T10:59:35Z

Superseded by #6511 (Kademlia v2) which includes the connection pool + all performance fixes.

github-actions Bot assigned azteca1998 Apr 20, 2026

github-actions Bot added the performance Block execution throughput and performance in general label Apr 20, 2026

chore(p2p): reduce connection pool cap from 50K to 10K

6801028

Matches the candidate pool size used by Reth and Nethermind.

azteca1998 changed the base branch from main to fix/kademlia-snapsync-peer-pruning April 20, 2026 12:03

azteca1998 closed this Apr 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(p2p): add flat connection pool decoupled from Kademlia routing table#6504

perf(p2p): add flat connection pool decoupled from Kademlia routing table#6504
azteca1998 wants to merge 2 commits intofix/kademlia-snapsync-peer-pruningfrom
perf/kademlia-connection-pool

azteca1998 commented Apr 20, 2026

Uh oh!

github-actions Bot commented Apr 20, 2026

Uh oh!

github-actions Bot commented Apr 20, 2026

Uh oh!

azteca1998 commented Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

azteca1998 commented Apr 20, 2026

Summary

Problem

Approach

Changes

Test plan

Uh oh!

github-actions Bot commented Apr 20, 2026

Lines of code report

Uh oh!

github-actions Bot commented Apr 20, 2026

Benchmark Block Execution Results Comparison Against Main

Uh oh!

azteca1998 commented Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant