
Add new Iceberg BatchedWriter that auto handles batches and is durabl… #1184

Merged

bbalser merged 3 commits into main from bbalser/helium-iceberg-batched-writer on May 5, 2026

Conversation

@bbalser bbalser commented May 5, 2026

Add BatchedWriter to helium_iceberg with crash-recovery spooling

Summary

Introduces BatchedWriter<T> — a batching layer over IcebergTable<T> that accumulates records, spools them to disk, and commits them to Iceberg in larger snapshots. Designed for streaming-ingestion call sites that today either commit per record (snapshot churn) or hand-roll their own buffering.

Records are durably spooled in Arrow-IPC stream format on the way in, so a process abort between flushes is recoverable: on next startup the new task replays any leftover spool files for its table before accepting new traffic.

Public API

let (writer, task) = BatchedWriter::new(
    table,
    BatchedWriterConfig::new(spool_dir)
        .with_max_batch_size(10_000)
        .with_batch_timeout(Duration::from_secs(60)),
);

// Register `task` with TaskManager (impl ManagedTask), or spawn `task.run(shutdown)`.

writer.queue(record).await?;          // returns once on disk (kernel page cache)
writer.queue_all(records).await?;     // batch variant
writer.flush().await?;                // forces an Iceberg commit, waits for result
  • queue / queue_all are ack'd: they don't return until the records have been pushed through the spool's BufWriter to the kernel page cache. After they return, the records survive a process abort.
  • The task triggers an Iceberg commit when the spool reaches max_batch_size, when batch_timeout elapses, on explicit flush(), on triggered::Listener shutdown, or on channel close (a run-loop sketch follows this list).
  • Each commit emits an info log: flushed batch to iceberg table="ns.tbl" reason="size" records=10000 duration_ms=842. Reasons: "size", "timeout", "manual", "shutdown", "channel_closed".
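
The loop below is a minimal sketch of those commit triggers, assuming a command channel between the writer handle and the task. Command, run_loop, and commit are illustrative names, not the crate's actual internals; only the trigger reasons, the triggered::Listener shutdown, and the config knobs come from this PR.

use std::time::Duration;
use tokio::{sync::{mpsc, oneshot}, time};

// Hypothetical message type between the writer handle and the task.
enum Command {
    Queue(Vec<String>),         // records destined for the spool
    Flush(oneshot::Sender<()>), // explicit flush, acknowledged once committed
}

async fn run_loop(
    mut commands: mpsc::Receiver<Command>,
    shutdown: triggered::Listener,
    max_batch_size: usize,
    batch_timeout: Duration,
) {
    let mut buffered = 0usize;
    let mut timer = time::interval(batch_timeout);

    loop {
        tokio::select! {
            // Shutdown signal: commit whatever is spooled, then exit.
            _ = shutdown.clone() => {
                commit(&mut buffered, "shutdown").await;
                return;
            }
            // Commit even a small batch once batch_timeout elapses.
            _ = timer.tick() => {
                if buffered > 0 {
                    commit(&mut buffered, "timeout").await;
                }
            }
            cmd = commands.recv() => match cmd {
                Some(Command::Queue(records)) => {
                    buffered += records.len(); // the real task appends to the spool here
                    if buffered >= max_batch_size {
                        commit(&mut buffered, "size").await;
                    }
                }
                Some(Command::Flush(ack)) => {
                    commit(&mut buffered, "manual").await;
                    let _ = ack.send(());
                }
                // All writer handles dropped: final commit, then exit.
                None => {
                    commit(&mut buffered, "channel_closed").await;
                    return;
                }
            },
        }
    }
}

// Stand-in for the real spool flush + IcebergTable commit.
async fn commit(buffered: &mut usize, reason: &str) {
    tracing::info!(reason, records = *buffered, "flushed batch to iceberg");
    *buffered = 0;
}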

How the spool works

  • One {namespace}__{table}__{uuid_v7}.arrow file per task, opened lazily on first append (so a clean shutdown with nothing buffered leaves the dir empty).
  • Append path: T → RecordBatch → StreamWriter::write → BufWriter::flush (kernel page cache). Wrapped in spawn_blocking because arrow-ipc is sync-only.
  • Flush path: StreamWriter::finish → File::sync_data (the only fsync) → read all batches back via StreamReader → IcebergTable::write_record_batches → delete file (a sketch of this lifecycle follows the list).
  • Replay path: scan the dir for files matching the table's prefix, read each, commit, delete. Truncated trailing batches (kill -9 mid-append) are detected and dropped via the IPC stream's length-prefix framing.
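
The snippet below is a rough, self-contained sketch of the append, flush, and read-back steps above using the public arrow-ipc stream APIs. The schema, file handling, and error handling are simplified assumptions rather than the spool module's actual code, and the real append path runs inside spawn_blocking.

use std::{fs::File, io::{BufReader, BufWriter}, path::Path, sync::Arc};

use arrow::{
    array::{ArrayRef, StringArray},
    datatypes::{DataType, Field, Schema},
    error::ArrowError,
    ipc::{reader::StreamReader, writer::StreamWriter},
    record_batch::RecordBatch,
};

fn spool_round_trip(path: &Path) -> Result<Vec<RecordBatch>, ArrowError> {
    // Stand-in schema; the real spool derives it from T's Arrow conversion.
    let schema = Arc::new(Schema::new(vec![Field::new("payload", DataType::Utf8, false)]));
    let batch = RecordBatch::try_new(
        schema.clone(),
        vec![Arc::new(StringArray::from(vec!["a", "b"])) as ArrayRef],
    )?;

    // Append path: T -> RecordBatch -> StreamWriter::write, buffered by a BufWriter.
    let mut writer = StreamWriter::try_new(BufWriter::new(File::create(path)?), &schema)?;
    writer.write(&batch)?;

    // Flush path: close the IPC stream; the real code then calls File::sync_data,
    // the single fsync, before reading the file back for the Iceberg commit.
    writer.finish()?;
    drop(writer); // flushes the BufWriter

    // Read-back / replay path: iterate batches; a truncated trailing batch from a
    // kill -9 mid-append surfaces as an Err here and can simply be dropped.
    let reader = StreamReader::try_new(BufReader::new(File::open(path)?), None)?;
    Ok(reader.filter_map(Result::ok).collect())
}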

Files changed

File | Change
helium_iceberg/src/batched_writer/mod.rs | New: BatchedWriter, BatchedWriterConfig, BatchedWriterTask, ManagedTask impl, run loop, log helper.
helium_iceberg/src/batched_writer/spool.rs | New: Spool — Arrow-IPC stream lifecycle, append, flush, replay.
helium_iceberg/src/iceberg_table.rs | write_data_files now accepts Vec<RecordBatch> so one snapshot can absorb many spool batches; new IcebergTable::write_record_batches skips the arrow-json round-trip; records_to_batch promoted to pub(crate).
helium_iceberg/src/lib.rs | Re-export the new types.
helium_iceberg/Cargo.toml | Add arrow-ipc, task-manager, triggered; tempfile as dev-dep.
helium_iceberg/tests/batched_writer.rs | New: 5 integration tests against the Polaris/Trino/S3 harness.
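
For context, the iceberg_table.rs row above in rough signature form. Everything here other than the names write_data_files, write_record_batches, and records_to_batch (the error type, visibility, and bodies) is a placeholder assumption, not the crate's real types.

use arrow::record_batch::RecordBatch;

// Placeholder result type standing in for helium_iceberg's own.
type Result<T> = std::result::Result<T, Box<dyn std::error::Error + Send + Sync>>;

struct IcebergTable<T>(std::marker::PhantomData<T>);

impl<T> IcebergTable<T> {
    // Previously took a single batch; now a Vec, so one Iceberg snapshot can
    // absorb every batch read back from a spool file.
    async fn write_data_files(&self, _batches: Vec<RecordBatch>) -> Result<()> {
        todo!()
    }

    // New: commit Arrow batches directly, skipping the records -> arrow-json ->
    // RecordBatch round-trip used by the record-oriented write path.
    pub async fn write_record_batches(&self, batches: Vec<RecordBatch>) -> Result<()> {
        self.write_data_files(batches).await
    }
}

// Promoted to pub(crate) so the spool can reuse the T -> RecordBatch conversion.
pub(crate) fn records_to_batch<T>(_records: &[T]) -> Result<RecordBatch> {
    todo!()
}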

Design decisions

  • Non-idempotent only. A batch aggregates records from many submissions; idempotency stays on IcebergTable directly via write_idempotent.
  • Separate API, not DataWriter<T>. write_idempotent doesn't fit batching semantics, so we don't pretend to.
  • ManagedTask integration. Mirrors FileSink — fits the workspace shutdown story.
  • spool_dir is required. No Default impl; the builder's BatchedWriterConfig::new(spool_dir) is the entry point.

@bbalser bbalser requested review from macpie and michaeldjeffrey May 5, 2026 18:55
use crate::{Error, Result};
use spool::Spool;

const DEFAULT_MAX_BATCH_SIZE: usize = 10_000;
Contributor

What unit is this?
glacon-rs uses the bytesize crate that provides some const constructors.

Collaborator Author

just the number of records, not bytesize

@bbalser bbalser marked this pull request as ready for review May 5, 2026 20:15
@bbalser bbalser merged commit 9c3ac8a into main May 5, 2026
29 checks passed
@bbalser bbalser deleted the bbalser/helium-iceberg-batched-writer branch May 5, 2026 20:26