Document read_auto operator by mavam · Pull Request #366 · tenzir/docs

mavam · 2026-05-28T15:47:35Z

🔍 Problem

read_auto adds user-facing automatic reader detection, but docs.tenzir.com has no operator reference for it.

🛠️ Solution

Add a read_auto reference page with strict detection behavior, fallbacks, probe limits, and examples.
Add read_auto to the operator reference index.

💬 Review

Check the fallback semantics and supported format list against Add automatic reader detection tenzir#6191.

_{⚙️ Code PR: tenzir/tenzir#6191}

github-actions · 2026-05-28T15:49:24Z

📦 Preview · ~~View →~~ · ⚪ Removed

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4d7d284ceb

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Add the reference entry for automatic reader detection, including strict detection behavior, fallback modes, probe limits, and examples. Assisted-by: GPT-5 (pi)

State that fallback=all chooses text or binary mode from the current probe bytes, not from the entire stream. Point users with binary payloads that start with a UTF-8 prefix to a larger probe or direct read_all binary mode. Assisted-by: GPT-5 (pi)

Replace the invalid load snippets with from_file subpipelines, matching the documented file-reading syntax for parsing byte streams. Assisted-by: GPT-5 (pi)

Document the default probe limit as 1Mi to match the TQL spelling users can configure. Assisted-by: GPT-5 (pi)

Add guide examples for rapid prototyping, mixed file drops, and TCP intake endpoints that accept several input formats. Expand the operator reference with guidance about when to choose automatic detection versus a concrete reader. Assisted-by: ChatGPT (pi)

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f5c39acde6

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Document that fallback selection waits until the probe is final. This makes the long-lived stream behavior explicit and points users to lower probe limits or concrete readers when they need immediate text parsing. Assisted-by: GPT-5 (Codex)

Describe the two detection layers in the description: capability via dry runs of the actual parsers, and a specificity order that picks the most precise format among capable readers. Document that SSV and PRI-less Syslog never auto-detect, and that output keeps the selected reader's schema name. Assisted-by: Fable 5 (Claude Code)

chatgpt-codex-connector · 2026-06-10T13:11:07Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

Assisted-by: Fable 5 (Claude Code)

chatgpt-codex-connector · 2026-06-10T13:13:57Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

Assisted-by: Fable 5 (Claude Code)

chatgpt-codex-connector · 2026-06-10T13:18:51Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

Assisted-by: Fable 5 (Claude Code)

chatgpt-codex-connector · 2026-06-10T13:44:49Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

Document that YAML auto-detection requires a map document that read_yaml would turn into an event. Assisted-by: GPT-5 Codex (OpenAI Codex)

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f9f322198c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Set a smaller probe limit in the TCP read_auto example and point known long-lived plain-text streams to read_lines directly. Assisted-by: GPT-5 Codex (OpenAI Codex)

## 🔍 Problem - Users currently need to choose a concrete reader up front, even when the input format is obvious from the first bytes. - Generic readers such as `read_lines` and `read_all` are too weak to be safe defaults for automatic parsing. ## 🛠️ Solution - Add `read_auto` as a strict detector-driven reader selector for chunk input. - Add detector variants for the first supported JSON, text-line, delimited, and magic-byte formats. - Require an explicit `fallback="lines"` or `fallback="all"` for unknown input. - Add a read-detection extension point for reader plugins. - Add a changelog entry and focused integration coverage. ## 💬 Review - Focus on detector precedence and ambiguity behavior, especially JSON object vs NDJSON and explicit fallbacks. - Verified with `scripts/build.sh tenzir-unit-test` and `uvx tenzir-test --root test --match read_auto`. <sub> 📚 Docs PR: tenzir/docs#366 </sub>

mavam mentioned this pull request May 28, 2026

Add automatic reader detection tenzir/tenzir#6191

Merged

github-actions Bot added the reference Reference documentation label May 28, 2026

chatgpt-codex-connector Bot reviewed May 28, 2026

View reviewed changes

Comment thread src/content/docs/reference/operators/read_auto.mdx Outdated

mavam added 5 commits June 6, 2026 10:23

Document read_auto operator

9d5617a

Add the reference entry for automatic reader detection, including strict detection behavior, fallback modes, probe limits, and examples. Assisted-by: GPT-5 (pi)

Clarify read_auto fallback probing

665177d

State that fallback=all chooses text or binary mode from the current probe bytes, not from the entire stream. Point users with binary payloads that start with a UTF-8 prefix to a larger probe or direct read_all binary mode. Assisted-by: GPT-5 (pi)

Use valid TQL in read_auto examples

f9e94bf

Replace the invalid load snippets with from_file subpipelines, matching the documented file-reading syntax for parsing byte streams. Assisted-by: GPT-5 (pi)

Use SI literal in read_auto docs

13a2df4

Document the default probe limit as 1Mi to match the TQL spelling users can configure. Assisted-by: GPT-5 (pi)

mavam force-pushed the topic/read-auto branch from 0067154 to f5c39ac Compare June 6, 2026 08:26

github-actions Bot added the guide How-to guides label Jun 6, 2026

chatgpt-codex-connector Bot reviewed Jun 6, 2026

View reviewed changes

Comment thread src/content/docs/reference/operators/read_auto.mdx

mavam added 2 commits June 6, 2026 10:31

Clarify read_auto fallback latency

1339c07

Document that fallback selection waits until the probe is final. This makes the long-lived stream behavior explicit and points users to lower probe limits or concrete readers when they need immediate text parsing. Assisted-by: GPT-5 (Codex)

Add detection flow diagram to read_auto docs

6755645

Assisted-by: Fable 5 (Claude Code)

Replace detection diagram with steps

6b96dda

Assisted-by: Fable 5 (Claude Code)

Use a plain list for the detection flow

b8d9345

Assisted-by: Fable 5 (Claude Code)

Clarify read_auto YAML detection

f9f3221

Document that YAML auto-detection requires a map document that read_yaml would turn into an event. Assisted-by: GPT-5 Codex (OpenAI Codex)

chatgpt-codex-connector Bot reviewed Jun 15, 2026

View reviewed changes

Comment thread src/content/docs/guides/collecting/get-data-from-the-network.mdx Outdated

Clarify TCP read_auto fallback latency

d2032ed

Set a smaller probe limit in the TCP read_auto example and point known long-lived plain-text streams to read_lines directly. Assisted-by: GPT-5 Codex (OpenAI Codex)

mavam merged commit f7bfc55 into main Jun 16, 2026
5 checks passed

mavam deleted the topic/read-auto branch June 16, 2026 14:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Document read_auto operator#366

Document read_auto operator#366
mavam merged 12 commits into
mainfrom
topic/read-auto

mavam commented May 28, 2026

Uh oh!

github-actions Bot commented May 28, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

chatgpt-codex-connector Bot commented Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

mavam commented May 28, 2026

🔍 Problem

🛠️ Solution

💬 Review

Uh oh!

github-actions Bot commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

chatgpt-codex-connector Bot commented Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions Bot commented May 28, 2026 •

edited

Loading