feat(audio): add stream audio encoder for turn detection by chenghao-mou · Pull Request #5494 · livekit/agents

chenghao-mou · 2026-04-20T06:27:49Z

Added a stream audio encoder for turn detection, supporting opus, mp3, and pcm

Added a stream audio encoder for turn detection, supporting opus, mps, and pcm

theomonnom · 2026-04-24T02:43:41Z

+        return data
+
+
+class AudioStreamEncoder:


We should encode in another thread, like we do for our AudioDecoder

I thought about this before, but I can see some difference here:

Decoder: we need a thread so that the blocking read() wait doesn't stall the event loop

Encoder: caller pushes data (calling encode() when we have a frame) → no blocking wait, no thread needed

I can create a threaded version and show some benchmarks.

Here are the results:

metric sync threaded

push mean 1,186 us 40 us

push p95 2,053 us 45 us

push max 4,303 us 728 us

first page 4.0 ms 10.0 ms

inter-page mean 990 ms 989 ms

inter-page median 990 ms 990 ms

pages / bytes 7 / 1520 7 / 1520

Threaded version has a 6ms delay for the first page, but all of them are pretty much invisible in real-time load (60ms input frame size, opus needs about 16 frames for a page)

BTW, I updated the eot PR to use the threaded version: https://github.com/livekit/agents/pull/4722/changes#diff-07d680088a7c2a58bad7bec653cc4d5197cc212269eb0d76d35eab64a1195b07

So the Opus encode is almost instantaneous? Tho what if you push more than 60ms? like if you push 500ms?
isn't it going to block? I understand we will push tiny frames for the barge-in model, but since this is a public utility, we still need to get the interface right

The sync version is still blocking 4ms sometimes, for the asyncio it's still not ideal (it accumulates with the user code and a lot of stuff inside our framework).

Oh, reading this comment #5494 (comment)

Seems like we should close this PR then?

chenghao-mou added 2 commits April 20, 2026 14:26

feat(audio): add stream audio encoder

9ddba79

Added a stream audio encoder for turn detection, supporting opus, mps, and pcm

update uv.lock

4a883ad

chenghao-mou requested a review from a team April 20, 2026 06:28

This comment was marked as resolved.

Sign in to view

chenghao-mou added 2 commits April 20, 2026 15:07

address comments

fbea841

add test in makefile

b03946d

This comment was marked as resolved.

Sign in to view

export

66019f1

theomonnom reviewed Apr 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(audio): add stream audio encoder for turn detection#5494

feat(audio): add stream audio encoder for turn detection#5494
chenghao-mou wants to merge 5 commits intomainfrom
chenghao/feat/streaming-encoder

chenghao-mou commented Apr 20, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

theomonnom Apr 24, 2026 •

edited

Loading

Uh oh!

chenghao-mou Apr 24, 2026

Uh oh!

chenghao-mou Apr 24, 2026

Uh oh!

chenghao-mou Apr 24, 2026

Uh oh!

theomonnom Apr 24, 2026 •

edited

Loading

Uh oh!

theomonnom Apr 24, 2026

Uh oh!

theomonnom Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

metric	sync	threaded
push mean	1,186 us	40 us
push p95	2,053 us	45 us
push max	4,303 us	728 us
first page	4.0 ms	10.0 ms
inter-page mean	990 ms	989 ms
inter-page median	990 ms	990 ms
pages / bytes	7 / 1520	7 / 1520

Conversation

chenghao-mou commented Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

theomonnom Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

chenghao-mou Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

chenghao-mou Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

chenghao-mou Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

theomonnom Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

theomonnom Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

theomonnom Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

chenghao-mou commented Apr 20, 2026 •

edited

Loading

theomonnom Apr 24, 2026 •

edited

Loading

theomonnom Apr 24, 2026 •

edited

Loading