Back to Home11/19/2025, 5:42:44 PM

Some Latency Metrics for Voice UIs

fatruchir

1 points

1 comments

Discussion Activity

Light discussion

First comment

N/A

Peak period

Hour 1

Avg / period

Comment distribution1 data points

Based on 1 loaded comments

Key moments

01Story posted
11/19/2025, 5:42:44 PM
2h ago
Step 01
02First comment
11/19/2025, 5:42:44 PM
0s after posting
Step 02
03Peak activity
1 comments in Hour 1
Hottest window of the conversation
Step 03
04Latest activity
11/19/2025, 5:42:44 PM
2h ago
Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (1 comments)

Showing 1 comments

fatruchir

2h ago

Continuing on the journey to get my hands dirty with voice UIs - I put down some user perceived latency metrics I was seeing when building VUIs.

Key points: - I used the 'pipeline' approach of STT + LLM + TTS (as opposed to the S2S approach eg: gpt-realtime) - This approach (with my specific setup) - yielded latency far greater than the 500ms target, where conversations feel "natural" and there aren't any awkward silences - With the LLM as gpt-5-mini I saw latency at ~1.4s and with the LLM as Llama 3.1-8b on Cerebras I saws 1.1s

View full discussion on Hacker News

ID: 45982395Type: storyLast synced: 11/19/2025, 7:17:56 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Read Article View on HN