Back to Home11/14/2025, 7:07:45 PM

Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required

whamp

32 points

18 comments

Mood

supportive

Sentiment

positive

Discussion Activity

Active discussion

First comment

Peak period

Day 1

Avg / period

Comment distribution17 data points

Based on 17 loaded comments

Key moments

01Story posted
11/14/2025, 7:07:45 PM
4d ago
Step 01
02First comment
11/14/2025, 7:12:15 PM
5m after posting
Step 02
03Peak activity
17 comments in Day 1
Hottest window of the conversation
Step 03
04Latest activity
11/15/2025, 4:32:43 PM
3d ago
Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (18 comments)

Showing 17 comments of 18

lxe

4d ago

1 reply

I've done something similar for Linux and Mac. I originally used Whisper and then switched to Parakeet. I much prefer whisper after playing with both. Maybe I'm not configuring Parakeet correctly, But the transcription that comes out of Whisper is usually pretty much spot on. It automatically removes all the "ooms" and all the "ahs" and it's just way more natural, in my opinion. I'm using Whisper.CPP with CUDA acceleration. This whole comment is just written with me dictating to a whisper, and it's probably going to automatically add quotes correctly, there's going to be no ums, there's going to be no ahs, and everything's just going to be great.

clueless

4d ago

1 reply

Mind sharing your local setup for Mac?

lxe

4d ago

https://github.com/lxe/yapyap/tree/parakeet-nemo

It's been a while, so I don't know if it's going to work because of the Nemo toolkit ASR numpy dependency issues.

I use it for Linux using whisper CPP and it works great

hastamelo

4d ago

2 replies

how does the quality compare with the windows built in one (Win+H), the one with online models?

I'm using that to dictate prompts, it struggles with technical terms: JSON becomes Jason, but otherwise is fine

lxe

4d ago

2 replies

In my opinion, attempting to perform live dictation is a solution that is looking for a problem. For example, the way I'm writing this comment is: I hold down a keyboard shortcut on my keyboard, and then I just say stuff. And I can say a really long thing. I don't need to see what it's typing out. I don't need to stream the speech-to-text transcription. When the full thing is ingested, I can then release my keys, and within a second it's going to just paste the entire thing into this comment box. And also, technical terms are going to be just fine with Whisper. For example, Here's a JSON file.

(this was transcribed using whisper.cpp with no edits. took less than a second on a 5090)

atonse

4d ago

1 reply

I’ve been using Parakeet with MacWhisper for a lot of my AI coding interactions. It’s not perfect but generally saves me a lot of time.

lxe

4d ago

I barely use a keyboard for most things anymore.

whamp

3d ago

Yea whisper has more features and is awesome if you have the hardware to run the big models that are accurate enough. The constraint here is the best cpu only implementation. By no means am I wedded or affiliated with parakeet, it's just the best/fastest within the CPU hardware space.

whamp

4d ago

My project has a built-in word_replacement so you can automatically replace certain terms if that's important to you in the config.toml

i loved whisper but it was insanely slow on cpu only and even then it was with a smaller whisper that isn't as accurate as parakeet.

my windows environment locks down the built-in windows option so i don't have a way to test it. i've heard it's pretty good if you're allowed to use it, but your inputs don't stay local which is why i needed to create this project.

zahlman

4d ago

1 reply

> I’m allowed to run Python, but not install or launch new `.exe` files.

> NVIDIA’s ParakeetV3 model

You can't install .exe's, but you can connect to the Internet, download and install approximately two hundred wheels (judging by uv.lock), many of which contain opaque binary blobs, including an AI model?

Why does your organization think this makes any sense?

whamp

3d ago

Never said it did! Working with what I got.

hebelehubele

4d ago

1 reply

Is there a macOS equivalent of this?

My use case is to generate subtitles for Youtube videos (downloaded using yt-dlp). Word-level accurracy is also nice to have, because I also translate them using LLMs and edit the subtitles to better fit the translation.

redrove

4d ago

I use MacWhisper[1] with local Parakeet models. It’s got quite a lot of features, I myself only need the dictation.

[1] https://goodsnooze.gumroad.com/l/macwhisper

whamp

4d ago

Here is the huggingface ASR leaderboard for those wondering how parakeet V3 compares to Whisper Large V3

Accuracy Average WER: Whisper-large-v3 4.91 vs Parakeet V3 5.05

Speed RTFx: Whisper-large-v3 126 vs PArakeet V3 2154

~17x faster

https://huggingface.co/spaces/hf-audio/open_asr_leaderboard

hamza_q_

4d ago

Cool use of ONNX! Fluid Inference also have great implementations of Parakeet v2/v3 in CoreML for Apple devices and OpenVINO for Intel:

https://github.com/FluidInference/FluidAudio

https://github.com/FluidInference/eddy-audio

whamp

4d ago

btw this is my first open-source project

feynmanquest

4d ago

I built something similar for macOS that is a CLI app and generates notes for you. Also has a conversational chat interface to query your notes. Funny enough, it’s also called Chirp.

https://github.com/Code-and-Sorts/chirp-ai-note-app

1 more comments available on Hacker News

View full discussion on Hacker News

ID: 45930659Type: storyLast synced: 11/16/2025, 9:43:00 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Read Article View on HN