Doing
Voice and visual context for AI builders. No subscription.
Mac
Audio
Alpha
Visit Website See on Product Hunt

Upvotes132

▲ 132View on ProductHunt ⧉

Comments19

19 commentsSee comments on PH ⧉

Featured onApril 9th, 2026

Hunted by

Brian Ellin

Doing is for AI builders who use voice and screenshots to bring context to Claude Code, Codex, and other AI agents. Tap a hotkey and Doing listens. Optimized over thousands of hours of building with Claude & Codex. Blazing fast, private, local, no account, no subs. Just a quality tool that you own and works well.

Top comment

Upvotes132

▲ 132View on ProductHunt ⧉

Comments19

19 commentsSee comments on PH ⧉

Product of the Day19th

👋 I'm Brian, and I created Doing to help me share the context that's in my head and on my screen with Claude Code, Codex, Gemini, and other AI agents. I've used the other tools and they are slow, expensive, and full of privacy nightmares. Doing is the opposite.
Doing is not a general purpose voice transcription tool like the myriad of other voice apps others out there, but purpose built for pairing with your agent. It is simple, fast, and gets out of your way so you can get the ideas out of your head and into the context window.

A few things I'm proud of:
🌱 Simple, effective workflow. Hold a hotkey and Doing listens. The cyan pip follows your mouse, so you'll know where your words will land. (watch the vid above)
🔥 Fastest transcription. NVIDIA's Parakeet is 10-100x faster at transcribing than the alternatives, and it runs entirely on your Mac (Apple Silicon required). There is no latency, no waiting. You really have to try the TDT approach to believe how incredible it truly is.
⚡ YOLO mode. Auto-submit your prompt after pasting. Talk, release, and your words and screenshots are already submitted. Avoid the self-editing anti-pattern and let the LLM do its thing.
🙅‍♂️ Hands free mode. Tap shift to go hands free and just talk. Tap it again to wrap it up.
📸 Screenshots. Drag a rectangle to grab visual context that's automatically added to your transcription.
📝 Markdown transcripts. Transcripts are saved locally as daily .md files. Integrates perfectly with your Obsidian knowledge base.
🔒 Truly local. Your audio never leaves your machine. Not a privacy policy, it's architecture.
📚 Nerd words. Doing is for AI builders and understands AI Engineering, Software Engineering, Product & Biz dev terms and terminology. You can add your own dictionaries, terms, and common corrections.
⚒️ Customize with skills. Post-process your transcriptions with LLM based skills, and tune based on target app. Follows the SKILLS.md standard.
Free trial, no account needed. If you like it, it's $49 and yours forever. I'll be here all day and would love to hear your questions and feedback!

Comment highlights

I've been using for a few weeks and love it. Congrats on the launch Brian!

Congratulations Brian! I'm using Aqua Voice. How do you compare with it in terms of accuracy and latency?

Mac native, local processing, no subscription, $49 forever — this is exactly how developer tools should be sold. Respect.
I'm a solo founder building a Mac-native video editor with Swift + Rust, and I've been dealing with the same transcription challenge from a different angle: speech-to-text for video editing. The speed difference between local and cloud transcription is night and day for UX.
Curious — you mentioned NVIDIA's Parakeet is 10-100x faster than alternatives. How does it compare to Whisper in terms of word-level timestamp accuracy? That's the one area where I've found local models still struggle.

YOLO mode is a bold UX choice. how did you arrive at auto-submitting without review, and have you found users need to build up trust in it before they stop second-guessing themselves?

Hey! This is super interesting to me as someone who has brought a lot of ai products into my businesses but I have one question. I spend a lot of time refining guardrails for a project, goals, acceptance criteria and describing the workflow such as following TDD. Is there a way to set this up so that some of it is reusable across projects and code bases like we have with AGENTS.md today?

bold claim on the YOLO auto-submit. that works fine for prose context but for technical specifics - variable names, file paths, error codes - voice introduces correction overhead that slows you down.

About Doing on Product Hunt

“Voice and visual context for AI builders. No subscription.”

Doing launched on Product Hunt on April 9th, 2026 and earned 132 upvotes and 19 comments, placing #19 on the daily leaderboard. Doing is for AI builders who use voice and screenshots to bring context to Claude Code, Codex, and other AI agents. Tap a hotkey and Doing listens. Optimized over thousands of hours of building with Claude & Codex. Blazing fast, private, local, no account, no subs. Just a quality tool that you own and works well.

Doing was featured in Mac (103.6k followers), Audio (2.1k followers) and Alpha (11 followers) on Product Hunt. Together, these topics include over 12.8k products, making this a competitive space to launch in.

Who hunted Doing?

Doing was hunted by Brian Ellin. A “hunter” on Product Hunt is the community member who submits a product to the platform — uploading the images, the link, and tagging the makers behind it. Hunters typically write the first comment explaining why a product is worth attention, and their followers are notified the moment they post. Around 79% of featured launches on Product Hunt are self-hunted by their makers, but a well-known hunter still acts as a signal of quality to the rest of the community. See the full all-time top hunters leaderboard to discover who is shaping the Product Hunt ecosystem.

Want to see how Doing stacked up against nearby launches in real time? Check out the live launch dashboard for upvote speed charts, proximity comparisons, and more analytics.