Summarizer

Voice Input for Mobile Coding

← Back to Claude Code On-the-Go

To overcome the limitations of mobile keyboards, developers are increasingly leveraging high-quality transcription tools like WisprFlow, VoiceInk, and specialized AI keyboards to enable a "responsive programming" workflow on the go. By pairing these voice-to-text utilities with terminal emulators or web-based interfaces, users can issue complex instructions to AI agents like Claude Code, effectively managing servers and codebases while walking or away from their desks. While some practitioners prioritize the speed and privacy of local models like Parakeet, others focus on fine-tuning dictionaries to ensure technical jargon is accurately captured during dictation. Ultimately, the community views voice input as a powerful way to balance productivity with mobility, though some still face technical hurdles regarding transcription latency and terminal emulator glitches.

8 comments tagged with this topic

View on HN · Topics
I use the same setup myself, download WisprFlow for IOS and over time just add to its dictionary the unusual words you often use during development works perfectly, i just say what i want coded, press enter, and Claude Code just does it in my server over Termius app
View on HN · Topics
I also don't like typing long messages on my phone that's why I use this keyboard that will do high quality transcriptions via whatever AI provider you want. Much better than siri/google speech to text on device. https://github.com/DevEmperor/Dictate
View on HN · Topics
I used to use Wispr Flow but did not like the non-local aspect, and having yet another subscription, so I switched to VoiceInk (one-time payment around $30 I think), and with a locally running Parakeet v3 model on my MacBook, transcription is basically instant. I was previously using it with the local Whisper Turbo 3 which is slightly more accurate and it had a 3-4 second lag, so I was absolutely shocked how fast parakeet v3. The slight drop in accuracy is totally fine when talking to AIs, and I also have a line in my CLAUDE.md that says I am usually dictating and that it should take that into account when interpreting my messages.
View on HN · Topics
I was looking for a similar scheme, and though far from perfect I found you can run tmux+ttyd. ttyd lets you share your terminal over http. That lets you use your phone's browser (and speech-2-text).
View on HN · Topics
Been using Termux and iSH on my phones for years. You can ssh to your server or just directly code for the phone itself. I also used Web based coding environment like Glitch (R.I.P.) for years. You can do that with your virtual keyboard, voice or a even a physical keyboard via BT, e.g. Corne-ish Zen. That's how I travel. That's really nothing AI specific or novel. It's cool though. FWIW I even coined a related term https://fabien.benetou.fr/Languages/OwnConcepts#ResponsivePr... "extending responsive design to be able to program on the device, any device from eink to mobile phone to device, one is currently using not just to "consume" content, e.g read a Website that is then properly formatted for it, but rather program back that very device" That being said, if you do want to go that route check out CloudInit as it will help you (or whatever tool you rely on) to spawn new instance on your favorite cloud provider to boot specific instances and e.g. setup Docker/Podman then services, etc with no interaction. Also ntfy can help you manage notifications across devices on your own infrastructure, no 3rd parties.
View on HN · Topics
I currently use Hapi ( https://github.com/tiann/hapi/ ) for this and find it quite handy. I can easily tap into a session on my PC from my phone. Before that I used Happy ( https://happy.engineering/ ) which is also open source and a lot more sophisticated. It has a voice assistant that can chat with Claude Code on your behalf in the mobile app. However, it wasn't very reliable, and there are other reasons to use Hapi instead (documented in the Hapi repo). Before that, Omnara ( https://www.omnara.com/ ) a YC company and seemingly a proprietary Happy fork (?) but it never worked properly for me. Long story short, there are a few of the around, and frankly I really like to use them. Unlike other commenters, I don't find that they wreck my work-life balance. Rather, I can go out and have a walk in the park, only checking in on long-running tasks every once in a while. The diff view is pretty good too. There are many tasks where I'd rather not stare at my PC all day and instead do other things, and these tools allow me to do that.
View on HN · Topics
I found wisprflow (paid app) to do a great job at smart-dictating my notes / emails.
View on HN · Topics
I really want to use Claude Code on the phone or tablet, with voice commands only, and perhaps a few simple approval thumb actions. I don't want to type out complex prompt information on a virtual keyboard. I tried setting this up with some of the iOS terminal emulators, and it almost worked, but there was some glitch where Claude would try to start using the first characters that arrived from the voice command. Anyone have better results?