Show HN: Wayland Speech-to-Text Tool

github.com

16 points by artur_roszczyk 2 days ago

I vibe-coded a speech-to-text tool for Wayland that works for me. You trigger it with a keybind, speak into your mic, and it transcribes using OpenAI Whisper or Google STT (local is coming), then either types it directly into your active text field or saves it to the clipboard.

It uses PipeWire for audio capture and works signal-driven, so there's no background process running. Just on-demand transcription when you need it. I've tested it on Niri and it should work on Hyprland, though I haven't tested GNOME or KDE yet.

This was a one day Rust project and probably has some bugs since I just implemented it. It's definitely rough around the edges, but it serves its purpose for quick dictation. I'm open to feedback and input from anyone who tries it out.

KetoManx64 20 hours ago

What makes it only compatible with Wayland and not work with x11 and it's successor XLibre? Very cool tool though, I was just thinking about something similar to this the other week after seeing a post about Apple Mac implementation of this very functionality.

  • artur_roszczyk 19 hours ago

    Hey, Good question! You're right. Nothing should prevent you from using it with other environments, especially after the 0.2.x refactor, where I dropped dependencies on wtype and ydotool. Now you can pipe the transcribed text to anything you want. My motivation was that the only reason I stayed stuck with ChatGPT was access to many customized system prompts (through projects) that I could dictate to. I like speaking to AI in my native tongue. Now I can switch to Claude fully. Thanks for checking it out!