Skip to content

[FEATURE]: Speech-to-Text Voice Input for Lazy People in OpenCode #4695

@Fuzu

Description

@Fuzu

Feature hasn't been suggested before.

  • I have verified this feature I'm about to request hasn't been suggested before.

Describe the enhancement you want to request

Hi! First of all, congratulations on the amazing project.

I've been working on a Speech-to-Text voice input feature that integrates directly into the TUI. It allows users to start audio recording with a keybind, automatically transcribe speech using different providers, and insert the resulting text directly into the prompt.

I've built an initial working version, currently tested only on macOS, and the system includes:

  • Real-time audio recording via FFmpeg;
  • Support for Groq Whisper, OpenAI Whisper, and local whisper.cpp;
  • Automatic microphone/device detection;
  • Interactive menus for choosing provider, model, and audio device;
  • Persistent configuration stored in ~/.opencode/state/speech.json;
  • Customizable keybinds (Ctrl+X v, Ctrl+X P, Ctrl+X D);
  • Smooth flow: record → transcribe → insert into prompt input;
Image Image Image Image Image

Would this be something you'd be interested in integrating into the project?

Metadata

Metadata

Assignees

No one assigned

    Labels

    discussionUsed for feature requests, proposals, ideas, etc. Open discussion

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions