Dikt

Speech-to-Text for GNOME/Wayland

Dikt is a native speech-to-text application for GNOME on Wayland. It integrates directly with IBus, letting you dictate into any application with a global keyboard shortcut.

Features

Native IBus Integration — Seamless input method switching during dictation
Global Dictation Shortcut — Toggle recording from anywhere, automatic input switching
Offline Processing — All speech recognition runs locally on your device
Multi-language Support — 50+ languages supported
Multiple Recognition Engines — Whisper, Parakeet, Moonshine, SenseVoice
GNOME-Native UI — Built with GTK4 and Libadwaita
AI Post-Processing — Optional LLM-based cleanup of transcripts

Installation

Fedora / RHEL / CentOS

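# Add the repository (DNF 5 syntax, as shipped in Fedora 41 and later)
sudo dnf config-manager addrepo --from-repofile=https://rohithmahesh3.github.io/dikt-rpm/dikt.repo
# On DNF 4 (older Fedora, RHEL, CentOS) the equivalent is:
# sudo dnf config-manager --add-repo https://rohithmahesh3.github.io/dikt-rpm/dikt.repo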

# Install
sudo dnf install ibus-dikt

From Source

# Dependencies (Fedora)
sudo dnf install -y \
    rustc cargo \
    gtk4-devel libadwaita-devel graphene-devel \
    alsa-lib-devel ibus-devel glib2-devel \
    openssl-devel cmake clang-devel glslc

# Build
git clone https://github.com/rohithmahesh3/Dikt.git
cd Dikt
cargo build --release

Setup

Install Dikt (see Installation above)
Open Dikt from your application menu
Configure your dictation shortcut
Download a recognition model

That's it. Dikt automatically handles input method switching during transcription.

Usage

Press your dictation shortcut to start recording
Speak naturally
Press the shortcut again to transcribe and insert text

Dikt automatically switches to its input method during transcription and switches back when done. The text appears in whichever application has focus.

Recognition Models

Dikt supports multiple speech recognition backends:

Model	Strengths	Languages
Whisper (Small/Medium/Turbo)	High accuracy	50+
Parakeet V3	CPU-optimized, auto-detect language	50+
Moonshine	Fast, low-resource	English
SenseVoice	Optimized for CJK	Chinese, Japanese, Korean, English

Models are downloaded on demand from the preferences window.

Configuration

Open Dikt from your application menu to configure:

Language — Primary recognition language
Dictation Shortcut — Global keybinding to toggle recording
Audio Feedback — Sounds for start/stop events
Model Selection — Choose and download recognition models
Post-Processing — Optional AI cleanup via LLM

Requirements

GNOME on Wayland
IBus (default on most GNOME installations)
PulseAudio or PipeWire audio system
Microphone
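
The requirements above can be checked roughly from a terminal. The following is a minimal sketch, not part of Dikt: the function name `check_requirements` is made up for illustration, and it assumes that `XDG_SESSION_TYPE` and `XDG_CURRENT_DESKTOP` are set by your login manager and that the `ibus`, `pactl`, or `pw-cli` client tools are on your PATH when the corresponding service is installed. A missing tool does not prove the service is absent.

```shell
#!/usr/bin/env bash
# Sketch: report whether the current session appears to meet the listed requirements.
# check_requirements is a hypothetical helper, not something Dikt ships.

check_requirements() {
    local ok=0

    # XDG_SESSION_TYPE is normally set by the login manager
    if [ "${XDG_SESSION_TYPE:-}" = "wayland" ]; then
        echo "session: wayland"
    else
        echo "session: ${XDG_SESSION_TYPE:-unknown} (Wayland expected)"
        ok=1
    fi

    # XDG_CURRENT_DESKTOP may be a colon-separated list such as "ubuntu:GNOME"
    case ":${XDG_CURRENT_DESKTOP:-}:" in
        *:GNOME:*) echo "desktop: GNOME" ;;
        *) echo "desktop: ${XDG_CURRENT_DESKTOP:-unknown} (GNOME expected)"; ok=1 ;;
    esac

    # Presence of the ibus command-line tool is used as a proxy for IBus itself
    if command -v ibus >/dev/null 2>&1; then
        echo "ibus: found"
    else
        echo "ibus: not found"
        ok=1
    fi

    # Either client tool suggests a PulseAudio-compatible or PipeWire setup
    if command -v pactl >/dev/null 2>&1 || command -v pw-cli >/dev/null 2>&1; then
        echo "audio: PulseAudio or PipeWire client tools found"
    else
        echo "audio: neither pactl nor pw-cli found"
        ok=1
    fi

    return "$ok"
}

check_requirements || echo "some requirements appear unmet"
```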

Troubleshooting

Dictation shortcut not working

# Check daemon status
systemctl --user status dikt.service

# Restart if needed
systemctl --user restart dikt.service

Also ensure no other application is capturing your shortcut key.
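
One way to look for a conflicting binding is to search GNOME's own keybinding schemas for the accelerator you chose. This is only a sketch under assumptions: `find_binding` is a hypothetical helper, `<Super>d` is a placeholder accelerator, and the search covers only the three schemas listed. Custom shortcuts and shortcuts grabbed by other applications will not show up here.

```shell
#!/usr/bin/env bash
# Sketch: search GNOME keybinding schemas for an accelerator string.

# Reads "schema key value" lines on stdin (the format printed by
# `gsettings list-recursively`) and prints those containing the accelerator.
find_binding() {
    grep -F -- "$1" || true
}

ACCEL="<Super>d"   # placeholder: replace with your dictation shortcut

for schema in \
    org.gnome.desktop.wm.keybindings \
    org.gnome.shell.keybindings \
    org.gnome.settings-daemon.plugins.media-keys
do
    gsettings list-recursively "$schema" 2>/dev/null | find_binding "$ACCEL"
done
```

If this prints any lines, the named key already uses that accelerator and may be taking precedence.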

No microphone access

# Add user to audio group
sudo usermod -aG audio $USER

# Log out and back in
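
Before changing group membership, it may help to check whether you are already in the group and whether the audio server sees a capture device at all. The sketch below assumes `pactl` is installed (it also works against PipeWire when pipewire-pulse is running); `in_group` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Sketch: check group membership and list capture sources.

# Succeeds if the current user is a member of the given group
in_group() {
    id -nG | tr ' ' '\n' | grep -qx -- "$1"
}

if in_group audio; then
    echo "already in the audio group"
else
    echo "not in the audio group"
fi

# List the sources known to the audio server, if pactl is available
if command -v pactl >/dev/null 2>&1; then
    pactl list short sources || echo "could not reach the audio server"
fi
```

If a microphone source is listed, group membership is probably not the cause.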

Manual model installation

Place models in ~/.local/share/dikt/models/:

Whisper: place the .bin files directly in the models directory
Parakeet/SenseVoice: extract the .tar.gz archive into a subdirectory
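
The two cases above can be sketched as a small script. This is an illustration rather than a tool Dikt ships: `install_model` is a hypothetical helper, and naming the subdirectory after the archive is an assumption, since the layout Dikt expects inside that subdirectory is not specified here. If your archive already contains its own top-level folder, adjust accordingly.

```shell
#!/usr/bin/env bash
# Sketch: copy or extract a manually downloaded model into Dikt's models directory.
set -eu

MODELS_DIR="${MODELS_DIR:-$HOME/.local/share/dikt/models}"

install_model() {
    local src="$1"
    local name
    mkdir -p "$MODELS_DIR"
    case "$src" in
        *.bin)
            # Whisper: the .bin file goes directly into the models directory
            cp -- "$src" "$MODELS_DIR/"
            ;;
        *.tar.gz)
            # Parakeet/SenseVoice: extract into a subdirectory
            # (assumption: the subdirectory is named after the archive)
            name="$(basename "$src" .tar.gz)"
            mkdir -p "$MODELS_DIR/$name"
            tar -xzf "$src" -C "$MODELS_DIR/$name"
            ;;
        *)
            echo "unrecognized model file: $src" >&2
            return 1
            ;;
    esac
}

# Install every file given on the command line
for f in "$@"; do
    install_model "$f"
done
```

Saved under a name of your choosing, it would be run with the downloaded model files as arguments.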

Development

# Build
cargo build

# Run daemon
cargo run -- --daemon

# Run GUI
cargo run

# Run IBus engine
cargo run --bin ibus-dikt-engine -- --ibus

Roadmap

Additional distribution packages (Arch, Debian, openSUSE)
Custom vocabulary GUI integration
Real-time transcription preview
Global shortcuts on Wayland

Notes

This project was built for personal use and is shared in case others find it useful. I am a hobbyist in this domain, so the implementation may not follow all best practices. Bug reports and suggestions are welcome.

Contributing

Contributions are welcome! Please feel free to submit issues or pull requests.

License

MIT License — see LICENSE for details.

Acknowledgments

Whisper by OpenAI
IBus — Intelligent Input Bus
Handy — Original inspiration for this project

Website • Issues • Discussions
