Speech
Research papers, repositories, and articles about speech
microsoft/VibeVoice
Open-source frontier voice model stack from Microsoft. Aims at natural, low-latency speech AI that builders can inspect and extend.
36,444
ggml-org/whisper.cpp
A fast C/C++ port of OpenAI’s Whisper that runs on laptops, phones, and edge devices. A common choice when you need offline speech transcription.
46,559
altic-dev/FluidVoice
Offline macOS dictation app using local speech models. Shows how to package on-device AI into a polished consumer product. Useful for anyone targeting privacy-sensitive voice features. ([github.com](https://github.com/trending?since=daily))
3,704