Speech

Research papers, repositories, and articles about speech

microsoft/VibeVoice

Open-source frontier voice model stack from Microsoft. Aims at natural, low-latency speech AI that builders can inspect and extend.

36,444

ggml-org/whisper.cpp

A fast C/C++ port of OpenAI’s Whisper that runs on laptops, phones, and edge devices. A common choice when you need offline speech transcription.

46,559

altic-dev/FluidVoice

Offline macOS dictation app using local speech models. Shows how to package on-device AI into a polished consumer product. Useful for anyone targeting privacy-sensitive voice features. ([github.com](https://github.com/trending?since=daily))

3,704