GitHub Repository

altic-dev/FluidVoice

June 28, 20263,704238

Summary

Offline macOS dictation app using local speech models. Shows how to package on-device AI into a polished consumer product. Useful for anyone targeting privacy-sensitive voice features. ([github.com](https://github.com/trending?since=daily))

Topics

speech local applications

View Original

Related Content

stable-diffusion-webui

stable-diffusion-webui by AUTOMATIC1111 is the de facto standard local web interface for Stable Diffusion, providing a massive feature set—txt2img, img2img, inpainting/outpainting, upscaling, LoRA/embeddings support, training utilities, and a huge extension ecosystem—on top of consumer GPUs. If you’re doing any kind of image generation or fine-tuning with Stable Diffusion in a local or lab environment, this is usually the first tool people reach for and the one most community workflows target. ([github.com](https://github.com/AUTOMATIC1111/stable-diffusion-webui?utm_source=openai))

huggingface/transformers

The standard library for state-of-the-art models in text, vision, audio, and combined formats. If you build with open models, you almost certainly depend on this already.

opendatalab/MinerU

Pipeline that converts messy PDFs and Office docs into clean markdown or JSON tuned for LLM and agent workflows. It's quickly becoming a standard pre-processing tool. Plug it in if you're serious about document-heavy RAG. ([github.com](https://github.com/trending?since=daily))

ggml-org/llama.cpp

llama.cpp keeps pushing local LLM performance on CPUs and small GPUs. It’s still the reference for running big models on modest hardware. If you care about running the AI cheaply or on-device, you should track every major change here.