Back to AI Lab

Framework

Research papers, repositories, and articles about framework

Showing 8 of 8 items

stable-diffusion-webui

stable-diffusion-webui by AUTOMATIC1111 is the de facto standard local web interface for Stable Diffusion, providing a massive feature set—txt2img, img2img, inpainting/outpainting, upscaling, LoRA/embeddings support, training utilities, and a huge extension ecosystem—on top of consumer GPUs. If you’re doing any kind of image generation or fine-tuning with Stable Diffusion in a local or lab environment, this is usually the first tool people reach for and the one most community workflows target. ([github.com](https://github.com/AUTOMATIC1111/stable-diffusion-webui?utm_source=openai))

158,945

NousResearch/hermes-agent

General-purpose AI agent framework that grows with user needs. Focuses on composable tools and skills instead of one fixed workflow.

26,246

GPT-SoVITS

GPT-SoVITS is a hugely popular WebUI and pipeline for few-shot TTS and voice conversion, enabling convincing voice cloning with as little as 5 seconds to 1 minute of audio, plus dataset prep tools (separation, ASR, labeling) and multi-lingual support (EN/JA/KO/ZH/Cantonese). If you’re experimenting with custom voices, VTuber-style content, or rapid TTS prototyping on consumer GPUs, this is effectively the community standard toolkit. ([github.com](https://github.com/RVC-Boss/GPT-SoVITS?utm_source=openai))

53,110

Q00/ouroboros

Positions itself as an "agent OS": specify tasks and constraints instead of hand-writing prompts. Aims to standardize how you define and compose agent behaviors. If you’re drowning in brittle prompt chains, this is a cleaner abstraction to copy. ([github.com](https://github.com/trending/python?since=daily))

3,197

pandas-ai

pandas-ai turns DataFrames and SQL/CSV/Parquet sources into a conversational interface, translating natural-language questions into code or SQL, running them in a (configurable) sandbox, and optionally using RAG and semantic schemas to answer more complex queries. It’s attractive for quickly giving analysts or business users an LLM front-end on top of existing data, though you do need to pay attention to security configurations given its history of prompt-injection/RCE issues that were later mitigated with new settings. ([github.com](https://github.com/sinaptik-ai/pandas-ai?utm_source=openai))

22,805

ed-donner/agents

Course repo focused on agent-style systems: planning loops, tools, memory, and evaluation. Uses notebooks to walk from toy examples to more realistic workflows. If you’re new to agents, this pairs well with the theory-heavy surveys above. ([github.com](https://github.com/trending/jupyter-notebook?since=daily))

4,856

tinker-cookbook

tinker-cookbook provides practical, end‑to‑end examples of post‑training LLMs using Tinker, a managed fine‑tuning API from Thinking Machines Lab that handles distributed training while you control the algorithms and data. The repo includes recipes for instruction tuning, math reasoning, RLHF-style preference learning, tool use, prompt distillation, and multi-agent setups, making it a strong starting point if you want to fine‑tune open-weight models like Llama or Qwen without building your own training stack. ([github.com](https://github.com/thinking-machines-lab/tinker-cookbook?utm_source=openai))

2,434

geoai

geoai is a Python package from the opengeos ecosystem that integrates deep-learning frameworks (PyTorch, Transformers, segmentation models) with geospatial tooling to handle everything from remote-sensing data download and tiling to training, inference, and interactive map visualization. It’s aimed at practitioners who want a higher-level, batteries-included stack for tasks like land-cover classification, building footprint extraction, and change detection, without reinventing all the GIS + ML plumbing. ([github.com](https://github.com/opengeos/geoai?utm_source=openai))

2,116