Automation
Research papers, repositories, and articles about automation
Showing 11 of 11 items
ENPIRE: Agentic Robot Policy Self-Improvement in the Real World
Wraps real robots in a closed-loop system where coding agents iteratively reset scenes, run policies, check results, and improve code. If you’re serious about autonomous robot labs, this is basically a blueprint.
ENPIRE: Agentic Robot Policy Self-Improvement in the Real World
Lets coding agents run real robots in a closed loop and continuously improve policies with minimal human babysitting. Robotics groups should treat this as a design template for autonomous labs.
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering
ML-Master 2.0 introduces a "hierarchical cognitive cache" that separates short-term logs from long-term strategy for AI agents working for days on ML engineering tasks. It hits state-of-the-art on MLE-Bench, hinting at how to run week-long research agents.
aaif-goose/goose
Goose is an extensible coding agent that can install, run, edit, and test code with any chosen model. It’s designed to be a full engineer assistant, not just an autocomplete tool.
block/goose
Open-source AI agent that installs, edits, executes, and tests code with any language model. Targets real workflows, not just inline suggestions.
eigent-ai/eigent
Eigent is a desktop app for running multi-agent AI workflows locally. It orchestrates specialized workers, tools, and context so agents can execute long, complex jobs for you.
GUI Agents with Reinforcement Learning: Toward Digital Inhabitants
Surveys how teams use reinforcement learning plus GUI interaction to push beyond simple desktop macros into always-on "digital inhabitants". Breaks the space into offline, online, and hybrid strategies, and highlights trends like world-model training and process-level rewards. If you’re automating real GUI workflows, treat this as a roadmap, not just a survey. ([arxiv.org](https://arxiv.org/abs/2604.27955))
Ed1s0nZ/CyberStrikeAI
AI-native security testing platform that wraps 100+ tools behind an orchestration engine. Lets you script serious security checks with agent-style workflows.
activepieces/activepieces
A no-code workflow engine built around AI agents and MCP servers. It connects hundreds of tools so you can wire up agent workflows without writing glue code.
AIDC-AI/Pixelle-Video
End-to-end pipeline for fully automated short-form video creation with AI. Takes scripts or prompts to generate clips, edits, and captions. If you run content operations, this shows where AI video automation is headed in practice. ([github.com](https://github.com/trending/python?since=daily))
frankbria/ralph-claude-code
Ralph wraps Claude Code in an autonomous development loop that runs tasks, monitors progress, and decides when to stop. It’s a concrete testbed for hands-free agent workflows on real codebases.