Automation

Research papers, repositories, and articles about automation

ENPIRE: Agentic Robot Policy Self-Improvement in the Real World

Wraps real robots in a closed-loop system where coding agents iteratively reset scenes, run policies, check results, and improve code. If you’re serious about autonomous robot labs, this works as a blueprint.

Wenli Xiao, Jia Xie

ENPIRE: Agentic Robot Policy Self-Improvement in the Real World

Lets coding agents run real robots in a closed loop and continuously improve policies with minimal human babysitting. Robotics groups should treat this as a design template for autonomous labs.

Wenli Xiao, Jia Xie

Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering

ML-Master 2.0 introduces a "hierarchical cognitive cache" that separates short-term logs from long-term strategy for AI agents working for days on ML engineering tasks. It reports state-of-the-art results on MLE-Bench, hinting at how week-long research agents could be run.

Xinyu Zhu, Yuzhu Cai

aaif-goose/goose

Goose is an extensible coding agent that can install, run, edit, and test code with any chosen model. It’s designed to be a full engineering assistant, not just an autocomplete tool.

49,349

block/goose

Open-source AI agent that installs, edits, executes, and tests code with any language model. Targets real workflows, not just inline suggestions.

36,989

eigent-ai/eigent

Eigent is a desktop app for running multi-agent AI workflows locally. It orchestrates specialized workers, tools, and context so agents can execute long, complex jobs for you.

2,455

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

Surveys how teams use reinforcement learning plus GUI interaction to push beyond simple desktop macros into always-on "digital inhabitants". Breaks the space into offline, online, and hybrid strategies, and highlights trends like world-model training and process-level rewards. If you’re automating real GUI workflows, treat this as a roadmap, not just a survey. ([arxiv.org](https://arxiv.org/abs/2604.27955))

Junan Hu, Jian Liu

Ed1s0nZ/CyberStrikeAI

AI-native security testing platform that wraps 100+ tools behind an orchestration engine. Lets you script serious security checks with agent-style workflows.

2,288

activepieces/activepieces

A no-code workflow engine built around AI agents and MCP servers. It connects hundreds of tools so you can wire up agent workflows without writing glue code.

20,787

AIDC-AI/Pixelle-Video

End-to-end pipeline for fully automated short-form video creation with AI. Takes scripts or prompts and generates clips, edits, and captions. If you run content operations, this shows where AI video automation is headed in practice. ([github.com](https://github.com/trending/python?since=daily))

10,029

frankbria/ralph-claude-code

Ralph wraps Claude Code in an autonomous development loop that runs tasks, monitors progress, and decides when to stop. It’s a concrete testbed for hands-free agent workflows on real codebases.

1,547