Project Page Technical Report

Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring

arXiv:2604.05854

Open-source framework for autonomous experiment iteration with zero-cost monitoring, constant-size memory, and a minimal-toolset leader-worker architecture.

arXiv Website GitHub Architecture AI Guide

Deep Researcher Agent is an autonomous framework for long-horizon deep learning experimentation: it reads a project brief, plans the next run, edits or configures code, launches training, monitors progress without calling the LLM, and wakes up again only when reflection is needed.

📋 Overview

We present Deep Researcher Agent, an open-source framework that enables large language model agents to autonomously conduct deep learning experiments around the clock. Unlike research assistants that focus only on writing or code generation, the framework covers the full experiment lifecycle: hypothesis formation, code implementation, training execution, result analysis, and iterative refinement. The system is designed for long-horizon deployment, with a persistent THINK → EXECUTE → REFLECT loop, fixed-size memory, and zero-LLM monitoring during training.

📊 Reported Results

500+

Autonomous experiment cycles completed

52%

Best single-project improvement over baseline

Concurrent projects managed across GPU servers

30+ days

Longest continuous autonomous operation

~$0.08

Average LLM cost per 24-hour cycle

🏗️ Core Design

The system combines a persistent leader, lightweight workers, fixed-size memory, and training-time zero-cost monitoring.

THINK   -> analyze brief, history, and unfinished questions
EXECUTE -> modify code/config, dry-run, launch training on GPU
MONITOR -> check PID, GPU, and logs with no LLM calls
REFLECT -> parse results, compare with baselines, choose next step
REPEAT  -> continue 24/7 with compact rolling memory

The architecture follows a leader-worker pattern: the leader keeps cycle-level reasoning coherent, while specialized idea, code, and writing workers stay stateless and cheap. Memory is split into a frozen project brief and a compact rolling log so the context size stays stable even after weeks of operation. During long training windows, the agent does not poll the LLM. It simply checks process liveness, GPU state, and recent log tails, which keeps runtime cost practical for real research usage.

📄 BibTeX

@article{zhang2026deepresearcheragent,
  title={Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring},
  author={Zhang, Xiangyue},
  journal={arXiv preprint arXiv:2604.05854},
  year={2026}
}

⚙️ Quickstart

The repository is designed so a researcher can hand the setup to an AI assistant and start from a single project brief.

git clone https://github.com/Xiangyue-Zhang/auto-deep-researcher-24x7.git
cd auto-deep-researcher-24x7
pip install -r requirements.txt
python install.py
python -m core.loop --check

After installation, the user prepares a PROJECT_BRIEF.md describing the goal, codebase constraints, and experiment directions. The agent then uses that brief as the long-term task definition and keeps iterating automatically. The repository also ships with an AI guide so first-time users can paste one file into Claude, ChatGPT, or Codex and let the assistant walk through setup step by step.

🔗 Links

Website arXiv Repository Full Architecture 中文文档