AI stream

AI Post

@hwchase17
Ai agents Medium

@hwchase17

Importance score: 4 • Posted: March 09, 2026 at 17:41

Score

4

loved this from @karpathy over the weekend I built "autoresearch but for agents" Same idea — give an AI coding agent your agent code + an eval dataset, let it experiment autonomously overnight. It modifies the code, runs evals via LangSmith, keeps improvements, discards regressions. You wake up to a better agent. Bring your own agent (any framework or none), dataset, and eval metrics. https://github.com/hwchase17/autoresearch-agents

Andrej Karpathy

Andrej Karpathy

@karpathy

2026-03-07T19:53:15.000000Z

Open

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then: - the human iterates on the prompt (.md) - the AI agent iterates on the training code (.py) The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. https://github.com/karpathy/autoresearch Part code, part sci-fi, and a pinch of psychosis :)

Grok reasoning
LangChain founder Harrison Chase on building autonomous agent improvement loop inspired by Karpathy's autoresearch, key for AI agent dev.

Likes

218

Reposts

23

Views

30,384

Tweet ID: 2031062715616436394
Prompt source: ai-news
Fetched at: March 10, 2026 at 05:40