Test-Driven Development

Autonomous Agents

January 3, 2026

More on this Later, for now only this:

Sunday Evening 22/02/2026

I've been having some chats lately about best-practice references for 'eval-driven development for autonomous AI agents'.

The idea here is to set up (similar to a CI/CD pipeline) a series of acceptance tests that every new edit by an agent on your prodject has to satisfy, before they are allowed to commit their changes. For example, when working on a React project, you can specify a bunch of acceptance tests in the agents.md files (so that every time a coding agent makes any changes) it will for example first

I don't systematically use this enough in setting up the agents.md files, and here I will soon document some best practice summaries for this