The line between "AI writes about science" and "AI does science" is getting blurry fast. A new class of LLM agents doesn't just summarize papers or write code — they design experiments, control lab equipment, and iterate on results with minimal human intervention.

AI-Mandel: The AI Physicist

Built by Sören Arlt, Xuemei Gu, and Mario Krenn[1], AI-Mandel is an LLM agent that generates and implements ideas in quantum physics. Here's the workflow:

Specific ideas it came up with include new variations of quantum teleportation, quantum network primitives using indefinite causal orders, and novel geometric phases based on closed loops of quantum information transfer.

Coscientist: The Chemistry Robot

From Carnegie Mellon University[2], Coscientist (powered by GPT-4) operates in actual chemistry labs. It has:

It bridges LLM reasoning with physical lab equipment — not just simulation.

The AI Scientist: Full Pipeline Automation

Sakana AI's The AI Scientist[3] (August 2024) aims for the full research lifecycle:

It produces papers for about $15 each. One of its generated papers was accepted to a workshop track at a top ML conference.

Agent Laboratory: The Research Team

Agent Laboratory[4] uses multiple specialized LLM agents working together:

There's also AgentRxiv[5] (March 2025), where autonomous agents upload, retrieve, and build on each other's research — cumulative scientific discovery without human intermediaries.

The Three Levels of Autonomy

Researchers classify LLM scientific agents into three tiers:

Why This Matters

We're not at "AI replaces scientists" yet. But we're at "AI is a genuine co-investigator" — one that works 24/7, reads everything, never forgets a paper, and can iterate through a thousand failed ideas before breakfast. The bottleneck is shifting from "who has the idea" to "who validates it."

References

Further Reading