← The Vault

Paperclip Hit 40,000 GitHub Stars. Agent Monitoring Is Now a Standard Practice.

1,400+ public repos use it. 80K npm downloads. Paperclip is the de facto standard for agent heartbeat monitoring, and the adoption curve says something important.


Paperclip crossed 40,000 GitHub stars this week. 1,400+ public repos are using it. 80,000 npm downloads. The creator (@dotta) is anonymous. Anthropic is now open-sourcing it as part of their official agent tooling stack alongside Hermes Agent and Phone Gateway.

That last part matters. When the company building the model ships a monitoring tool as part of their standard stack, they’re telling you what they think production agents require.

What Paperclip Actually Does

Paperclip solves a specific problem: knowing whether an agent is still working.

Long-running agent tasks have a failure mode that’s hard to detect — the agent stalls mid-task, silently, without throwing an error. It looks like it’s running. It’s not making progress. You don’t find out until you check, which might be hours later.

Paperclip implements heartbeat monitoring for agent tasks. Your agent pings Paperclip at regular intervals. If the pings stop, Paperclip alerts you. Simple concept, but nobody had built a clean implementation until now.

Beyond heartbeats, it tracks task duration, completion status, and structured logs per task. When something goes wrong, you get a clear record of what the agent was doing and when it stopped.

The Adoption Curve Signal

40,000 GitHub stars for an agent monitoring tool in a nascent ecosystem is a leading indicator, not a vanity metric. It tells you that a meaningful portion of the people running agents in production have hit the “my agent silently failed and I didn’t notice” problem often enough to seek out a solution.

That’s the actual maturity signal here. The agent space is moving from “cool demos” to “things that need to keep running reliably.” Monitoring is one of the first things that becomes necessary when that shift happens. The stars are a proxy for how many builders are past the demo stage.

Anthropic folding it into their official tooling stack makes the signal stronger. They’re acknowledging that agents in production need observability infrastructure, and they’re shipping it rather than leaving builders to figure it out alone.

Source: @dotta