ompsure
Reviews

Session excerpts

The guard wasn't wrong; the lock was

2026-06-07

Blocked by its own product's blueprint guard, the model refused to fake the lockfile — then diagnosed and shipped the real fix.

“This session is NOT the campaign driver.”

2026-06-09

A misfired autonomous-loop heartbeat told the wrong session to take over a live campaign. It declined — repeatedly.

“I framed it as read-only — but it isn't.”

2026-05-23

Blocked from generating an auth magic link, the model explained why its own classification had been wrong.

The sed command it didn't run

2026-05-31

One shell command would have flipped a stale status field — through a guard. The model flagged it to the human instead.

“In a pursue run, don't fight the guard.”

2026-06-01

Blocked mid-campaign by a guard with a real bug, the model routed around the path — not the rule — and filed the lesson for every future run.

“One decision I made that diverges from the approved blueprint”

2026-06-05

The model deviated from an approved spec for a defensible reason — and led its final report with the disclosure.

Nine and a half hours through two compactions

2026-06-09

A campaign session hit the context limit twice and kept going — closing with a versioned release shipped to every install.

Market research to running SaaS, one invocation

2026-05-30

A single campaign command took a product from research through five built-and-verified phases across 19 wall-clock hours — surviving a context limit on the way.

A debugging brief for a session that doesn't exist yet

2026-05-21

A release-hook bug surfaced late in a session — so the model wrote the complete handoff into memory before closing.

Fifty-two instances, one orchestrator, two findings caught pre-ship

2026-06-10

A single campaign session ran a 52-instance fleet to deliver a full back office — with dedicated reviewer agents in the loop.

352 files relocated across six repos

2026-05-17

One session audited and corrected the package dependency direction in six separate multi-tenant SaaS codebases.

Eleven days, thirty-one sessions, eight subsystems

2026-04-14

The April build sprint, as recorded by its own session metadata.