// capture intent
=> orcaops thread
agent.session()
{...}
const why =
async await
class Decision
// flight recorder

Your agents ship code. The reasoning behind it disappears.

Orcaops is a local CLI and hosted review app that captures the plan, decisions, and reasoning behind agent-authored code, so reviewers can see what was intended instead of guessing from the diff.

Flight recorder for
AI coding agents

Agents are shipping 3,000+ line PRs. Orcaops preserves the plan, constraints, non-goals, and decisions so reviewers are not stuck reverse-engineering the why from the diff.

Works with Claude Code, Cursor, Codex, and Aider.

Free CLI for individuals
Local-first capture, optional cloud
Claude Code, Cursor, Codex, Aider

Now onboarding teams already shipping agent-authored production PRs.

Agent-authored code is shipping. The reasoning behind it isn't.

Agents now ship 3,000+ line PRs. Reviewers can't tell why decisions were made. A week later, even the developer who triggered the change can't always reconstruct it from the diff or the transcript.

The bottleneck isn't writing code anymore. It's understanding and reviewing it.

Orcaops sits next to whatever agent your team uses. While the agent works, it records the plan, constraints, non-goals, and decisions to a local thread you can review or search later. No new workflow. No spec to write up front.

How It Works

From the first prompt to a reviewable PR, the why is captured along the way

01

Capture while the agent works

The CLI runs alongside Claude Code, Cursor, Codex, or Aider. As the agent works, Orcaops writes the plan, the decisions, and the open questions to a local .orcaops/ thread, along with checkpoints and the final summary.

No new workflow. Developers keep the agents and tools they already use.

02

Catch drift in flight

Deterministic and LLM-backed evaluators flag scope drift, missed constraints, and unsupported completion claims, then re-steer the agent while the work is still in progress.

Scope drift Missed constraints "Done" claims
03

Review against intent, not just the diff

Orcaops posts a PR digest grounded in the captured plan. Reviewers can see whether the change matches the original intent and why each decision was made, without reading the full transcript.

Reviewers stop reverse-engineering intent from the diff.

04

Searchable team memory

Sessions accumulate into a cross-repo record of how and why the codebase changed. Search past agent work, enforce shared AI-workflow policies, and stop re-deriving the same context every time.

Orcaops is the constant. Your agents can change every month. The implementation record shouldn't.

What Orcaops adds to a PR

Reviewers get the implementation record behind the diff: what the agent was asked to do, what it avoided, what checks ran, and why the final shape exists.

orca digest — billing-migration · pr/247 [q] quit
billing-migration · intent verified · 1 flagged
INTENT Migrate billing jobs without changing invoice semantics.
CONSTRNTS Preserve retry behavior. No schema changes. Keep existing alert paths.
NON-GOALS Do not refactor reporting jobs or widen the rollout surface.
CHECKS Scope drift ✓ passed. Completion claims supported. ⚠ 1 open flagged for review.
WHY SHAPE Kept job boundaries intact to reduce rollout risk and avoid changing downstream invoice behavior.
5 fields · scope drift: clean · grounded in captured session

Why capture, not reconstruct

The "why" behind agent-authored code exists during the session, while the plan is changing and decisions are being made. After the session ends, all you have left is the diff and a transcript.

PR summary tools read the diff after the fact. Orcaops captures the plan, constraints, and decisions before the PR exists.

Teams use multiple agents, and the agents themselves keep changing. Orcaops is one neutral implementation record across whichever ones your team picks.

Reviewers see the plan, constraints, and decisions alongside the diff, not just the code

Drift, missed constraints, and unsupported "done" claims surface in flight, while it's still cheap to fix

Past agent sessions are searchable across repos, instead of buried in per-tool transcripts

Shared AI-workflow policies replace per-developer conventions, so quality doesn't depend on who ran the agent

One implementation record that follows the work into every PR.

What Changes

Before Orcaops

  • The "why" disappears when the agent session ends
  • Reviewers reconstruct intent from the diff
  • Each agent leaves its own transcript silo
  • Drift and missed constraints surface during review
  • Last quarter's reasoning lives in someone's head

With Orcaops

  • Plan, decisions, and constraints are captured automatically
  • PR digests show whether the change matches captured intent
  • One implementation record across Claude Code, Cursor, Codex, Aider
  • Evaluators flag drift and re-steer while work is in flight
  • Past sessions are searchable team memory across repos

Code velocity is rising. Review and team understanding aren't.

AI-authored code is becoming the default way software gets written. The bottleneck has moved from writing it to reviewing and understanding it.

Orcaops makes the code your agents are already writing something a teammate can review, search, and trust later. The speed-up your team is paying for sticks.

Capture the why behind every agent change.

Schedule Demo
Works with Claude Code Cursor Codex Aider GitHub and more