Resource 004 / agent proof

Agent Run Receipt Checklist.

A practical receipt for agent work: what happened, what changed, what proof exists, what it cost, and what still needs a human gate.

Public safety status

This staged page uses public-facing checklist and sample copy only. Internal source maps and verification JSON are not routed into the public HTML.

This is an operating resource, not legal, security, compliance, platform-policy, pricing, checkout, service-availability, or client-work advice. Public deploy, outreach, lead capture, pricing, account, credential, payment, provider, gateway, DNS, service, or spend actions require separate approval.

Use note: operating hygiene checklist; not legal, security, compliance, pricing, or service-availability advice.

Audience: solo operators, founders, and small teams letting coding agents, browser agents, workflow agents, or content agents touch real work.

Promise: in under 15 minutes, record what the agent did, why it did it, what it touched, what it cost, what proof exists, and what is still not verified.

Ana version: autonomy without receipts is just a faster way to make a mess. Make the goblin show its work before it touches the business.


The blunt premise

An agent run is not done when the agent says “done.”

Cute. No.

It is done when a human can answer six questions without digging through a haunted scroll of logs:

  1. What was the agent supposed to do?
  2. What did it actually do?
  3. What did it read, write, publish, spend, or change?
  4. What proof can be inspected?
  5. What remains unverified?
  6. What is the next safe decision?

That is an Agent Run Receipt.

Not a SOC audit. Not compliance cosplay. Not a dashboard pretending it is a lawyer. A small operating artifact that keeps agent work useful, bounded, and explainable.


Why this exists now

Recent agent-market signals point in the same direction: people are no longer impressed by “the model generated something.” They want proof that generated work can be tested, bounded, replayed, traced, approved, and stopped.

Current signals behind this resource include:

The practical conclusion: small teams need minimum viable receipts, not enterprise theatre.


Minimum receipt vs full audit

Use the minimum receipt for ordinary agent work: drafts, research, code changes, QA runs, small automations, content packages, internal reports, and local experiments.

Use a full audit only when the run touches real risk: credentials, live accounts, public distribution, client data, payment, pricing, legal/service claims, paid provider actions, destructive writes, or material business commitments.

If every tiny run gets a courtroom binder, nobody will use it. If risky runs get only “vibes looked fine,” the invoice demon wins.

Minimum receipt

A minimum receipt should fit on one page.

FieldWhat to recordWhy it matters
Receipt IDHuman-readable name, date, or versionMakes the run findable
GoalOne concrete outcomePrevents fuzzy victory laps
Agent / workflow usedTool, agent type, or run label; keep public samples genericShows what kind of system acted
Sources usedEvidence, inference, and assumptionsSeparates proof from guesses
Action takenWhat was generated, changed, checked, or deliberately skippedStops “done” from meaning everything and nothing
Surfaces touchedFiles, repo, browser, account, channel, API, tool, or noneNames the blast radius
External side effectsnone, or exact effects: deploy, post, email, account change, provider call, spendMakes world-touching actions visible
SpendAmount by category; use $0.00 when trueBudget silence breeds goblins
Approval statusNot required, covered by a named rule, requested, blocked, or owner-approved for a specific scopeKeeps autonomy inside authority
Artifact / outputDurable path label, public URL, ticket, report, package, screenshot, or hashGives the receipt something inspectable
Verification performedRead-back, tests, schema check, smoke test, render check, safety scan, review, or “not performed”Turns claims into evidence
Claims boundaryWhat this does not proveStops delivery proof becoming demand/revenue/security proof
Unverified / open questionsAnything not checked yetKeeps uncertainty from dressing up as confidence
Next gateContinue, revise, review, publish approval, risk review, pause, or killConverts the receipt into a decision

The 15-minute fill-in version

Copy this after any meaningful agent run.

Agent Run Receipt

Receipt ID:
Goal:
Agent/workflow used:

Sources used:
- Evidence:
- Inference:
- Assumptions still untested:

Action taken:
Surfaces touched:
External side effects:
Spend:
Approval status:

Artifact/output:
Verification performed:
Claims boundary / what this does not prove:
Unverified or open questions:
Next gate:

Use none, not applicable, or not verified instead of making the receipt prettier than reality. Reality is the point.


Quick checklist before you accept “done”

1. Goal check

2. Source check

3. Action check

4. Cost and approval check

5. Proof check

6. Public-safety check

7. Decision check


When the minimum receipt is not enough

Escalate to a full audit if the run involves any of these:

Full audit does not mean panic. It means the minimum receipt gets backup: source inventory, action ledger, approval record, artifact inventory, spend table, verification evidence, risk notes, and reviewer verdict.


What good receipts sound like

Good:

> The agent drafted a resource from approved market-scan and proof-vault inputs. It wrote local Markdown/JSON files only, spent $0.00, touched no accounts or public channels, validated JSON, read files back, and still needs risk/publication review before any public use.

Bad:

> The agent created an amazing launch asset and we are ready to monetize.

That second one is not a receipt. It is a tiny fraud wearing perfume.


Tiny glossary


Public-safe use note

This checklist is operating hygiene, not legal, security, or compliance advice. It does not certify an agent system, promise safety, prove revenue, or make a public service available.

Use it to make agent work easier to trust. Then still apply the boring gates when the work touches real money, real accounts, real customers, real public channels, or real consequences.


Suggested CTA for later public use

Use this after your next meaningful agent run. If the receipt has more “not verified” than proof, do not give the agent more access yet. Start with the ugliest line. That is usually where the money, mess, or trust leak is hiding.

Sample Agent Run Receipt

Use note: public-safe sample with fictionalized/sanitized labels. No private paths, customer data, raw logs, secrets, pricing, or service commitments.

Sample: local resource draft run

FieldReceipt
Receipt IDagent-run-receipt-resource-draft-2026-06-25
GoalDraft a practical checklist that helps a small operator record what an agent did, touched, cost, proved, and left unverified.
Agent / workflow usedContent-strategy agent working from approved local research and proof-template inputs.
Sources usedEvidence: market-scan brief about agent testing, observability, audit trails, spend controls, and browser-agent risk; proof-vault minimum receipt/full audit templates; prior verified resource package. Inference: small teams need a short receipt before they need a heavyweight audit. Assumptions still untested: whether public readers want an editable template or only the checklist.
Action takenDrafted a public-safe checklist, a sample receipt, a source map, and verification notes. Kept the resource local. Did not publish, post, price, sell, gate, email, deploy, create accounts, touch credentials, or call paid providers.
Surfaces touchedLocal draft files only. No public site, no social account, no inbox, no browser login, no payment surface, no provider account.
External side effectsNone. Local file writes only.
Spend$0.00; no paid provider/render/export/subscription action in this sample.
Approval statusLocal drafting allowed. Public publication, channel posting, lead capture, pricing, checkout, client workflow, account changes, and paid provider actions remain separate approval gates.
Artifact / outputPublic-safe resource draft package with Markdown and JSON files. In public examples, use a public URL or sanitized artifact label rather than private machine paths.
Verification performedFiles were read back; JSON validation planned/performed for structured files; public-safety scan checks for private paths, secret-looking assignments, raw keys, pricing/service/legal claims, invented metrics, and lead-capture/checkout language.
Claims boundaryThis receipt proves only that a local draft package exists and was checked for basic safety. It does not prove traffic, saves, replies, buyer demand, leads, revenue, ROI, service availability, legal/compliance status, or security.
Unverified / open questionsExact public title, final CTA, publication route, risk-review verdict, and whether readers prefer a downloadable template remain unverified.
Next gateRisk/publication review before any public use. If published later, measure practical signal only: saves, replies, template requests, implementation questions, or requests for help with proof/spend/approval gates.

What would trigger a full audit here?

The minimum receipt is enough while this stays a local draft.

Escalate to full audit if the next step adds any of the following:

Sample decision

Decision: revise/review before publication.

Reason: the artifact is useful and bounded, but publication needs final public copy, CTA choice, and risk review. No victory lap until the goblin paperwork survives daylight.

Ana takeaway

Use the checklist to make agent work more inspectable before expanding access. No proof, no bigger leash; no approval, no public or money-touching click.

Back to resource index Read the build journal

Public-safety note: this static staged page does not perform account, credential, payment, outreach, deployment, provider, or gateway actions.