DATA

Data

The third face of the environment is "does the loop get better." If money keeps it alive and tokens run it, data makes the next turn better than the last.


OVERVIEW

Does the loop improve?

Persisting and improving are different. Money, tokens, people, and validation keep the loop running, but for it to get smarter over time you must feed back the data the operation produces. A loop without data merely endures; it does not grow.

ASSETS

Three assets of the data flywheel

Execution traces — the raw record

Leave the agent's rationale, choices, handoffs, and outcomes as structured logs. The primary data operation produces — institutional memory that survives a model swap.

Captures
What was decided and why, which module was chosen on what grounds, what failed.
Common failure
Kept in a non-searchable form, a trace is just buried data, not institutional memory.

Evaluation data — the refined yardstick

Build an eval set from traces against business-outcome criteria. A yardstick that measures our results, not an external benchmark.

Captures
Our own answers and labels for what is good and bad, capability and regression eval cases.
Common failure
Watch only benchmarks and you cannot tell if your product actually improves. If the yardstick is external, you have no data sovereignty.

Learning signal — the fuel for improvement

Feed human feedback, outcome labels, and approval records back as signals that improve the next loop. The force that turns the flywheel.

Captures
What passed or was rejected, where a human intervened, which path earned margin.
Common failure
Without feeding signals back, the loop merely persists and does not improve — data leaks out and only the model gets swapped.
SOVEREIGNTY

Data sovereignty and the flywheel

Data is the only resource that makes the loop better. Money runs the loop, tokens compute, people operate — but what makes the next turn better than the last is the data operation accumulates. Operation → trace accumulation → measurement by evaluation → improvement by learning signal → better operation. The more this cycle turns, the more our own data compounds.

So data must live outside the model, as our asset. The model is a replaceable part; swap it and the traces, eval sets, and learning signals remain — that is data sovereignty. This data is the layer that lays our organization's own context on top of a generalist model, and the moat hardest for competitors to copy.

LADDER

Data maturity ladder — the flywheel over the evolution stages

Data assets do not accumulate at once. They mature with the evolution stages — from log collection to a self-improving loop.

Evolution stageData maturity
1. Single-Agent StartLog collection — start leaving calls, outcomes, and failures.
2. Responsibility SeparationTrace structuring — make rationale and handoffs queryable.
3. Multi-Agent CollaborationEval set built — run a private eval on business-outcome criteria.
4. Governance by DesignSelf-improving loop — learning signals lift the next loop automatically.