DATA
The third face of the environment is "does the loop get better." If money keeps it alive and tokens run it, data makes the next turn better than the last.
Does the loop improve?
Persisting and improving are different. Money, tokens, people, and validation keep the loop running, but for it to get smarter over time you must feed back the data the operation produces. A loop without data merely endures; it does not grow.
Three assets of the data flywheel
Execution traces — the raw record
Leave the agent's rationale, choices, handoffs, and outcomes as structured logs. The primary data operation produces — institutional memory that survives a model swap.
- What was decided and why, which module was chosen on what grounds, what failed.
- Kept in a non-searchable form, a trace is just buried data, not institutional memory.
Evaluation data — the refined yardstick
Build an eval set from traces against business-outcome criteria. A yardstick that measures our results, not an external benchmark.
- Our own answers and labels for what is good and bad, capability and regression eval cases.
- Watch only benchmarks and you cannot tell if your product actually improves. If the yardstick is external, you have no data sovereignty.
Learning signal — the fuel for improvement
Feed human feedback, outcome labels, and approval records back as signals that improve the next loop. The force that turns the flywheel.
- What passed or was rejected, where a human intervened, which path earned margin.
- Without feeding signals back, the loop merely persists and does not improve — data leaks out and only the model gets swapped.
Data sovereignty and the flywheel
Data is the only resource that makes the loop better. Money runs the loop, tokens compute, people operate — but what makes the next turn better than the last is the data operation accumulates. Operation → trace accumulation → measurement by evaluation → improvement by learning signal → better operation. The more this cycle turns, the more our own data compounds.
So data must live outside the model, as our asset. The model is a replaceable part; swap it and the traces, eval sets, and learning signals remain — that is data sovereignty. This data is the layer that lays our organization's own context on top of a generalist model, and the moat hardest for competitors to copy.
Data maturity ladder — the flywheel over the evolution stages
Data assets do not accumulate at once. They mature with the evolution stages — from log collection to a self-improving loop.
| Evolution stage | Data maturity |
|---|---|
| 1. Single-Agent Start | Log collection — start leaving calls, outcomes, and failures. |
| 2. Responsibility Separation | Trace structuring — make rationale and handoffs queryable. |
| 3. Multi-Agent Collaboration | Eval set built — run a private eval on business-outcome criteria. |
| 4. Governance by Design | Self-improving loop — learning signals lift the next loop automatically. |