Hub
Harness Engineering
The reference hub for harness engineering: the discipline of building the scaffolding — tools, memory, context, control loops, guardrails and evaluation — that turns raw model capability into reliable agentic systems.
Harness engineering is the practice of designing, building and optimizing the scaffolding (tools, memory, prompts, environment and control loop) that turns a model's raw capability into reliable, goal-directed action.
- The harness is everything around the model that converts capability into action.
- As frontier models converge in capability, the harness becomes the main lever of differentiation.
- Tool design, context management and memory often matter more than model choice.
- Harnesses must be observable and evaluated — you cannot improve what you cannot measure.
- Harness engineering is to agents what platform engineering is to cloud applications.
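To make the definition concrete, here is a minimal sketch of a harness in Python. It is an illustration under stated assumptions, not a reference implementation: the `Harness` class, the `(tool_name, argument)` action format, the `"finish"` convention and the `stub_model` function are all hypothetical names invented for this example, and the stub stands in for a real model call.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical types: an action is (tool_name, argument); the tool name
# "finish" ends the run. None of these names come from a real library.
Action = Tuple[str, str]
Model = Callable[[List[str]], Action]


@dataclass
class Harness:
    model: Model                                        # the raw capability
    tools: Dict[str, Callable[[str], str]]              # what the model may call
    max_steps: int = 5                                  # guardrail: bounded loop
    memory: List[str] = field(default_factory=list)     # running context
    trace: List[Action] = field(default_factory=list)   # observability log

    def run(self, goal: str) -> str:
        """Control loop: ask the model, run the chosen tool, record the result."""
        self.memory.append(f"goal: {goal}")
        for _ in range(self.max_steps):
            action = self.model(self.memory)
            self.trace.append(action)
            name, arg = action
            if name == "finish":
                return arg
            tool = self.tools.get(name)
            if tool is None:
                result = f"error: unknown tool {name!r}"
            else:
                result = tool(arg)
            self.memory.append(f"{name}({arg}) -> {result}")
        return "stopped: step limit reached"


def stub_model(context: List[str]) -> Action:
    """Stand-in for a real model: calls 'add' once, then finishes."""
    last = context[-1]
    if last.startswith("goal:"):
        return ("add", "2 3")
    return ("finish", last.split(" -> ")[-1])


def add(arg: str) -> str:
    """Toy tool: sums whitespace-separated integers."""
    return str(sum(int(x) for x in arg.split()))


harness = Harness(model=stub_model, tools={"add": add})
answer = harness.run("add 2 and 3")
```

Even in this toy form, the components named above are separable: swapping `stub_model` for another model leaves the tools, memory, step limit and trace untouched, which is why the harness can be engineered and evaluated independently of the model.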
What this hub covers
The map below is being built out unit by unit. Live areas link to a knowledge unit; the rest are in progress.
- Definition: Live
- History: In progress
- Taxonomy: In progress
- Components: Live
- Observability: Live
- Memory systems: Live
- Tooling: Live
- Governance: Live
- Evaluation: Live
- Benchmarks: Live
- Case studies: In progress
- Papers: Live