Todos os capítulos
HRN-013ReferenceStatus: Draft · Atualizado Sun Jun 21 2026 00:00:00 GMT+0000 (Coordinated Universal Time)

Glossary

A canonical glossary of Harness Engineering and agentic-systems terminology—harness, agent, tool, orchestration, evaluation, span, RAG, MCP, guardrail, and more—with crisp, citable definitions.

Evidência: TeóricoConfiança: AltaFonte: Observação do setor

Este capítulo é redigido em inglês; as versões localizadas estão em andamento.

Glossary

Executive Summary

This glossary is the canonical reference for terms used across the Harness Engineering handbook and the wider Santismm Knowledge Platform. Definitions are crisp, machine-extractable, and intended to be cited directly. Where a term has a dedicated chapter, the entry points to it.

Definition

The following are the canonical definitions of terms used throughout this corpus. Terms are grouped for readability; within groups they are roughly ordered from foundational to specialized.

Core concepts

  • Agent: a system that uses a language model in a loop to pursue a goal, deciding which actions (tool calls) to take based on observations until a stopping condition is met.
  • Agentic system: a software system whose behavior is driven by one or more agents, including all the surrounding scaffolding required to make them reliable.
  • Harness: the engineered scaffolding around a model — memory, tools, planning, orchestration, observability, evaluation, governance, and security — that turns a raw model into a reliable agentic system.
  • Harness Engineering: the discipline responsible for building reliable agentic systems for enterprise environments by designing and operating the harness.
  • Model / LLM: the underlying large language model that performs reasoning and generation; in the harness it is treated as a powerful but non-deterministic, manipulable component.
  • Autonomy gradient: the spectrum of control modes from fully autonomous to human-in-the-loop, assigned per action class by policy.
  • Blast radius: the maximum harm an action (or a compromised agent) can cause; a core quantity to bound.

Tools and actions

  • Tool: a function or capability the agent can invoke to observe or affect the world (search, code execution, API call, database query).
  • Tool call / function call: a structured request from the model to invoke a tool with arguments.
  • Effector: a tool that produces a side effect in an external system (sends, writes, transfers).
  • Idempotency: the property that performing an operation multiple times has the same effect as performing it once; essential for safe retries and resumes.
  • Side effect: an externally visible change produced by a tool call.
  • Allowlist: an explicit set of permitted items (tools, egress destinations); the safe default for security-sensitive choices.

Memory and context

  • Context window: the bounded span of tokens the model can attend to in a single inference.
  • Memory: the harness layer that persists and retrieves information across steps and sessions (short-term, long-term, episodic, semantic).
  • RAG (Retrieval-Augmented Generation): supplying the model with relevant retrieved content at inference time so its output is grounded in a corpus rather than parametric memory alone.
  • Embedding: a vector representation of text used for semantic similarity search in retrieval.
  • Vector store: a database optimized for nearest-neighbor search over embeddings.
  • Grounding: anchoring generated claims in retrieved, citable evidence.
  • Context engineering: the practice of deciding precisely what information enters the context window for each step.

Planning

  • Goal: the desired end-state with explicit success criteria.
  • Plan: an ordered or partially-ordered set of tasks expected to achieve a goal.
  • Decomposition: breaking a goal into sub-tasks (see PAT-010, HRN-009).
  • DAG (Directed Acyclic Graph): a plan representation that captures task dependencies and exposes parallelism.
  • Replanning: revising a plan in response to failure, new information, or changed constraints.
  • Termination criteria: the budgets and conditions (steps, time, cost) that stop an agent safely.

Orchestration

  • Orchestration: the harness layer that drives execution across agents and tools (see HRN-010).
  • Topology: the arrangement of agents — single, pipeline, supervisor/worker, or network.
  • Supervisor (orchestrator) agent: an agent that plans and delegates to worker agents (see PAT-002).
  • Worker agent: a specialized agent that executes a delegated sub-task (see PAT-005).
  • Routing: selecting the next agent, tool, or branch based on state.
  • Handoff: transfer of control and context from one agent to another.
  • State machine: an explicit graph of states and governed transitions controlling execution.
  • Durable execution: workflow semantics where progress is checkpointed and resumable across failures.
  • Saga: a sequence of operations with compensating actions to undo partial work on failure.

Governance and security

  • Governance: the runtime harness layer enforcing policy, approvals, and guardrails (see HRN-008).
  • Policy-as-code: governance rules expressed in a declarative, versioned, testable format.
  • PEP / PDP: Policy Enforcement Point (intercepts actions) and Policy Decision Point (evaluates policy).
  • Guardrail: a runtime check on inputs or outputs that constrains agent behavior.
  • Approval gate: a control that suspends execution pending a human or higher-authority decision (see PAT-001).
  • Least privilege: granting each agent the minimum permissions required for its task.
  • Agent identity: a distinct, attributable, scoped, revocable principal for each agent.
  • Prompt injection: untrusted content that hijacks an agent's instructions or goals.
  • Indirect prompt injection: injection delivered via data the agent retrieves.
  • Data exfiltration: unauthorized egress of sensitive data through agent outputs or tool arguments.
  • Lethal trifecta: the dangerous combination of private-data access, untrusted-content exposure, and external communication ability.
  • Sandbox: an isolated, resource- and network-constrained environment for executing untrusted tools/code.
  • DLP (Data Loss Prevention): controls that detect and block sensitive data in outbound payloads.

Observability and evaluation

  • Observability: the harness layer that makes agent behavior inspectable via traces, logs, and metrics (see HRN-006).
  • Trace: the end-to-end record of a single agent run.
  • Span: a single timed unit of work within a trace (one tool call, one model call), the building block of distributed tracing.
  • Evaluation (eval): the systematic measurement of agent quality, safety, and reliability (see HRN-007).
  • LLM-as-judge: using a model to score another model's outputs against a rubric.
  • Groundedness: the degree to which generated claims are supported by provided evidence.
  • Regression suite: a fixed set of evaluation cases run on every change to catch quality regressions.
  • Eval-driven development: building and changing agents against a measurable evaluation harness.

Protocols and standards

  • MCP (Model Context Protocol): an open protocol for connecting models/agents to tools and data sources through a standardized interface.
  • Tool schema: the typed declaration of a tool's name, arguments, and description that the model uses to call it.
  • llms.txt: a proposed convention for a site-level Markdown file that surfaces a curated, agent-friendly index of content.
  • JSON-LD: structured data format used to make documents machine-readable in the discovery layer.
  • NIST AI RMF / ISO 42001 / EU AI Act: the principal AI risk, management-system, and regulatory frameworks an enterprise harness must satisfy (see HRN-014, GOV-001).

FAQs

Q: Where do I cite a definition from? A: Cite the term by name plus HRN-013. Where a term has a dedicated chapter, prefer citing that chapter for depth.

Q: A term I need is missing — what do I do? A: Add it here in the appropriate group with a crisp one-sentence definition, and link the dedicated chapter if one exists.

Relacionado