Data Flow

Agent Session

How an agent session flows from initial orientation through tool execution to memory persistence.

Overview

An agent session has three phases: Orient (deterministic context preparation), Work (tool execution loop), and Persist (memory and mechanical write-back). The Orient phase is performed by the Context Pipeline — a deterministic infrastructure component, not an agent — before any LLM tokens are consumed.

Phase 1: Orient (Context Pipeline)

The Context Pipeline is deterministic infrastructure — not an agent and not an LLM call. It prepares context in two steps before any tokens are consumed:

Step 1 — Select (~100ms, 0 tokens)

The pipeline executes three parallel queries against the workspace:

FTS5 full-text search against the prompt
Semantic embedding search (nearest neighbours in the embedding index)
Wiki-link graph traversal from recently accessed pages

The output is a ranked set of candidate context items (pages, blocks, memory entries).

Step 2 — Refine (~500ms, cheap model)

The Refinement Gate uses a small, cheap LLM to rank and trim candidates to fit the context window. It produces the final SessionContext handed to the Orchestrator.

Three Select Verbs

The Context Pipeline is activated by three verbs, each with different selection strategies:

Verb	Used By	Strategy
select	Pipeline	Deterministic retrieval (FTS5 + embeddings + links)
execute	Worker	Direct tool call with known parameters
investigate	Researcher	Iterative exploration across multiple tool calls

Phase 2: Work (Tool Execution Loop)

The Orchestrator delegates tasks to Worker or Researcher process types based on task structure. All tool execution is permission-gated:

All tools are visible in the LLM schema (the model can see every tool)
Enforcement happens at execution time via PermissionGuard::require(capability)
A denied capability returns a structured error — the agent does not crash

Process Types

Type	Role
Orchestrator	Decomposes goals into tasks; delegates to Workers/Researchers
Researcher	Investigates via iterative tool calls; returns structured findings
Worker	Executes discrete tool calls with known parameters
Skill Composer	Assembles and executes skill artifacts (prompt templates, DSPy modules)

Phase 3: Persist

After the work loop completes, the Orchestrator triggers memory persistence. Persist is:

Background — Does not block the user-facing response
Best-effort — Persistence failures are logged but do not fail the session
Tier-aware — Observations persist at the appropriate memory tier (Conversation, Channel, Workspace, or Account)

Memory entries are written via MemoryRepository with an LLM-assigned importance score in [0.0, 1.0]. The AgentMemoryFacade coordinates writes across tiers.

Key Properties

Property	Value
Context Pipeline latency	~100ms (select) + ~500ms (refine)
Context Pipeline tokens	0 (select phase is token-free)
Permission model	All tools visible; enforcement at execution via `PermissionGuard`
Memory persistence	Background, best-effort; does not block user response
Session storage	Conversation-scoped `MemoryRepository` + `ScratchpadRepository`

Agent Core System — Session management and process model
Agent Memory System — 4-tier hierarchy and decay model
Conversation System — Conversation lifecycle and scratchpad
Process Model — Orchestrator / Researcher / Worker / Skill Composer
Permission System — Capability-based access control
docs/ADR/009-agent-harness-architecture.md — Harness architecture decision record (developer docs)

Previous
Overview Next
Authentication

Was this page helpful?