The Softmax Router pattern treats attention mechanism mathematics as a coordination protocol for multi-agent workflows. Agents contribute bounded outputs that are weighted and aggregated like attention heads in a transformer.

Core Invariants

When translating transformer architecture to agent workflows, these invariants must be preserved:

Isomorphic Mapping

Router Agent

Invariant	Transformer Form	Agent Workflow Form
Shared State	Residual stream	`ContextSnapshot`
Bounded Contribution	Bounded vector update	Schema + size limits
Normalized Arbitration	Softmax ensures bounded influence	Router normalizes weights
Identity Stability	Token identities stable	ConceptId persistence
Admissible Transformations	Can’t invent operations	Defined tools per agent
Round-Based Iteration	Layers = refinement steps	Explicit round count

Transformer Component	Agent Workflow Equivalent
Token stream / residual	`ContextSnapshot` + working memory
Attention head	Specialist agent (bounded output)
Q/K/V matrices	Query / capability / payload schemas
Softmax routing	Router agent (relevance + weights)
Layer	One governance-checked round
FFN (MLP)	Synthesis agent
LayerNorm	Contract validation + normalization
Multi-head	Parallel specialist proposals

Responsibility: Determine which specialists are relevant and allocate attention budget.

Output Format

1
2
3
4
5
6
7
8
9
{
  "distribution": [
    { "agentId": "semantic-mapper", "weight": 0.35, "tokenBudget": 500 },
    { "agentId": "rule-analyst", "weight": 0.25, "tokenBudget": 300 },
    { "agentId": "arch-checker", "weight": 0.20, "tokenBudget": 200 },
    { "agentId": "retrieval-agent", "weight": 0.20, "tokenBudget": 200 }
  ],
  "toolPermissions": ["read-kgs", "query-sbvr"]
}

Design Rules

Round Execution Flow

1
2
3
4
5
6
7
8
9
10
11
12
┌─────────────────────────────────────────────────────────────┐
│ Round N                                                      │
├─────────────────────────────────────────────────────────────┤
│  1. Router receives ContextSnapshot + Query                  │
│  2. Router outputs weight distribution + permissions         │
│  3. Specialists execute in parallel (bounded outputs)        │
│  4. Aggregator combines weighted deltas                      │
│  5. Validator checks invariants                              │
│     - Pass: ContextSnapshot(N+1) committed                   │
│     - Block: Violation event, no commit                      │
│  6. If more rounds needed: goto Round N+1                    │
└─────────────────────────────────────────────────────────────┘

Configuration

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
router:
  agentId: "router-v1"
  model: "claude-sonnet"
  maxOutputTokens: 200

specialists:
  - agentId: "semantic-mapper"
    skill: "term-to-concept-binding"
    outputSchema: "schemas/semantic-mapper-output.json"
    maxOutputTokens: 500

aggregator:
  strategy: "weighted-merge"
  conflictResolution: "majority-vote"

validator:
  checks:
    - type: "schema"
    - type: "concept-id-persistence"
    - type: "sbvr-rules"
  onViolation: "block-and-emit"

execution:
  maxRounds: 5
  convergenceThreshold: 0.95

Want This?	Adjust This
Less hallucination	Stricter schemas, shorter deltas
More creativity	Loosen router scoring, add specialists
Explainability	Log router weights + provenance
Faster convergence	Reduce rounds, increase parallelism
Higher accuracy	Increase rounds, tighter validation

Mode	Cause	Mitigation
Unbounded output	Prose instead of structured deltas	Enforce schemas; clip violations
State explosion	State grows without clipping	Size limits; delta-only updates
Political arbitration	“Loudest agent wins”	Normalized weights
Identity drift	Concepts rename themselves	ConceptId enforcement
Infinite rounds	No convergence	Hard round limit

🎯 Softmax Router Pattern

Overview

Core Invariants

Isomorphic Mapping

Router Agent

Output Format

Design Rules

Round Execution Flow

Configuration

Tuning Guide

Failure Modes

🎯 Softmax Router Pattern

Overview

Core Invariants

Isomorphic Mapping

Router Agent

Output Format

Design Rules

Round Execution Flow

Configuration

Tuning Guide

Failure Modes

Related Specs