ADR-035: LLM Provider Abstraction Architecture

Status: Accepted
Version: 1.0
Date: 2025-12-30
Supersedes: N/A
Related ADRs: ADR-028 (GovernedSpeed™ LLMOps)
Related PRDs: PRD-010 (AI Governance Runtime)


Context

SEA™ requires integration with multiple LLM providers to support:

  1. Local development — Ollama for offline, cost-free iteration
  2. Production workloads — OpenAI and Anthropic for production-quality inference
  3. Cost optimization — OpenRouter for model arbitrage and fallbacks
  4. Governance compliance — All LLM calls must route through Policy Gateway (SDS-047)

Forces at play:

  1. Governance vs. velocity: every LLM call must route through the Policy Gateway (SDS-047) without slowing local iteration
  2. Flexibility vs. lock-in: model quality, pricing, and availability shift faster than provider-specific integrations can be maintained
  3. Local-first vs. production parity: offline development against Ollama must exercise the same code path as cloud providers
  4. Cost vs. quality: production inference spend must be tunable through fallbacks and model arbitrage, without code changes

Key insight: A unified LLM abstraction layer eliminates provider lock-in while maintaining governance compliance and enabling local-first development.

Decision

Adopt LiteLLM as the canonical LLM provider abstraction layer, exposing a single LlmProviderPort interface for all AI interactions.

Core Principles

  1. Unified API — All LLM calls use LiteLLM’s OpenAI-compatible interface
  2. Provider Agnostic — Configuration-driven provider switching (no code changes)
  3. Policy Gateway Integration — LiteLLM proxies through SDS-047 for governance
  4. Local-First — Ollama as default development provider
  5. Fallback Chains — Automatic failover (e.g., Anthropic → OpenAI → Ollama); see the configuration sketch below
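
A minimal sketch of principles 2 and 5, assuming a hypothetical ProviderConfig shape read from the LLM_* environment variables defined later in this ADR; the names are illustrative, not part of the spec.

```typescript
// Hypothetical config shape; field names are illustrative, not mandated by this ADR.
export interface ProviderConfig {
  provider: 'ollama' | 'openai' | 'anthropic' | 'openrouter';
  model: string;
  fallbackModels: string[];
  policyGatewayUrl?: string;
}

// Resolve the active provider purely from environment variables (principle 2):
// switching providers is a configuration change, never a code change.
export function loadProviderConfig(env: NodeJS.ProcessEnv = process.env): ProviderConfig {
  return {
    provider: (env.LLM_PROVIDER ?? 'ollama') as ProviderConfig['provider'],
    model: env.LLM_MODEL ?? 'llama3.2',
    // Comma-separated fallback chain, e.g. "gpt-4o,claude-3-sonnet" (principle 5).
    fallbackModels: (env.LLM_FALLBACK_MODELS ?? '')
      .split(',')
      .map((m) => m.trim())
      .filter(Boolean),
    policyGatewayUrl: env.LLM_POLICY_GATEWAY_URL,
  };
}
```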

Supported Providers

| Provider | Use Case | Configuration |
|----------|----------|---------------|
| Ollama | Local development, air-gapped environments | ollama/llama3.2 |
| OpenAI | Production inference, embeddings | gpt-4o, text-embedding-3-small |
| Anthropic | Production inference, long context | claude-3-5-sonnet-20241022 |
| OpenRouter | Cost optimization, model diversity | openrouter/anthropic/claude-3-opus |

Reference Architecture

┌─────────────────────────────────────────────────────────────────┐
│  SEA™ LLM Provider Architecture                                  │
├─────────────────────────────────────────────────────────────────┤
│                                                                 │
│  ┌──────────────────┐                                           │
│  │ Cognitive Service│                                           │
│  │ (Artifact Engine)│                                           │
│  └────────┬─────────┘                                           │
│           │ uses                                                │
│           ▼                                                     │
│  ┌──────────────────┐                                           │
│  │ LlmProviderPort  │ ◄── Hexagonal Port (Interface)            │
│  └────────┬─────────┘                                           │
│           │ implements                                          │
│           ▼                                                     │
│  ┌──────────────────┐    ┌────────────────────┐                 │
│  │ LiteLLMAdapter   │───►│ Policy Gateway     │ (SDS-047)       │
│  │ (Production)     │    │ (PII, Jailbreak)   │                 │
│  └────────┬─────────┘    └─────────┬──────────┘                 │
│           │                        │                            │
│           ▼                        ▼                            │
│  ┌─────────────────────────────────────────────────────────┐    │
│  │                    LiteLLM Router                       │    │
│  │  ┌─────────┐  ┌─────────┐  ┌─────────┐  ┌────────────┐  │    │
│  │  │ Ollama  │  │ OpenAI  │  │Anthropic│  │ OpenRouter │  │    │
│  │  └─────────┘  └─────────┘  └─────────┘  └────────────┘  │    │
│  └─────────────────────────────────────────────────────────┘    │
│                                                                 │
│  Testing:                                                       │
│  ┌──────────────────┐                                           │
│  │ FakeLlmAdapter   │ ◄── Deterministic responses for tests     │
│  └──────────────────┘                                           │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘
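
The adapter's request path can be sketched as follows, assuming the Policy Gateway (SDS-047) fronts a LiteLLM proxy that speaks the OpenAI-compatible chat completions API at the LLM_POLICY_GATEWAY_URL base path; the function name and option shape are illustrative.

```typescript
// Illustrative sketch: the adapter never imports a vendor SDK. It posts an
// OpenAI-compatible request to the Policy Gateway, which proxies to LiteLLM.
export async function complete(
  prompt: string,
  opts: { gatewayUrl: string; model: string; apiKey?: string },
): Promise<string> {
  const res = await fetch(`${opts.gatewayUrl}/chat/completions`, {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      ...(opts.apiKey ? { Authorization: `Bearer ${opts.apiKey}` } : {}),
    },
    body: JSON.stringify({
      model: opts.model, // e.g. "ollama/llama3.2" or "claude-3-5-sonnet-20241022"
      messages: [{ role: 'user', content: prompt }],
    }),
  });
  if (!res.ok) {
    throw new Error(`LLM call failed: ${res.status} ${res.statusText}`);
  }
  const body = (await res.json()) as {
    choices: Array<{ message: { content: string } }>;
  };
  return body.choices[0]?.message?.content ?? '';
}
```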

Rationale

  1. LiteLLM over raw SDKs: LiteLLM supports 100+ providers behind a single OpenAI-compatible interface and ships retry logic, fallbacks, and observability, eliminating per-provider adapter maintenance
  2. Policy Gateway routing: All LLM calls route through SDS-047 for PII detection, jailbreak prevention, and audit logging per ADR-028
  3. Ollama for local-first: Developers iterate without API keys or cloud costs; same code path as production
  4. OpenRouter for flexibility: Access to 100+ models through a single API key, with cost optimization and provider redundancy
  5. Port/Adapter pattern: LlmProviderPort enables testing with FakeLlmAdapter for deterministic unit tests (both sketched below)
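
To make point 5 concrete, here is a hedged sketch of the port and its test double, following the file paths listed under Isomorphic Guarantees; the exact method surface is illustrative.

```typescript
// libs/llm-provider/src/ports/llm-provider.port.ts (illustrative surface)
export interface CompletionRequest {
  prompt: string;
  model?: string;     // overrides the configured LLM_MODEL when set
  maxTokens?: number;
}

export interface LlmProviderPort {
  complete(request: CompletionRequest): Promise<string>;
}

// libs/llm-provider/src/adapters/fake.adapter.ts (illustrative)
// Deterministic responses keyed by prompt, so unit tests never touch the network.
export class FakeLlmAdapter implements LlmProviderPort {
  constructor(private readonly responses: Record<string, string> = {}) {}

  async complete(request: CompletionRequest): Promise<string> {
    return this.responses[request.prompt] ?? 'fake-completion';
  }
}
```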

Constraints (MUST/MUST NOT)

These constraints are critical for generator choices and flow directly into manifests and SEA-DSL.

Isomorphic Guarantees

Defines structure-preserving mappings from this ADR to implementation.

| Spec Concept | Implementation Target | Mapping Rule |
|--------------|------------------------|--------------|
| LlmProviderPort | libs/llm-provider/src/ports/llm-provider.port.ts | 1:1 interface |
| LiteLLMAdapter | libs/llm-provider/src/adapters/litellm.adapter.ts | 1:1 implementation |
| FakeLlmAdapter | libs/llm-provider/src/adapters/fake.adapter.ts | 1:1 test double |
| ProviderConfig | Environment variables LLM_* | 1:1 config mapping |

System Invariants

Non-negotiable truths that must hold across the system.

| INV-ID | Invariant | Type | Enforcement |
|--------|-----------|------|-------------|
| INV-LLM-001 | All production LLM calls route through Policy Gateway | System | HTTPS proxy config |
| INV-LLM-002 | LlmProviderPort is the only LLM interface | System | Nx module boundaries |
| INV-LLM-003 | Ollama available for local development | Process | Docker Compose config |
| INV-LLM-004 | All LLM calls emit OpenTelemetry spans | System | Adapter instrumentation |
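
As one hedged way to satisfy INV-LLM-004, the adapter can wrap every provider call in an OpenTelemetry span; the span and attribute names below are illustrative, not a mandated convention.

```typescript
import { trace, SpanStatusCode } from '@opentelemetry/api';

const tracer = trace.getTracer('llm-provider');

// Wrap any provider call so that every completion emits a span (INV-LLM-004).
export async function withLlmSpan<T>(model: string, call: () => Promise<T>): Promise<T> {
  return tracer.startActiveSpan('llm.completion', async (span) => {
    span.setAttribute('llm.model', model); // illustrative attribute key
    try {
      return await call();
    } catch (err) {
      span.recordException(err as Error);
      span.setStatus({ code: SpanStatusCode.ERROR });
      throw err;
    } finally {
      span.end();
    }
  });
}
```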

Quality Attributes

| Attribute | Target | Rationale |
|-----------|--------|-----------|
| Latency | <50ms overhead | LiteLLM adds minimal proxy overhead |
| Availability | 99.5% with fallbacks | Fallback chains ensure resilience |
| Testability | 100% mockable | FakeLlmAdapter for unit tests |
| Observability | Full trace coverage | OTel spans on all calls |

Bounded Contexts Impacted

Consequences

Benefits

Trade-offs


Configuration

Environment Variables

# Provider selection
LLM_PROVIDER=ollama           # ollama | openai | anthropic | openrouter
LLM_MODEL=llama3.2            # Model name per provider
LLM_FALLBACK_MODELS=gpt-4o,claude-3-sonnet  # Comma-separated fallback chain

# API keys (not needed for Ollama)
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
OPENROUTER_API_KEY=sk-or-...

# Policy Gateway (required for production)
LLM_POLICY_GATEWAY_URL=http://policy-gateway:8080/v1
LLM_BYPASS_GATEWAY=false      # Only true for local development
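
A hedged sketch of how the LLM_BYPASS_GATEWAY flag could be guarded so that INV-LLM-001 still holds outside local development; the NODE_ENV check is an assumption, not a mandated mechanism.

```typescript
// Illustrative guard: refuse to bypass the Policy Gateway unless we are clearly
// running in local development (INV-LLM-001). The NODE_ENV check is an assumption.
export function resolveGatewayUrl(env: NodeJS.ProcessEnv = process.env): string | null {
  const bypass = env.LLM_BYPASS_GATEWAY === 'true';
  if (bypass && env.NODE_ENV !== 'development') {
    throw new Error('LLM_BYPASS_GATEWAY=true is only permitted in local development');
  }
  if (bypass) {
    return null; // caller talks to the provider (e.g. local Ollama) directly
  }
  const url = env.LLM_POLICY_GATEWAY_URL;
  if (!url) {
    throw new Error('LLM_POLICY_GATEWAY_URL is required when the gateway is not bypassed');
  }
  return url;
}
```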

Docker Compose (Development)

services:
  ollama:
    image: ollama/ollama:latest
    ports:
      - "11434:11434"
    volumes:
      - ollama-models:/root/.ollama
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:11434/api/tags"]
      interval: 10s
      timeout: 5s
      retries: 3

volumes:
  ollama-models:
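
For local setups, a small hedged sketch that probes the same /api/tags endpoint the health check above uses before routing requests to Ollama; the base URL default and timeout are assumptions.

```typescript
// Probe the Ollama endpoint used by the Compose health check before selecting it.
export async function ollamaIsUp(baseUrl = 'http://localhost:11434'): Promise<boolean> {
  try {
    const res = await fetch(`${baseUrl}/api/tags`, {
      signal: AbortSignal.timeout(2_000), // arbitrary 2-second timeout
    });
    return res.ok;
  } catch {
    return false;
  }
}
```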

Success Criteria

Another developer can read this ADR and understand:

  1. The architectural guardrails for LLM provider integration
  2. Why LiteLLM was chosen over provider-specific SDKs
  3. How to configure providers for development vs production
  4. The isomorphic mappings that guarantee spec-to-code fidelity