Architecture Overview

Version: 1.0 | Date: 2026-03-25 | Author: Abdelaziz Abdelrasol

Executive Summary

orchcore is a reusable Python package (>= 3.12, asyncio-first) that provides orchestration infrastructure for launching, monitoring, and managing multiple AI coding agent CLIs through phase-based pipelines. It is not an agent itself — it orchestrates external agent CLIs (Claude Code, Codex, Gemini, Copilot, OpenCode) as subprocesses.

The package was extracted by analyzing four production orchestration systems — Planora, Articles, Finvault, and Raven/Ralph — identifying the common infrastructure patterns (60-70% of code) that recurred across all four, and packaging them into a single reusable library. Domain-specific logic (prompt content, output interpretation, presentation) stays in each consuming project; orchcore provides the "how" of orchestration.

Key Architectural Decisions

Protocol-based UI decoupling via UICallback so any project can plug its own CLI, TUI, or headless output (ADR-003)
Composable four-stage stream processing pipeline (Filter, Parse, Monitor, Stall Detect) that normalizes the wildly different JSONL formats across agent CLIs into a unified event model (ADR-004)
Registry-as-data, where agent configurations are defined via TOML/dict rather than hardcoded classes, making the system extensible without code changes (ADR-007)

System Context (C4 Level 1)

C4Context
    title System Context — orchcore

    Person(dev, "Developer", "Uses consuming project CLIs")

    System(orchcore, "orchcore", "Reusable orchestration core")

    System_Ext(claude, "Claude Code CLI", "Anthropic")
    System_Ext(codex, "Codex CLI", "OpenAI")
    System_Ext(gemini, "Gemini CLI", "Google")
    System_Ext(copilot, "Copilot CLI", "GitHub/Microsoft")
    System_Ext(opencode, "OpenCode CLI", "Open-source")

    System(planora, "Planora", "Multi-agent planning")
    System(articles, "Articles", "Article writing pipeline")
    System(finvault, "Finvault", "Performance audit")
    System(raven, "Raven/Ralph", "Task-driven development")

    Rel(dev, planora, "Uses")
    Rel(dev, articles, "Uses")
    Rel(dev, finvault, "Uses")
    Rel(dev, raven, "Uses")

    Rel(planora, orchcore, "Imports")
    Rel(articles, orchcore, "Imports")
    Rel(finvault, orchcore, "Imports")
    Rel(raven, orchcore, "Imports")

    Rel(orchcore, claude, "Launches as subprocess")
    Rel(orchcore, codex, "Launches as subprocess")
    Rel(orchcore, gemini, "Launches as subprocess")
    Rel(orchcore, copilot, "Launches as subprocess")
    Rel(orchcore, opencode, "Launches as subprocess")

Package Layout

src/orchcore/
├── stream/          # 4-stage JSONL processing pipeline
│   ├── events.py    # StreamEvent, StreamFormat, AgentState models
│   ├── filter.py    # StreamFilter — pre-parse noise reduction
│   ├── parser.py    # StreamParser — format-specific JSONL → StreamEvent
│   ├── monitor.py   # AgentMonitor — real-time state tracking
│   └── stall.py     # StallDetector — timeout detection
├── pipeline/        # Phase orchestration engine
│   ├── phase.py     # Phase, PhaseResult, PipelineResult models
│   ├── engine.py    # PhaseRunner — per-phase execution
│   ├── pipeline.py  # PipelineRunner — cross-phase coordination
│   └── control.py   # Control flow utilities
├── runner/          # Async subprocess management
│   └── subprocess.py  # AgentRunner
├── registry/        # Agent configuration
│   ├── agent.py     # AgentConfig, AgentMode, ToolSet models
│   └── registry.py  # AgentRegistry — TOML/dict lookup
├── config/          # Layered configuration
│   ├── settings.py  # OrchcoreSettings, load_settings_with_profile
│   └── schema.py    # AgentOverrideConfig
├── recovery/        # Rate-limit & error recovery
│   ├── rate_limit.py  # RateLimitDetector, ResetTimeParser
│   ├── retry.py     # RetryPolicy, FailureMode, BackoffStrategy
│   └── git_recovery.py  # GitRecovery
├── workspace/       # Artifact lifecycle
│   └── manager.py   # WorkspaceManager
├── prompt/          # Jinja2 templates
│   ├── template.py  # render_template, render_string, strip_frontmatter
│   └── loader.py    # TemplateLoader
├── display/         # ANSI colored logging (no Rich)
│   ├── logging.py   # log_info, log_error, status_line, phase_header
│   └── formatting.py  # format_cost, format_duration, format_tokens
├── ui/              # UICallback protocol
│   └── callback.py  # UICallback, NullCallback, LoggingCallback
├── signals/         # Graceful shutdown
│   └── handler.py   # SignalManager
├── observability/   # Optional OpenTelemetry
│   └── telemetry.py # OrchcoreTelemetry
└── __init__.py

Component Diagram

┌─────────────────────────────────────────────────────────────────┐
│                      Consuming Project                          │
│  (Custom UICallback, Phase Definitions, Prompt Templates, TOML) │
└──────────────────────────────┬──────────────────────────────────┘
                               │
┌──────────────────────────────▼──────────────────────────────────┐
│                          orchcore                                │
│                                                                  │
│  ┌────────────┐  ┌────────────┐  ┌────────────┐  ┌───────────┐ │
│  │  pipeline/  │  │   runner/  │  │  registry/  │  │  config/  │ │
│  │ DAG phases  │─▶│ subprocess │─▶│ agent TOML  │  │  layered  │ │
│  │ seq/parallel│  │  async I/O │  │   lookup    │  │  settings │ │
│  └─────┬──────┘  └─────┬──────┘  └─────────────┘  └───────────┘ │
│        │               │                                         │
│  ┌─────▼───────────────▼──────────────────────────────────────┐ │
│  │                    stream/ (4-stage pipeline)               │ │
│  │  JSONL ─▶ Filter ─▶ Parse ─▶ Monitor ─▶ Stall Detect      │ │
│  └────────────────────────────────────────────────────────────┘ │
│                                                                  │
│  ┌──────────┐ ┌──────────┐ ┌─────────┐ ┌──────────┐ ┌────────┐│
│  │recovery/ │ │workspace/│ │ prompt/ │ │ signals/ │ │  ui/   ││
│  │rate-limit│ │ artifact │ │ Jinja2  │ │ graceful │ │Protocol││
│  │retry,git │ │lifecycle │ │templates│ │ shutdown │ │callback││
│  └──────────┘ └──────────┘ └─────────┘ └──────────┘ └────────┘│
└──────────────────────────────────────────────────────────────────┘

Core Abstractions

Pipeline Execution Model

orchcore uses a two-level execution model:

PipelineRunner — coordinates multiple phases in dependency order. Handles resume, skip, and only-phase options.
PhaseRunner — executes a single phase. Launches agents sequentially or in parallel via AgentRunner, enforces concurrency limits, and aggregates results.

sequenceDiagram
    participant C as Consumer
    participant PR as PipelineRunner
    participant PhR as PhaseRunner
    participant AR as AgentRunner
    participant A as Agent CLI

    C->>PR: run_pipeline(phases, prompts, callback)
    loop each phase in dependency order
        PR->>PhR: run_phase(phase, prompt, callback)
        alt parallel phase
            par for each agent
                PhR->>AR: run(agent, prompt, output_path)
                AR->>A: subprocess launch
                A-->>AR: JSONL stream
                AR-->>PhR: AgentResult
            end
        else sequential phase
            loop for each agent
                PhR->>AR: run(agent, prompt, output_path)
                AR->>A: subprocess launch
                A-->>AR: JSONL stream
                AR-->>PhR: AgentResult
            end
        end
        PhR-->>PR: PhaseResult
    end
    PR-->>C: PipelineResult

Stream Processing Pipeline

Every line of JSONL output passes through four composable stages:

StreamFilter — fast-path string matching drops ~95% of noise before json.loads()
StreamParser — format-specific parsers produce normalized StreamEvent instances
AgentMonitor — 9-state machine tracks agent lifecycle, tools, cost, tokens
StallDetector — injects synthetic STALL events after configurable timeout

See Stream Pipeline for the deep-dive.

Tool Resolution Order

Tools available to an agent within a phase are resolved via a layered lookup:

Phase.agent_tools[agent]  >  explicit toolset  >  Phase.tools  >  AgentConfig.flags[mode]  >  defaults

See ADR-009: Tool assignment as phase-level concern.

Design Principles

Principle	How It's Applied
Composability	Each of 12 components usable independently
Extensibility	New agents via TOML config alone (zero code changes)
Reliability	Graceful degradation, configurable partial failure semantics
Performance	< 5ms per event, < 100ms subprocess launch
Type Safety	mypy strict with Pydantic validation at boundaries
Async-First	Pure stdlib asyncio, TaskGroup for structured concurrency
Protocol-Based DI	UICallback is a Protocol, not a base class
Registry-as-Data	Agents defined via TOML/dict, not hardcoded classes

Quality Attributes

Priority	Attribute	Target
1	Extensibility	New agent CLI via TOML only
2	Reliability	Graceful degradation under partial failure
3	Composability	Each component usable standalone
4	Performance	Stream < 5ms/event, launch < 100ms
5	Type Safety	mypy strict, zero errors

Constraints

Python >= 3.12 — TaskGroup, tomllib, modern type syntax
asyncio only — no trio, gevent, or threading
Core deps — pydantic >= 2.10, pydantic-settings >= 2.7, jinja2 >= 3.1
No agent API keys — agents manage their own authentication
POSIX signals — SIGINT/SIGTERM handling (Windows is not first-class in v1.0)

Design Document — problem statement, requirements, proposed design
Stream Pipeline — 4-stage pipeline deep-dive
Architecture Decision Records — all 9 ADRs