Architecture¶

Marionette is designed as a modular, scalable platform for orchestrating AI coding agents.

System Overview¶

DiagramText

flowchart TB
    subgraph Server["Server (Go)"]
        subgraph Core["Core Services"]
            SM[SessionMgr]
            TM[TaskMgr]
            RM[RunnerMgr]
            TuM[TunnelMgr]
            PM[ProviderMgr]
            WM[WorkspaceMgr]
            SaM[SandboxMgr]
        end

        subgraph Providers["Provider Registry"]
            Docker
            K8s[Kubernetes]
            E2B
            FC[Firecracker]
            Pool
        end

        Core --> Providers

        subgraph Endpoints["API Endpoints"]
            GRPC[":9090 gRPC<br/>marionette-agent (mTLS)"]
            API[":8080 Public<br/>CLI / Apps (API Key)"]
            Admin[":8081 Admin<br/>WebUI (Basic Auth)"]
        end

        Providers --> Endpoints

        DB[(PostgreSQL)]
        Endpoints --> DB
    end

    Agent1[marionette-agent] <-.->|mTLS| GRPC
    Agent2[marionette-agent] <-.->|mTLS| GRPC
    CLI[CLI / External Apps] <-.->|API Key| API
    WebUI[Admin WebUI] <-.->|Basic Auth| Admin

┌─────────────────────────────────────────────────────────────────────────────┐
│                              Server (Go)                                    │
│  ┌────────────────────────────────────────────────────────────────────┐     │
│  │                            Core                                    │     │
│  │  ┌────────────┐ ┌────────────┐ ┌────────────┐ ┌────────────┐       │     │
│  │  │ SessionMgr │ │  TaskMgr   │ │ RunnerMgr  │ │ TunnelMgr  │       │     │
│  │  └────────────┘ └────────────┘ └────────────┘ └────────────┘       │     │
│  │  ┌────────────┐ ┌────────────┐ ┌────────────┐                      │     │
│  │  │ ProviderMgr│ │WorkspaceMgr│ │ SandboxMgr │                      │     │
│  │  └─────┬──────┘ └─────┬──────┘ └────────────┘                      │     │
│  └────────┼──────────────┼────────────────────────────────────────────┘     │
│           │              │                                                  │
│  ┌────────▼──────────────▼────────────────────────────────────────────┐     │
│  │                    Provider Registry                               │     │
│  │  ┌────────┐ ┌────────────┐ ┌─────┐ ┌───────────┐ ┌──────┐          │     │
│  │  │ Docker │ │ Kubernetes │ │ E2B │ │Firecracker│ │ Pool │          │     │
│  │  └────────┘ └────────────┘ └─────┘ └───────────┘ └──────┘          │     │
│  └────────────────────────────────────────────────────────────────────┘     │
│           │                                                                 │
│  ┌────────▼───────────────────────────────────────────────────────────┐     │
│  │  :9090 gRPC   |<---- marionette-agent (mTLS)                       │     │
│  │  :8080 Public |<---- CLI / External Apps (API Key)                 │     │
│  │  :8081 Admin  |<---- Admin WebUI (Basic Auth)                      │     │
│  └────────────────────────────────────────────────────────────────────┘     │
│           │                                                                 │
│  ┌────────▼──────┐                                                          │
│  │    Store      │                                                          │
│  │  PostgreSQL   │                                                          │
│  └───────────────┘                                                          │
└─────────────────────────────────────────────────────────────────────────────┘

Components¶

Server¶

The central component that manages all orchestration:

Component	Responsibility
SessionMgr	Session lifecycle (create, suspend, resume, terminate)
TaskMgr	Task execution and retry logic
RunnerMgr	Runner registration and health monitoring
TunnelMgr	Port forwarding and streaming
ProviderMgr	Provider registration and runner spawning
WorkspaceMgr	Workspace storage and synchronization

Agent (marionette-agent)¶

Runs inside each Runner and:

Connects to the server via gRPC
Manages the AI coding agent (Claude Code, Codex, etc.)
Executes tasks in sandboxed environments
Streams logs and handles permission requests

CLI (mctl)¶

Command-line interface for:

Session and task management
Runner monitoring
Permission handling
Admin operations

Communication¶

Protocols¶

Connection	Protocol	Authentication
Agent ↔ Server	gRPC (bidirectional streaming)	mTLS / Runner Token
CLI ↔ Server	REST/HTTP	API Key
Admin ↔ Server	REST/HTTP	Basic Auth / Master Key
Browser ↔ Server	WebSocket / SSE	API Key / Tunnel Token

Agent Connection Model¶

Agents initiate connections to the server (not the reverse):

DiagramText

sequenceDiagram
    participant Agent as Agent (gRPC Client)
    participant Server as Server (gRPC Server)

    Agent->>Server: Connect()
    activate Server
    Server-->>Agent: Control Stream (tasks)
    Agent-->>Server: Event Stream (logs, permissions)
    deactivate Server

┌──────────────────┐                    ┌──────────────────┐
│      Agent       │                    │      Server      │
│                  │                    │                  │
│  ┌────────────┐  │     Connect()      │  ┌────────────┐  │
│  │ gRPC Client├──┼───────────────────►│  │ gRPC Server│  │
│  └────────────┘  │                    │  └────────────┘  │
│                  │◄─── Control ───────│                  │
│                  │     (tasks)        │                  │
│                  │                    │                  │
│                  │──── Events ───────►│                  │
│                  │     (logs, perms)  │                  │
└──────────────────┘                    └──────────────────┘

This design:

Avoids firewall issues (like GitHub Actions runners)
Enables agents behind NAT
Simplifies deployment

Data Flow¶

Task Execution¶

sequenceDiagram
    participant User
    participant Server
    participant Agent
    participant AI as AI Agent

    User->>Server: Create Task
    Server->>Server: Queue Task
    Server->>Agent: Assign Task (Control stream)
    Agent->>AI: Execute Prompt
    AI-->>Agent: Progress/Logs
    Agent-->>Server: Stream Logs (Event stream)
    Server-->>User: Real-time Logs (SSE/WS)
    AI->>Agent: Permission Request
    Agent->>Server: Request Permission
    Server->>User: Permission Prompt
    User->>Server: Approve/Deny
    Server->>Agent: Permission Response
    AI-->>Agent: Continue/Abort
    AI->>Agent: Task Complete
    Agent->>Server: Task Result
    Server->>User: Task Status

Storage Architecture¶

Database (PostgreSQL)¶

Stores all persistent state:

Sessions, tasks, runs
Runner registrations
Workspace metadata
API keys and tokens
Audit logs

Object Storage¶

For large data:

Workspace snapshots (CAS chunks)
Log archives
Agent context snapshots

See Storage for details on content-addressable storage.

Security Model¶

Multi-Tenant Isolation¶

All resources are scoped by tenant_id
Injected by auth middleware (never from user input)
Cross-tenant access prevented at application layer

Credential Handling¶

Mode	Storage	Use Case
Managed	Encrypted in DB	Operator provides keys
BYOK	Memory only	Users bring own keys

Transport Security¶

Production: mTLS required for agent connections
Development: TLS can be disabled with skip_tls: true

Scalability¶

Horizontal Scaling¶

Server: Stateless, can run multiple replicas
Agents: Scale independently per provider
Database: PostgreSQL with read replicas

High Availability¶

DiagramText

flowchart TB
    LB[Load Balancer]

    LB --> S1[Server 1]
    LB --> S2[Server 2]
    LB --> S3[Server 3]

    S1 --> PG[(PostgreSQL Primary)]
    S2 --> PG
    S3 --> PG

    PG --> R1[(Replica 1)]
    PG --> R2[(Replica 2)]

┌─────────────────────────────────────────────────────────┐
│                    Load Balancer                        │
└─────────────────────────┬───────────────────────────────┘
                          │
        ┌─────────────────┼─────────────────┐
        │                 │                 │
        ▼                 ▼                 ▼
┌───────────────┐ ┌───────────────┐ ┌───────────────┐
│   Server 1    │ │   Server 2    │ │   Server 3    │
└───────┬───────┘ └───────┬───────┘ └───────┬───────┘
        │                 │                 │
        └─────────────────┼─────────────────┘
                          │
                          ▼
                ┌─────────────────┐
                │   PostgreSQL    │
                │   (Primary)     │
                └────────┬────────┘
                         │
              ┌──────────┴──────────┐
              ▼                     ▼
      ┌─────────────┐       ┌─────────────┐
      │   Replica   │       │   Replica   │
      └─────────────┘       └─────────────┘

Next Steps¶

Sessions & Tasks - Understand the execution model
Providers - Learn about different provider types
Workspaces - Workspace persistence and sync