Agent OS Architecture Reference¶

How agent-utilities (kernel), systems-manager (OS layer), container-manager-mcp (container runtime), tunnel-manager (network stack), and repository-manager (workspace) work together as a unified Agent Operating System, with the Knowledge Graph as the routing fabric.

Architecture Overview¶

The Agent OS is a multi-subsystem architecture where the Knowledge Graph drives all tool discovery, routing, and infrastructure orchestration.

Subsystem Map¶

graph TB
    subgraph KERNEL["🧠 KERNEL — agent-utilities"]
        KG[KG-2.0: Knowledge Graph]
        SEC[OS-5.1: Secret Engine]
        REG[ECO-4.6: Agent Registry]
        SCHED[OS-5.2: Cognitive Scheduler]
        ID[OS-5.1: Identity/Policy]
    end

    subgraph OS_LAYER["⚙️ OS SUBSYSTEMS"]
        SM["ECO-4.0: systems-manager\n23 tools\nHost OS operations"]
        CM["ECO-4.0: container-manager-mcp\n60+ tools\nDocker/Compose/Swarm\n(multi-endpoint)"]
        TM["ECO-4.0: tunnel-manager\n43 tools\nSSH/Remote/Network"]
        RM["ECO-4.0: repository-manager\n24 tools\nWorkspace/Git lifecycle"]
    end

    subgraph OS_SERVICES["🔌 OS SERVICES (deployable)"]
        SX["ECO-4.0: searxng-mcp\nInternet Gateway\n(public instance default)"]
        LF["OS-5.4: langfuse-agent\nObservability Bus\n(deploy via template)"]
    end

    subgraph HOSTS["🖥️ REMOTE HOSTS (KG HostNodes)"]
        H1["ECO-4.0: host:server1\ntcp://192.168.1.10:2375"]
        H2["ECO-4.0: host:server2\ntcp://192.168.1.11:2375"]
        H3["ECO-4.0: host:server3\nssh://admin@192.168.1.12"]
    end

    KG --> SM & CM & TM & RM
    SEC -->|"secret:// creds"| TM & CM
    KG -->|HostNode lookup| CM
    KG -->|HostNode lookup| TM
    CM -->|"per-endpoint"| H1 & H2 & H3
    TM -->|"SSH exec/file"| H1 & H2 & H3
    CM -->|"compose_up(template)"| SX & LF

    style KERNEL fill:#1a1a2e,stroke:#e94560,color:#fff
    style OS_LAYER fill:#16213e,stroke:#0f3460,color:#fff
    style OS_SERVICES fill:#0f3460,stroke:#533483,color:#fff
    style HOSTS fill:#2d132c,stroke:#ee4540,color:#fff

Default Communication: MCP via KG¶

All MCP servers are loaded into the Knowledge Graph at startup via sync_mcp_agents(). The graph router discovers and invokes tools through the KG — this is the native path.

flowchart LR
    Q[ORCH-1.0: User Query] --> GR[ORCH-1.2: Graph Router]
    GR --> KG[KG-2.0: Knowledge Graph]
    KG --> |"MCP tool lookup"| SM[systems-manager tools]
    KG --> |"MCP tool lookup"| CM[container-manager tools]
    KG --> |"MCP tool lookup"| TM[tunnel-manager tools]
    KG --> |"MCP tool lookup"| RM[repository-manager tools]
    TM -.-> |"SSH tunnel"| RH[Remote hosts]
    CM -.-> |"Docker endpoint"| RH

Priority order: 1. MCP tools via KG (default) — tools are registered in the KG and invoked directly 2. A2A protocol (fallback) — only for remote/networked agents

Subsystem Tiers¶

Tier 1: OS Subsystems (auto-installed)¶

These are always-on core primitives. Auto-installed on first AgentRegistry init.

Subsystem	Package	Key Capabilities	# Tools
⚙️ OS Layer	`systems-manager`	Host OS ops, Agent OS MCP wrappers, process/cron/services	23+
📦 Container Runtime	`container-manager-mcp`	Docker/Compose/Swarm lifecycle, multi-endpoint, specialist deploy	60+
🌐 Network Stack	`tunnel-manager`	SSH tunnels, remote exec, file transfer, inventory, security audit	43
📂 Workspace	`repository-manager`	Git workspace mgmt, project install/build/validate, dep graphs	24

Tier 2: OS Services (deploy-on-demand)¶

External dependencies that can be deployed via Infrastructure Templates.

Service	Package	Default Behavior
🔍 Internet Gateway	`searxng-mcp`	Uses random public instance — no deployment required
📊 Observability	`langfuse-agent`	Deployed via compose template when observability is requested

Tier 3: Domain Specialists (26 packages)¶

Available in the default catalog for on-demand install. See agent-registry.md for the full package table.

Functional Boundaries¶

Layer 1: `agent-utilities` — The Kernel¶

Pure Python library. Owns all models, logic, graph orchestration, and KG. Never runs as a standalone server.

Responsibility	Module	Concept
Scheduler logic	`core/cognitive_scheduler.py`	CONCEPT:OS-5.2
Identity & policy logic	`security/permissions_kernel.py`	CONCEPT:OS-5.2
Registry logic + default catalog	`core/registry/package_adapter.py` (`AgentRegistry`) + `core/default_catalog.py`	CONCEPT:OS-5.2
File watcher	`automation/file_watcher.py`	CONCEPT:OS-5.0
Maintenance cron	`automation/maintenance_cron.py`	CONCEPT:OS-5.2
KG models (incl. HostNode, InfrastructureTemplateNode)	`models/knowledge_graph.py`	CONCEPT:KG-2.0
Tool guard	`security/tool_guard.py`	CONCEPT:ORCH-1.0
Self-model + ACO	`knowledge_graph/self_model.py`	CONCEPT:KG-2.1

Layer 2: `systems-manager` — The OS Layer¶

MCP server + CLI for host-level operations. Exposes Agent OS kernel operations as privileged MCP tools by importing from agent-utilities.

Tool Group	# Tools	Wraps
Identity management	4	`PermissionsKernel`
Policy management	4	`PermissionsKernel`
Specialist registry	4	`AgentRegistry`
Agent health	4	`CognitiveScheduler`
File watcher	3	`FileWatcher`
Maintenance scheduling	4	`MaintenanceCron`
Plus 67 existing OS tools	67	Native

Layer 3: `container-manager-mcp` — The Container Runtime¶

MCP server + CLI for container runtime operations. Supports multi-endpoint targeting — can manage Docker daemons on multiple remote hosts.

Tool Group	# Tools	Purpose
Specialist deployment	4	Containerized specialist lifecycle
Infrastructure templates	N/A	Deploy compose-based services on-demand
Plus 32 existing container tools	32	Native Docker/Podman ops

Multi-endpoint support: Uses Docker SDK's native DOCKER_HOST env var, which supports tcp://, ssh://, and unix:// connections. Can also discover Docker hosts via KG HostNode entries.

Layer 4: `tunnel-manager` — The Network Stack¶

MCP server + CLI for SSH-based remote operations. Not required for container access (container-manager-mcp handles that natively), but essential for general remote ops.

Tool Group	Purpose
Host inventory	Manage SSH hosts as Pydantic-native `HostNode` entries in KG
Remote execution	Run commands on remote hosts
File transfer	Send/receive files via SFTP
Network topology	Discover and map network structure
Security audit	SSH config audit and key management

Layer 5: `repository-manager` — The Workspace¶

MCP server + CLI for git workspace lifecycle management.

Tool Group	Purpose
Git operations	Clone, pull, push, branch, merge
Workspace management	Multi-repo workspace setup and validation
Dependency graphs	Build and query project dependency trees
Project lifecycle	Install, build, test, validate projects

KG Host Nodes¶

Remote hosts are first-class KG citizens via HostNode. Credentials are resolved through the secret:// engine — passwords and keys are never stored in plaintext.

HostNode(
    id="host:media-server",
    name="media-server",
    hostname="192.168.1.10",
    alias="media-server",
    user="admin",
    credential_ref="secret://hosts/media-server/password",
    identity_file_ref="secret://hosts/media-server/identity",
    docker_endpoint="tcp://192.168.1.10:2375",
    docker_host=True,
    swarm_role="manager",
    container_manager_url="http://192.168.1.10:9050",
    labels={"role": "media", "location": "rack-2"},
)

How Hosts Are Used¶

container-manager-mcp queries KG for node_type=host AND docker_host=true, targets docker_endpoint per-operation
tunnel-manager queries KG for any HostNode, resolves credential_ref via Secret Engine, connects via SSH
systems-manager health checks poll HostNode.last_seen and update health_status
Infrastructure Templates deploy to a specific host by targeting its docker_endpoint

Infrastructure Templates¶

Every agent package ships a compose.yml. These become Infrastructure Templates — blueprints that container-manager-mcp can reference to scaffold dependencies on-demand.

sequenceDiagram
    participant A as Agent
    participant GR as Graph Router
    participant KG as Knowledge Graph
    participant CM as container-manager-mcp

    A->>GR: "I need langfuse for tracing"
    GR->>KG: Lookup InfrastructureTemplateNode("langfuse")
    KG-->>GR: Template found (compose_ref, required_env, deps)
    GR->>CM: compose_up("agents/langfuse-agent/compose.yml")
    CM->>CM: Resolve env from Secret Engine
    CM->>CM: Deploy to target HostNode
    CM-->>A: "✓ Langfuse deployed on media-server:9001"

Template Resolution¶

Agent needs langfuse → KG lookup → InfrastructureTemplateNode found
  → Check deps: needs postgres? → Deploy postgres template first
  → Resolve env: secret://langfuse/token → Secret Engine
  → Pick target host: KG query docker_host=true
  → container-manager-mcp.compose_up(template.compose_ref)

KG-Driven Specialist Installation¶

The canonical flow. The Knowledge Graph drives tool discovery, routing, and hydration:

sequenceDiagram
    participant U as User / Agent
    participant GR as Graph Router (AU)
    participant KG as Knowledge Graph (AU)
    participant SM as systems-manager MCP
    participant CM as container-manager MCP

    U->>GR: "Install the salesforce specialist"
    GR->>KG: Route query → discover install_specialist tool
    KG-->>GR: Matched: systems-manager.install_specialist
    GR->>SM: invoke install_specialist("salesforce")
    SM->>SM: AgentRegistry.install("salesforce")
    SM->>SM: Load package JSON, check dependencies

    alt Package requires container
        SM->>KG: Discover deploy_specialist_container tool
        KG-->>SM: Matched: container-manager.deploy_specialist_container
        SM->>CM: invoke deploy_specialist_container(image, env, ports)
        CM->>CM: docker/podman run
        CM-->>SM: container_id, status
    end

    SM->>SM: Merge MCP config into active config
    SM->>KG: Hydrate SpecialistPackageNode + tool nodes
    KG->>KG: New tools available for routing
    SM-->>GR: "✓ Installed salesforce v0.1.0 (3 tools)"
    GR-->>U: Result

    Note over KG: Future queries can now route<br/>to salesforce tools via KG

After installation, the new specialist's tools are immediately discoverable by the graph router. No restart required.

Identity-Based Authorization Flow¶

How the Permissions Kernel (CONCEPT:OS-5.2) enforces role-based access across the OS layer:

sequenceDiagram
    participant Agent as Specialist Agent
    participant TG as tool_guard.py (AU)
    participant PK as PermissionsKernel (AU)
    participant SM as systems-manager MCP
    participant EU as Eunomia (SM)

    Agent->>TG: call tool("reboot_server")
    TG->>PK: authorize_tool(identity, "reboot_server")
    PK->>PK: Check role policy (specialist → denied)
    PK-->>TG: DENY
    TG-->>Agent: Blocked — insufficient privileges

    Note over Agent: Admin-role agent can perform the operation

    Agent->>SM: A2A request("reboot_server", admin_identity)
    SM->>EU: Eunomia policy check (admin role)
    EU-->>SM: ALLOW
    SM->>SM: Execute reboot
    SM-->>Agent: Result

Deny takes precedence over a generic wildcard allow. Only explicit non-wildcard allow patterns can override denials. See permissions-kernel.md for the full policy schema.

Default Catalog¶

The Agent Registry ships 37 packages out-of-the-box via default_catalog.py:

Category	Count	Auto-installed?
OS Subsystems	4	✅ Yes
OS Services	2	❌ Available
Domain Specialists	26	❌ Available
Community MCPs	5	❌ Available

OS subsystems are auto-installed on first AgentRegistry.__init__(). All others are placed in available/ for on-demand install via specialist_install.

Use reseed_defaults() to refresh the available catalog (like apt update).

Responsibility Matrix¶

Capability	agent-utilities	systems-manager	container-manager-mcp	tunnel-manager	repository-manager
KG routing fabric	✅ Owner	—	—	—	—
Scheduler logic (CONCEPT:OS-5.2)	✅ Owner	—	—	—	—
Scheduler MCP tools	—	✅ Owner	—	—	—
Identity/Policy logic (CONCEPT:OS-5.2)	✅ Owner	—	—	—	—
Identity/Policy MCP tools	—	✅ Owner	—	—	—
Registry logic + catalog (CONCEPT:OS-5.2)	✅ Owner	—	—	—	—
Registry MCP tools	—	✅ Owner	—	—	—
Container lifecycle	—	—	✅ Owner	—	—
Compose/Swarm orchestration	—	—	✅ Owner	—	—
Specialist container deploy	—	—	✅ Owner	—	—
Infrastructure templates	✅ Models	—	✅ Executor	—	—
SSH remote operations	—	—	—	✅ Owner	—
Host inventory (KG HostNodes)	✅ Models	—	Consumer	✅ Writer	—
File transfer	—	—	—	✅ Owner	—
Git operations	—	—	—	—	✅ Owner
Workspace management	—	—	—	—	✅ Owner
Dependency graphs	—	—	—	—	✅ Owner
Process/service/cron OS ops	—	✅ Owner	—	—	—
Firewall/SSH/users	—	✅ Owner	—	—	—
Eunomia RBAC enforcement	—	✅ Both	✅ Both	—	—

Future Architecture Concepts¶

DevOps Integration (gitlab-api + github-agent)¶

The repository-manager handles git primitives. The gitlab-api and github-agent packages provide the platform layer — issues, merge requests, pipelines, code review. With codebases ingested into the KG, linking issues to code nodes (IssueNode --affects--> CodeNode) enables full traceability.

KG Extension Packs¶

Each MCP repo could ship a kg_extension/ module with domain ontology (OWL/Pydantic node types + relationships). When a specialist is installed via AgentRegistry.install(), the extension auto-hydrates the KG with domain-specific schema. Example: from servicenow_api.kg_extension import register_ontology.