Start Here — What agent-utilities Is & How to Use It¶

The single page to read first. If you are an AI agent or a developer who just wants to use this, everything you need is below or one click away.

What it is, in one paragraph¶

agent-utilities is a batteries-included harness for building Pydantic-AI agents that come with a knowledge graph, orchestration, memory, and tools out of the box. The heavy graph compute runs in a separate Rust engine (epistemic-graph) reached out-of-process over a socket — but you don't need Rust, Postgres, or any server to start: the default knowledge graph runs in-process and costs you nothing to turn on. You can consume it three ways: import it as a library, run it as an MCP server (graph-os), or call its REST gateway.

One ontology over the whole ecosystem¶

agent-utilities maps the entire ecosystem — agent-packages/agents/* + services/* + enterprise systems + research papers — into ONE ontology-driven knowledge-graph orchestration system. OWL/RDF reasoning runs over all of it to extrapolate new relationships (transitive/inverse/subclass/property-chain closures) across domains that were never explicitly linked — research connects to the real deployed estate, not a silo. Long-running objectives (Loops — research, develop, or skill execution) make that reasoning the engine: each cycle promotes new information, reasons over the ecosystem, and harvests the inferred cross-domain relationships back as the next iteration's inputs. Automated research produces Agent-Native Research Artifacts (ARA) — OWL-native, 4-layer, ecosystem-grounded, OWL/SHACL-sealed — exposed via the research_artifact MCP tool and POST /api/research/*. See OWL/RDF Layer.

The 5 pillars (what's inside)¶

Pillar	What it gives you	Deep dive
1. Graph Orchestration	A router→planner→dispatcher that turns a goal into a coordinated team/swarm of agents at runtime	pillar 1
2. Epistemic Knowledge Graph	A temporal, OWL-aware KG with ingestion, hybrid search, and a Palantir-parity ontology — the agent's memory and world model	pillar 2
3. Agentic Harness Engineering	Self-models, evaluation, and evolution-from-failure for the agents themselves	pillar 3
4. Ecosystem & Peripherals	The `graph-os` MCP tools, the hardened MCP multiplexer, and connectors to the wider `*-mcp` fleet	pillar 4
5. Agent OS	Sessions, goals, the REST gateway, server-minted JWT identity, the fleet supervisor + autonomy control plane, Prometheus metrics, tool safety	pillar 5

Three ways to use it (pick one)¶

See Consumption Models for the full trade-offs. The short version:

You want to…	Use	One-liner
Build a standalone agent in Python	Library	`from agent_utilities import create_agent`
Give an existing agent (Claude Code, Cursor, your own) KG + tools	MCP `graph-os`	`uv run graph-os` (stdio)
Share one KG/agent backend across many clients/containers	MCP over HTTP or REST gateway	`uv run graph-os --transport streamable-http` / `python -m agent_utilities` (REST, default port 9000)

1. As a library (standalone agent)¶

from agent_utilities import create_agent

# Skills + universal tools + the in-process knowledge graph, ready to run.
agent, toolsets = create_agent(name="assistant", skill_types=["universal", "graphs"])
print(agent.run_sync("What can you do?").output)

2. As an MCP server (give any agent the KG + tools)¶

uv run graph-os                       # stdio — for Claude Code / Cursor / IDEs
uv run graph-os --transport streamable-http --host 0.0.0.0 --port 8004   # HTTP

Register it in your client's mcp_config.json:

{ "mcpServers": { "graph-os": { "command": "uv", "args": ["run", "graph-os"] } } }

The agent now has graph_query, graph_search, graph_ingest, graph_orchestrate, ontology_*, and more — see Capabilities.

3. As a REST gateway (one backend, many clients)¶

python -m agent_utilities             # REST gateway, default :9000
curl -s localhost:9000/api/graph/query -d '{"cypher":"MATCH (n) RETURN n LIMIT 5"}'

The knowledge graph is free and native¶

You do not need a database to use the KG. The default backend is tiered: the Rust epistemic_graph working store (L1) in front of an embedded LadybugDB (L2). Zero servers, zero config:

export GRAPH_BACKEND=tiered     # this is already the default

When you outgrow it, point GRAPH_DB_URI at Postgres/pg-age and the durable tier switches automatically — nothing else changes. See Deployment Recipes for tiny → single-node → enterprise, and Stardog + pg-age databases to push your ontology to Stardog (or a local SPARQL endpoint) and backfill relationships into Apache AGE from .env/OpenBao in one command. For a config-complete, end-to-end install (the path Claude follows to set itself up), see the Self-Setup guide — one command generates a config.json covering every option and a doctor validates the deployment.

When one host is not enough¶

Every scale-out lever is opt-in and leaves the zero-infra default untouched: one shared Postgres state store (STATE_DB_URI), tenant-sharded KG engines behind client-side HRW routing (GRAPH_SERVICE_ENDPOINTS), Kafka-backed ingest workers (TASK_QUEUE_BACKEND=kafka + kg-ingest-worker), a queue-driven agent-dispatch fleet (AGENT_DISPATCH_BACKEND=queue + agent-dispatch-worker), and pre-forked gateway workers with per-tenant rate limiting (GATEWAY_WORKERS). The flagship guide walks every configuration from laptop to fleet: Deployment Configurations.

Where to go next¶

Capabilities — the concrete list of what an agent can do, with copy-paste snippets.
Consumption Models — library vs MCP stdio vs MCP HTTP vs REST.
Loop Engine — run the self-improvement / research / goal loop (formerly the "golden loop"): the graph_loops entrypoint, the autonomous daemon tick, and the rename/migration notes.
Deployment Configurations — the flagship guide: every deployment shape from zero-infra laptop to sharded, queue-driven fleet.
Ecosystem — how agent-utilities anchors the wider agent-packages/* fleet.
Day-0 Deployment — from scripts/bootstrap.sh to a full enterprise swarm.
Operational examples — focused walkthroughs: ontology→workflow, fleet events wiring, action-policy postures, autoscaling signals, engine sharding, queue dispatch, evolution publication, observability, identity/JWT, MCP consumption.
Metrics reference — every Prometheus series the platform emits.
Reference agent — runnable end-to-end examples.
AGENTS.md — conventions & architecture rules for contributors/AIs editing the repo.