Deployment¶
Deployment Options¶
langfuse-agent exposes its MCP server (console script langfuse-mcp) four ways. Pick the row that
matches where the server runs relative to your MCP client, then copy the matching
mcp_config.json below. Replace the <your-…> placeholders with the values from the Configuration / Environment Variables section.
| # | Option | Transport | Where it runs | mcp_config.json key |
|---|---|---|---|---|
| 1 | stdio | stdio |
client launches a subprocess | command |
| 2 | Streamable-HTTP (local) | streamable-http |
a local network port | command or url |
| 3 | Local container / uv | stdio or streamable-http |
Docker / Podman / uv on this host | command or url |
| 4 | Remote URL | streamable-http |
a remote host behind Caddy | url |
1. stdio (local subprocess)¶
The client launches the server over stdio via uvx — best for local IDEs
(Cursor, Claude Desktop, VS Code):
{
"mcpServers": {
"langfuse-mcp": {
"command": "uvx",
"args": ["--from", "langfuse-agent", "langfuse-mcp"],
"env": {
"LANGFUSE_BASE_URL": "<your-langfuse_base_url>",
"LANGFUSE_TOKEN": "<your-langfuse_token>",
"LANGFUSE_PUBLIC_KEY": "<your-langfuse_public_key>"
}
}
}
}
2. Streamable-HTTP (local process)¶
Run the server as a long-lived HTTP process:
uvx --from langfuse-agent langfuse-mcp --transport streamable-http --host 0.0.0.0 --port 8000
curl -s http://localhost:8000/health # {"status":"OK"}
Then either let the client launch it:
{
"mcpServers": {
"langfuse-mcp": {
"command": "uvx",
"args": ["--from", "langfuse-agent", "langfuse-mcp", "--transport", "streamable-http", "--port", "8000"],
"env": {
"TRANSPORT": "streamable-http",
"HOST": "0.0.0.0",
"PORT": "8000",
"LANGFUSE_BASE_URL": "<your-langfuse_base_url>",
"LANGFUSE_TOKEN": "<your-langfuse_token>",
"LANGFUSE_PUBLIC_KEY": "<your-langfuse_public_key>"
}
}
}
}
…or connect to the already-running process by URL:
3. Local container / uv¶
(a) Launch a container directly from mcp_config.json (stdio over the container —
no ports to manage). Swap docker for podman for a daemonless runtime:
{
"mcpServers": {
"langfuse-mcp": {
"command": "docker",
"args": [
"run", "-i", "--rm",
"-e", "TRANSPORT=stdio",
"-e", "LANGFUSE_BASE_URL=<your-langfuse_base_url>",
"-e", "LANGFUSE_TOKEN=<your-langfuse_token>",
"-e", "LANGFUSE_PUBLIC_KEY=<your-langfuse_public_key>",
"knucklessg1/langfuse-agent:latest"
]
}
}
}
(b) Run a local streamable-http container, then connect by URL:
docker run -d --name langfuse-mcp -p 8000:8000 \
-e TRANSPORT=streamable-http \
-e PORT=8000 \
-e LANGFUSE_BASE_URL="<your-langfuse_base_url>" \
-e LANGFUSE_TOKEN="<your-langfuse_token>" \
-e LANGFUSE_PUBLIC_KEY="<your-langfuse_public_key>" \
knucklessg1/langfuse-agent:latest
# or, from a clone of this repo:
docker compose -f docker/mcp.compose.yml up -d
(c) From a local checkout with uv:
4. Remote URL (deployed behind Caddy)¶
When the server is deployed remotely (e.g. as a Docker service) and published through
Caddy on the internal *.arpa zone, connect with the "url" key — no local process or
image required:
Caddy reverse-proxies http://langfuse-mcp.arpa to the container's :8000
streamable-http listener; http://langfuse-mcp.arpa/health returns
{"status":"OK"} when the service is live.
This page covers running langfuse-agent as a long-lived service: the transports, a
Docker Compose stack, the A2A agent server, putting it behind a Caddy reverse proxy,
and giving it a DNS name with Technitium. To provision the Langfuse platform it
connects to, see Backing Platform.
langfuse-agentships two console scripts: an MCP server (langfuse-mcp) that exposes the typed tool surface, and an A2A agent server (langfuse-agent) that wraps those tools in a Pydantic-AI graph agent. They can be deployed together or independently.
Run the MCP server¶
The transport is selected with --transport (or the TRANSPORT env var):
Health check (HTTP transports):
Configuration (environment)¶
langfuse-agent is configured entirely from the environment. The required set:
| Var | Default | Meaning |
|---|---|---|
LANGFUSE_BASE_URL |
http://localhost:8080 |
Langfuse instance URL |
LANGFUSE_PUBLIC_KEY |
(unset) | Project public key (pk-...) |
LANGFUSE_SECRET_KEY |
(unset) | Project secret key (sk-...) |
AUTH_TYPE |
key |
Auth mode: key, delegated, none |
HOST |
0.0.0.0 |
Bind address (HTTP transports) |
PORT |
8004 |
Bind port (HTTP transports) |
TRANSPORT |
stdio |
stdio, streamable-http, sse |
Per-domain tool registration is toggled with OBSERVABILITY_TOOL, DATASETS_TOOL,
PROMPTS_MODELS_TOOL, MANAGEMENT_TOOL and the finer-grained category switches
(TRACE_TOOL, SCORES_TOOL, PROMPTS_TOOL, …). The full set is documented in
.env.example.
Copy it to .env and populate only what you use.
Docker Compose¶
The repo ships docker/mcp.compose.yml.
It reads a sibling .env and publishes the HTTP server on :8004:
services:
langfuse-agent-mcp:
image: knucklessg1/langfuse-agent:latest
container_name: langfuse-agent-mcp
hostname: langfuse-agent-mcp
restart: always
env_file:
- ../.env
environment:
- PYTHONUNBUFFERED=1
- HOST=0.0.0.0
- PORT=8004
- TRANSPORT=streamable-http
ports:
- "8004:8004"
healthcheck:
test: ["CMD", "python3", "-c", "import urllib.request; urllib.request.urlopen('http://localhost:8004/health')"]
interval: 30s
timeout: 10s
retries: 3
cp .env.example .env # then edit LANGFUSE_* values
docker compose -f docker/mcp.compose.yml up -d
docker compose -f docker/mcp.compose.yml logs -f
A2A agent server¶
The A2A agent server (langfuse-agent console script) wraps the MCP tool surface in
a Pydantic-AI graph agent with a web UI and OpenTelemetry tracing. It auto-discovers
tools from mcp_config.json and connects to the MCP server over MCP_URL. The repo
ships docker/agent.compose.yml,
which deploys the MCP server on :8004 and the agent server on :9004:
export LANGFUSE_BASE_URL=http://your-langfuse:3000
export LANGFUSE_PUBLIC_KEY=pk-...
export LANGFUSE_SECRET_KEY=sk-...
langfuse-agent --provider openai --model-id gpt-4o --api-key sk-...
services:
langfuse-agent-agent:
image: knucklessg1/langfuse-agent:latest
container_name: langfuse-agent-agent
depends_on:
- langfuse-agent-mcp
env_file:
- ../.env
command: [ "langfuse-agent" ]
environment:
- PYTHONUNBUFFERED=1
- HOST=0.0.0.0
- PORT=9004
- MCP_URL=http://langfuse-agent-mcp:8004/mcp
- PROVIDER=${PROVIDER:-openai}
- MODEL_ID=${MODEL_ID:-gpt-4o}
- ENABLE_WEB_UI=True
ports:
- "9004:9004"
Behind a Caddy reverse proxy¶
Expose the HTTP server on a hostname with automatic TLS. Add to your Caddyfile:
# Internal (self-signed) — homelab .arpa zone
langfuse-agent.arpa {
tls internal
reverse_proxy langfuse-agent-mcp:8004
}
# Public — automatic Let's Encrypt
langfuse-agent.example.com {
reverse_proxy langfuse-agent-mcp:8004
}
Reload Caddy:
DNS with Technitium¶
Point the hostname at the host running Caddy. Via the Technitium API:
curl -s "http://technitium.arpa:5380/api/zones/records/add" \
--data-urlencode "token=$TECHNITIUM_DNS_TOKEN" \
--data-urlencode "domain=langfuse-agent.arpa" \
--data-urlencode "zone=arpa" \
--data-urlencode "type=A" \
--data-urlencode "ipAddress=10.0.0.10" \
--data-urlencode "ttl=3600"
…or add an A record langfuse-agent.arpa → <caddy-host-ip> in the Technitium web
console (http://technitium.arpa:5380). The ecosystem
technitium-dns-mcp automates
this as a tool.
Register with an MCP client¶
Add to your client's mcp_config.json (multiplexer nickname lf):
{
"mcpServers": {
"langfuse-agent": {
"command": "uv",
"args": ["run", "langfuse-mcp"],
"env": {
"LANGFUSE_BASE_URL": "http://your-langfuse:3000",
"LANGFUSE_PUBLIC_KEY": "pk-...",
"LANGFUSE_SECRET_KEY": "sk-..."
}
}
}
}
For a remote HTTP server, point the client at http://langfuse-agent.arpa/mcp instead.