AGENT-NATIVE CLOUD
Deploy agents to production
Deployment and hosting infrastructure for agent SDKs. Works with the Claude Agent SDK today, with more SDKs coming soon.
Deploying agents is the hard part
Your agent works locally. It should work in production too.
Sandboxes weren't built for agents
Most sandbox platforms are built for short-lived code execution, not long-running AI agents. Your agent needs its own computer: a persistent workspace, unlimited session duration, and a deployment workflow built for agents.
Session limits kill your agents
Session caps, CPU-time limits, and idle timeouts. Your research agent analyzing 50 papers? Killed mid-task. Your coding agent on a large refactor? Gone.
Persistence is an afterthought
Ephemeral filesystems, manual volume mounts, pause-and-resume workarounds. None of it works automatically across sessions the way agents need.
From local to production
Build your agent locally, then deploy to our cloud or your VPC with a single command
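A sketch of that workflow, using the `superserve deploy` command that appears later on this page; the local entry point (`agent.py`) is a hypothetical placeholder, and any flags for targeting your VPC are not documented here:

```shell
# 1. Build and test the agent locally (entry point is hypothetical)
python agent.py

# 2. Deploy to Superserve's cloud with one command
superserve deploy
```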
Built for agents
Everything your agent needs to run reliably in production
Security & Compliance
gVisor-sandboxed sessions with encrypted secrets, full audit trails, proxy-only egress, complete logging, and per-session isolation.
Sub-second allocation
Your agent sessions start in a sandbox in under a second, eliminating the cold-start delay of spinning up new containers.
Unlimited Duration
No session duration limits. Agents run as long as they need to complete the task — hours or days. No CPU-time caps.
Persistent Workspace
Each agent gets a persistent filesystem that survives restarts and inactivity. Agents resume where they left off, and files remain accessible after completion.
Real-time Streaming
Stream agent responses in real time to your app via our Python and TypeScript SDKs, REST API, or CLI. See every token and tool call as it happens.
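The Python SDK's actual streaming API isn't documented on this page, so the sketch below shows only the consumption pattern the feature implies: iterate a session's event stream and handle tokens and tool calls as they arrive. `StreamEvent` and `fake_session_stream` are stand-ins for whatever the real SDK returns.

```python
from dataclasses import dataclass
from typing import Iterator

@dataclass
class StreamEvent:
    kind: str   # "token" or "tool_call" (assumed event kinds)
    data: str

def fake_session_stream() -> Iterator[StreamEvent]:
    """Stand-in for a live agent session's event stream."""
    yield StreamEvent("token", "Analyzing ")
    yield StreamEvent("tool_call", "read_file(paper_01.pdf)")
    yield StreamEvent("token", "the first paper...")

def consume(stream: Iterator[StreamEvent]) -> str:
    """Forward tokens to the UI incrementally; surface tool calls separately."""
    text = []
    for event in stream:
        if event.kind == "token":
            text.append(event.data)       # render each token as it arrives
        else:
            print(f"[tool] {event.data}")  # show the tool call in real time
    return "".join(text)

print(consume(fake_session_stream()))
```

The same loop works over a REST server-sent-events feed or the TypeScript SDK; only the event source changes.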
CI/CD Integration
Deploy from GitHub, GitLab, or other providers. Push to your branch and your agent is live in production.
MCP SERVERS
Co-host your tools alongside your agents
Run your own MCP servers in the same environment as your agents. No cross-network latency, no separate infrastructure.
- ✓ MCP servers scale with Ray on Kubernetes to parallelize agentic workloads
- ✓ Heterogeneous compute: CPU and GPU for embedding generation, custom model inference, training, and RL
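The fan-out pattern behind "parallelizing agentic workloads" is sketched below. Ray's version uses `@ray.remote` tasks gathered with `ray.get`; to keep the example self-contained, the stdlib's `ThreadPoolExecutor` stands in for Ray, and `summarize` is a hypothetical agent subtask.

```python
from concurrent.futures import ThreadPoolExecutor

def summarize(paper_id: int) -> str:
    # In a real deployment this might call an MCP tool co-hosted
    # alongside the agent, with no cross-network hop.
    return f"summary-of-paper-{paper_id}"

def fan_out(paper_ids: list[int]) -> list[str]:
    """Run one subtask per paper in parallel and gather results in order."""
    with ThreadPoolExecutor(max_workers=8) as pool:
        return list(pool.map(summarize, paper_ids))

print(fan_out([1, 2, 3]))
```

With Ray on Kubernetes, the same shape scales past one machine: each `summarize` call becomes a remote task scheduled onto CPU or GPU workers.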
Why Superserve?
Skip the infrastructure work, ship your agent
superserve deploy
FAQ
All you need to know
Open Source
Built in the open
Superserve is open source. Contribute, request features, or add support for your favorite agent SDK.
Your agent, production-ready in minutes
Deploy your agent to production in minutes, not weeks