Your single sign-on, with per-team access control. Sign in through the OIDC identity you already trust; every member, key, and agent stays scoped to its team.
One orchestrator. Every fleet, every workflow, every render.
Cloud is the orchestrator underneath your whole fleet. It schedules the GPUs you own, signs your team in through one identity, and packages the workflows your team writes — all over a single outbound tunnel that reaches in through any firewall. Managed, on-prem, or air-gapped.
Every team building AI in 2026 ends up building the same five things.
Identity. Secure connections. Scheduling. Audit. Packaging. Cloud ships all five — so the workflows and services your team writes have a platform underneath them, not a pile of glue code.
Reach a GPU behind any firewall over a single outbound link — no inbound ports to open. The same tunnel carries HTTP, raw TCP, and a live remote-screen channel when you need to look inside a box.
Work lands on the right machine by label, each team holds its own quota, and progress streams back live. Out of capacity? Spill over to external compute on demand.
Every result is traceable back to the run that produced it, and the webhooks Cloud sends are cryptographically signed so your systems can trust them.
Upload workflows, custom nodes, and models once — Cloud distributes them across the fleet, snapshots every version, and shows API consumers the inputs while keeping your internals hidden.
Teams that own their compute and care about control.
Cloud is built for teams who can't put their data on someone else's GPU — and who've stopped pretending that's a niche position.
Media studios & agencies
Standardize how a 50-person team uses AI and productionize your internal pipelines. Give each client an API key that shows the inputs but never your internals — and keep the IP on your own GPUs.
AI service providers
Sell AI capabilities to your partners without sub-leasing GPUs. Cloud keeps each partner isolated, routes their work to the right machines, and signs every callback so they can trust it.
Internal AI platforms
A paved road for your engineers to ship internal AI features — without every team rolling their own workflow runtime.
Regulated industries
Finance, manufacturing, government. Runs on-prem or fully air-gapped, with a license bound to your hardware and no phone-home required.
One codebase. Three trust boundaries.
The same Cloud — identity, scheduler, secure connections, packaging — runs as managed SaaS, in your Kubernetes cluster, or air-gapped behind your firewall. The boundary moves to where you need it, not the other way around.
Managed SaaS
We run the control plane and dashboard for you. You bring GPUs. Fastest path to production.
- We host the platform + Gateway
- You install agents
- Per-tenant data isolation
- Our SLA, our patching
On-Prem Kubernetes
Deploy the whole stack into your own Kubernetes cluster. GitOps-native — declarative manifests, no bespoke installer.
- Your cluster, your keys
- Per-cluster license
- Operator-managed data layer
- Conformant ingress
Air-Gap-Friendly
Designed for networks with no internet egress. The license is bound to your hardware and validated entirely offline.
- Cryptographically signed offline license
- Hardware fingerprint binding
- Lease-token cache
- No phone-home
How Cloud actually fits together.
Three tiers: surface, control plane, compute. The control plane is yours to deploy or ours to host.
Cloud speaks MCP. Your AI agent already knows how to call it.
The same orchestrator that powers Cyberun's dashboard is also an MCP server. Point your AI client at one endpoint with one integration key, and it can drive your workflows, tasks, agents, and container services — fourteen tools in all.
- Endpoint
- https://core.cyberun.cloud/api/v1/mcp
- Auth header
- Authorization: Bearer sk-…
- Transport
- Streamable HTTP
Fourteen tools, four surfaces.
Every MCP tool is scoped to the team the credential belongs to. The agent picks the right one — you don't wire each call by hand.
Workflows
list_workflows · get_workflow · get_workflow_by_slug · presign_file_upload — the agent discovers what's available and gets the parameter schema before submitting.
Tasks
run_workflow · list_tasks · get_task · get_task_result · stream_task_events · cancel_task — submit, watch progress over MCP notifications/progress, fetch artifacts when done.
Agents & container services
list_agents · list_container_services · get_container_service · call_container_service — see what's online and invoke any HTTP service the team has deployed.
Your team's services become AI capabilities.
Deploy Ollama, vLLM, or your own Flask app as a long-running container service. Write a one-line usage_prompt describing how AI should call it. From any MCP client, the agent discovers the service, reads the prompt, and invokes it through call_container_service — no extra wiring, no second integration.
vLLM · Ollama · Whisper · SAM · custom FastAPI · brand checkers · internal model routers
One credential. Workflows and container services in the same agent loop.
Install the Cyberun skill
One command teaches Claude Code, Cursor, Windsurf, or Codex CLI the API + MCP layout. After that, ask: "Submit a Cyberun task for the text-to-image workflow." The agent picks the right path, supplies the right headers, and quotes the right endpoints.
$ npx skills add cyberun-cloud/skillsCompute is pluggable. The product isn't.
The agent isn't a specific binary — it's a protocol. Anything that speaks it is a fleet Cloud can schedule against. That's the difference between betting on a vendor and betting on a layer.
Today the agent is a small Go binary you drop onto a GPU you own — on-prem, a workstation under your desk, or rented from a hyperscaler. It runs ComfyUI graphs, Nerfstudio pipelines, and long-running services, dialing out to Cloud over a single connection so there's nothing to open inbound.
Tomorrow the agent is anything that speaks the protocol. Wrap an external GPU provider, a serverless inference platform, or another team's fleet — Cloud just sees another set of agents. Same scheduler, same identity, same billing surface.
You commit to the orchestrator, not to a compute provider. When the underlying market shifts — who's cheap, who's out of capacity, who launched the new GPU — you don't migrate platforms. We point at a different fleet.
Agents dial out — no inbound ports. Cloud routes between clients and agents.
What air-gap actually requires.
Most AI tooling assumes the model API is one HTTPS call away. Regulated, classified, and on-prem rooms can't. Cloud is built for those rooms — not retrofitted for them.
Runs for days without the internet. A signed lease-token cache keeps a deployment working offline, then re-authenticates with a challenge-response when the network returns. Phone-home is never a hard requirement.
Bound to the machine it's installed on. The license ties to the host fingerprint; move the install and you re-sign it. That stops a copy from quietly spreading sideways inside a regulated org.
Reach in without opening inbound holes. Four tunnel modes — HTTP, TCP, WebSocket, WebRTC — all ride one outbound connection, so inbound paths work without poking holes in your firewall.
You stay in control of outbound AI calls. When a workflow does reach a public AI provider, the request is signed end-to-end — so you can revoke it at the relay, not just at the provider.
Cross-tenant access isn't blocked by audit — it's impossible by design. Access is scoped to each team's own domain, so one tenant simply can't reach another.
The basics are assumed, not bragged about: identity tokens rotate with OIDC revocation, credentials are encrypted at rest, and internal and public APIs stay separate. We don't treat table stakes as features.
One scope, one purpose, one revoke.
Cyberun issues three kinds of credential, each prefixed to make their use obvious. Lose one, revoke one — the others keep working.
The general-purpose API key. Scripts, CI, partner integrations, and MCP clients (Claude Code, Cursor) all carry an sk-. Scoped to one team.
Used by the agent process to connect a GPU machine into the team's pool. Limited to the agent runtime — can't read dashboards or submit tasks on a member's behalf.
Issued when a member pairs a device through the cyberun CLI. User-bound and team-scoped. Stolen laptop? Admins revoke the device key — the user's other access stays intact.
Use cases we hear most.
Patterns we've shaped Cloud around. Yours probably looks like one of these.
Studio-internal AI platform
Standardize how your 200-artist team uses generative tooling. Track every clip back to the workflow and seed that made it.
AI feature in a partner product
Upload your workflow once — a ComfyUI graph, a Nerfstudio pipeline, or a container service. Hand the partner an API key that shows the inputs but hides your internals, and bill by how much they run.
Sovereign AI compute
On-prem deployment in your air-gapped DC. Cloud is the control plane; the data never leaves your network.
From one workstation to a fleet
You bought a DGX. Then two more. Then bare-metal GPU servers, plus a few hyperscaler-rented instances on the side. Cyberun pools them into one team fleet — models and custom nodes auto-sync across every box, MCP gives any AI client a single surface to work against, and the gateway tunnels let you debug a machine without opening a port per box.
Start free today.
Sign up with email, Google, or GitHub, spin up your first team, and run a workflow. Free during launch — no card required, no quotas to debug. Email us when you need a managed, on-prem, or air-gap deployment.
Get started
sales@cyberun.cloud