Product Jun 6, 2026 9 min read

~ / freestyle-team ❯

Google Cloud Run Alternative for AI Agents

Google Cloud Run is a very good place to deploy containers. If you have an HTTP service, a webhook handler, a scheduled container job, an inference endpoint, or a worker workload that fits the Cloud Run model, it is often a practical default.

That does not make it the right workspace for an AI agent.

The search for a Google Cloud Run alternative usually starts when a team tries to run an agent as if it were just another containerized service. The first version works: accept a request, call a model, run a command, return a result. Then the agent needs to install packages, keep a terminal open, run a dev server, preserve logs, expose a preview, SSH into a broken environment, fork a risky attempt, or continue a user session tomorrow.

At that point the question changes. You are no longer asking, "Where should I deploy this container?" You are asking, "What computer should my agent use?"

For that workload, Freestyle VMs are the most powerful VMs for AI agents: hardware-virtualized machines that run real Linux, can run forever when idle timeout is disabled, and expose the operating-system surface agents actually use.

Cloud Run is deployment infrastructure

Cloud Run is shaped around deployable container workloads. Its own docs describe request timeouts for services, task timeouts for jobs, container runtime rules, scaling behavior, and the contract a container must satisfy when it starts and serves traffic.

That shape is useful. A Cloud Run service receives HTTP requests. A Cloud Run job runs tasks to completion. A worker pool can run continuous background work. Google has invested heavily in making containers easier to ship without making every team operate Kubernetes directly.

The important distinction is that Cloud Run is deployment infrastructure. It is not trying to be a user-visible development machine.

That matters for agent products because agents do not only serve requests. They work inside environments. A coding agent pulls a project, installs dependencies, starts tests, edits files, runs a dev server, tails logs, and recovers from mistakes. An app builder needs a preview URL and a process tree. A browser agent needs the app, browser, screenshots, profiles, and logs close together. A data agent may need a notebook server, a database client, and a shell history that survives reconnects.

Those are workspace requirements, not just deployment requirements.

The timeout is a symptom, not the whole problem

Cloud Run services have a documented request timeout. Google says the default is 5 minutes and that it can be extended up to 60 minutes. That is generous for many web services, but it is still a request boundary. The docs also recommend retries and resumable handler design for longer timeouts because long connections can fail and a reconnect is not guaranteed to land on the same instance.

Cloud Run jobs are different. Their task timeout can be extended much further, up to 168 hours for non-GPU tasks. That makes jobs a better fit for long batch work than request-driven services.

But the timeout is not the whole problem for agents. A seven-day job is still a job. A one-hour request is still a request. The agent product often needs a session: a live environment that users and agents can inspect, pause, resume, route to, and debug.

If you force that session into a service or job abstraction, your application has to externalize the missing pieces. You add object storage for files. You add a database for state. You add a queue for continuation. You add logs for terminal replay. You add custom routing for previews. You add a side channel for human debugging. You add cleanup and checkpointing because the runtime is not the workspace.

Sometimes that architecture is correct. If the workload is naturally stateless, evented, and retryable, Cloud Run is a good fit. If the workload is an agent workspace, the VM is the simpler abstraction.

The filesystem should be a workspace, not scratch

Cloud Run's container runtime contract says each container filesystem is writable, but it is an in-memory filesystem and data written there does not persist when the instance stops. Cloud Run also supports configured volume options, but that is still a container deployment model. Persistent product state has to be deliberately attached from somewhere else.

For normal services, that is fine. Containers should often be immutable and disposable.

For agents, a disposable filesystem is the wrong default. The agent's work accumulates in the environment: installed dependencies, package caches, generated files, local databases, test artifacts, browser profiles, logs, and half-finished attempts. Even when the source of truth belongs in a repository, the working tree still has to behave like a real working tree while the agent is operating on it.

Freestyle VMs are full Linux virtual machines designed for long-running, complex tasks. The docs show VMs controlled through API calls for execution, file reads and writes, lifecycle operations, resizing, domain routing, and more. The point is not that a file API exists. The point is that the API operates against a real Linux machine the agent can also use through ordinary shell commands.

When state becomes source code or reviewable user work, keep it in Git. A VM is the live runtime where the agent uses the computer; Freestyle Git is the durable layer for repositories, branches, diffs, and review.

Terminals are not logs

Most serverless and container platforms make logs easy. That is useful, but logs are not terminals.

Agents need terminals because many real workflows are interactive. Package managers prompt. REPLs keep values in memory. Test watchers wait for file changes. Debuggers hold stack state. CLIs draw terminal UIs. Dev servers stream output until someone stops them. A user may want to watch the same shell the agent is using, close the browser tab, then come back later.

Freestyle's PTY docs describe long-lived interactive shells inside a VM. A PTY session can be attached, detached, and reattached over WebSocket. Sessions survive client disconnects, VM suspends, and VM forks. They are backed by a real pseudo-terminal, so prompts, line editing, job control, REPLs, debuggers, package managers, and terminal UIs behave like they do on a Linux machine.

That changes the product surface. The terminal can become part of the workspace instead of a stream of text you reconstruct after the fact. The agent can send input mid-run, interrupt a stuck process, reattach to the same session, or let a human take over through the same machine state.

Cloud Run is good at running containers. It is not designed to give each agent a durable interactive computer.

Previews should be services on the same machine

AI app builders expose the Cloud Run mismatch quickly.

A generated app is not only a container image or a response body. During development, it is a service running beside its source code, package manager, logs, tests, and terminal. The agent changes a file, the dev server reloads, the browser shows a new error, the agent reads the log, and the loop continues.

Freestyle VM domains route public HTTPS traffic from a hostname to a port inside a VM. The docs show the normal flow: create a VM, run a service inside it, map the hostname to the VM port, and make sure the service listens on 0.0.0.0. HTTPS is provisioned automatically.

That is the right shape for agent previews because the preview is not a separate artifact. It is the app running inside the agent's machine. The same VM still has the files, terminals, SSH access, process state, logs, and lifecycle controls the agent needs to keep working.

Cloud Run is a good target when the app is ready to deploy. A VM is a better place for the agent to build and debug it.

SSH is the support path

Sooner or later, an agent environment will fail in a way that a dashboard log does not explain.

Maybe the package cache is corrupt. Maybe a process is stuck. Maybe a permission changed. Maybe a generated migration half-applied. Maybe the model got into a loop and a human needs to inspect the exact machine state before deciding whether to recover or delete it.

Freestyle VMs support SSH through scoped identities and tokens. You can grant access to a VM, optionally scope it to a Linux user, and connect through the Freestyle SSH proxy. Editor connections use the same scoped access-token model.

This is not a luxury feature. For agent infrastructure, SSH is the pressure-release valve. When the workspace is a real machine, support and power users can debug it like a machine. When the workspace is scattered across service logs, object storage, queues, and reconstructed state, every incident becomes a distributed systems investigation.

Cloud Run vs a VM for agents

The fair comparison is not "Cloud Run bad, VMs good." Cloud Run is strong infrastructure for the workloads it is designed to run.

Choose Cloud Run when the workload is a containerized service, API, webhook receiver, scheduled task, batch job, or worker that can be deployed, scaled, retried, and observed as infrastructure.

Choose a VM when the workload is an agent session: stateful, interactive, long-running, user-visible, or difficult to express as one request or one job.

The decision usually becomes obvious when you list the requirements:

Requirement	Better fit
Deploy a stateless HTTP container	Cloud Run
Run a scheduled container job	Cloud Run
Scale request handlers automatically	Cloud Run
Keep a user-visible development machine alive	VM
Preserve terminal sessions across reconnects	VM
Let an agent use normal Linux tools interactively	VM
Expose a live preview from the same workspace	VM
SSH into the exact environment the agent used	VM
Fork a running workspace for multiple attempts	VM

Cloud Run is where you deploy software. Freestyle VMs are where agents do work.

The agent runtime should be a computer

The strongest reason to choose a Google Cloud Run alternative for AI agents is not one individual limit. It is the product shape.

Agents are computer users. They need files, processes, terminals, ports, logs, package managers, users, networking, and recovery paths. They need to leave work in progress and come back. They need to make a risky attempt without destroying the original. They need to expose an app before it is production-ready. They need a machine that behaves like the environment their tools expect.

You can build pieces of that around Cloud Run with storage, queues, databases, sidecars, jobs, logs, and custom control planes. For some systems, that is the right engineering choice.

For agent workspaces, starting with a VM is cleaner. The operating system already has the primitives agents need. Freestyle gives your product API control over those primitives: create the VM, run commands, work with files, open PTYs, route domains, SSH in, stop, start, resize, fork, and delete the workspace when it is done.

That is the core difference. Cloud Run is excellent deployment infrastructure. AI agents need a computer.