squadbase.yml
Learn how to configure `squadbase.yml`.
Squadbase uses a YAML-formatted file to describe how your app should be built and which cloud settings to apply at deploy time.
By defining these parameters, the platform picks up the settings automatically during deployment.
The file is usually saved as squadbase.yml
, but you can rename it by changing Setting file path in the dashboard.
Required keys vary depending on the runtime and framework you choose.
Build
Define build-time settings under build:
.
Squadbase builds your app with Docker. For frameworks listed in framework
, you don’t need to write your own Dockerfile—one is provided automatically.
If you’re using an unsupported framework or need custom OS packages, you can supply your own Dockerfile.
Using an officially supported framework
Set the following keys when you rely on Squadbase’s built-in images:
Key | Values | Description |
---|---|---|
framework | streamlit nextjs morph | Framework |
runtime | python3.9 python3.10 python3.11 python3.12 node18 node20 node22 | Runtime |
package_manager | pip poetry uv npm yarn pnpm | Package manager |
entrypoint | src/app.py src/main.py src/index.py | App entry point |
entrypoint
is used only for Streamlit to point to the Python file to run.
It is ignored for other frameworks.
Using a custom Dockerfile
If a Dockerfile
exists in the project root, Squadbase ignores framework
and builds with your Dockerfile instead.
context
and build_args
apply only when you bring your own Dockerfile.
Deployment
Deployment settings go under deployment:
.
Choose the cloud provider with provider
(defaults to gcp
).
AWS
Key | Type | Description |
---|---|---|
memory | number | Memory (MB) |
timeout | number | Timeout (sec) — max 900 |
provisioned_concurrency | number | Provisioned concurrency |
ephemeral_storage | number | Ephemeral storage (MB) |
GCP
Key | Type | Description |
---|---|---|
memory | string (e.g. 1Gi, 512Mi) | Memory |
cpu | number | CPU |
timeout | number | Timeout (sec) — max 900 |
concurrency | number | Max concurrent requests |
min_instances | number | Minimum instances |
ephemeral_storage | string (e.g. 1Gi, 512Mi) | Ephemeral storage |
Pricing model
Runtime charges are calculated on memory, request, and CPU.
You pay for resources used according to the spec you define under deployment
.
Memory-based ($/GB-sec) | Request-based ($/1 M) | CPU-based ($/vCPU-sec) | |
---|---|---|---|
1. AWS | $0.0000200000 | $0.25 | – |
2. AWS – standby instances | $0.0000050000 | – | – |
3. GCP | $0.0000030000 | $0.50 | $0.0000270 |
4. GCP – standby instances | $0.0000030000 | – | $0.0000270 |
Standby instances map to provisioned_concurrency
on AWS and min_instances
on GCP.
Keeping a standby instance running shortens cold-start delays.