005 · Sizing calculator

Size your deployment.

Configure your workload and see the recommended infrastructure in real time. Choose your target environment, add your models, and adjust options to get a production-ready sizing estimate.

Target environment

Workload

Number of conversations active at the same time. Each holds state in Redis, so this is the primary driver of memory scaling.
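As a rough illustration of why concurrent conversations dominate memory sizing, the sketch below estimates Redis memory as a linear function of the conversation count. The function name, the 64 KiB per-conversation state size, and the 1.5x overhead factor are all hypothetical placeholders, not values used by the calculator; substitute figures measured from your own workload.

```python
def redis_memory_gib(concurrent_conversations: int,
                     state_kib_per_conversation: float = 64.0,
                     overhead_factor: float = 1.5) -> float:
    """Estimate Redis memory (GiB) needed to hold active conversation state.

    All defaults are illustrative assumptions:
      state_kib_per_conversation: average state held per conversation, in KiB
      overhead_factor: headroom for fragmentation, replication buffers, etc.
    """
    if concurrent_conversations < 0:
        raise ValueError("concurrent_conversations must be non-negative")
    raw_kib = concurrent_conversations * state_kib_per_conversation
    # 1 GiB = 1024 * 1024 KiB
    return raw_kib * overhead_factor / (1024 ** 2)


if __name__ == "__main__":
    for n in (1_000, 10_000, 100_000):
        print(f"{n:>7} conversations -> {redis_memory_gib(n):.2f} GiB")
```

Because the estimate is linear, doubling the number of simultaneous conversations doubles the memory requirement under these assumptions.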

Total daily volume across all agents. Drives storage and Kafka throughput sizing.

How long to retain conversation history and audit logs.
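The relationship between daily volume, retention, storage, and Kafka throughput can be sketched as follows. This is a back-of-the-envelope model, not the calculator's actual formula: the function names, the 4 KiB average message size, the replication factor of 3, and the 3x peak-to-average ratio are assumptions chosen only for illustration.

```python
SECONDS_PER_DAY = 86_400


def storage_gib(messages_per_day: int,
                retention_days: int,
                avg_message_kib: float = 4.0,
                replication_factor: int = 1) -> float:
    """Estimate retained storage (GiB) for conversation history and audit logs.

    avg_message_kib and replication_factor are illustrative assumptions.
    """
    total_kib = (messages_per_day * retention_days
                 * avg_message_kib * replication_factor)
    return total_kib / (1024 ** 2)


def kafka_peak_msgs_per_sec(messages_per_day: int,
                            peak_to_average: float = 3.0) -> float:
    """Estimate peak Kafka throughput in messages per second.

    Assumes traffic peaks at peak_to_average times the daily mean rate
    (a hypothetical ratio; measure your own traffic shape).
    """
    return messages_per_day / SECONDS_PER_DAY * peak_to_average


if __name__ == "__main__":
    daily = 5_000_000
    print(f"storage (90 days, 3 replicas): "
          f"{storage_gib(daily, 90, replication_factor=3):.1f} GiB")
    print(f"peak throughput: {kafka_peak_msgs_per_sec(daily):.0f} msg/s")
```

Storage grows with the product of daily volume and retention, while Kafka throughput depends only on daily volume and how bursty it is.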

Self-hosted models

Add models to size GPU nodes. Leave empty to use models as a service (MaaS).

Options

Self-hosted observability

Deploy Prometheus, Loki, Tempo, and Grafana on your infrastructure.

High availability (multi-AZ)

Ensure a minimum of 3 nodes per group, distributed across Availability Zones.
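The high-availability rule can be read as a floor on node count plus an even spread across zones. The sketch below shows one way to express that; the function name, zone labels, and round-robin placement are assumptions for illustration and may differ from how the calculator or your scheduler actually places nodes.

```python
def spread_across_zones(required_nodes: int,
                        zones: list[str],
                        high_availability: bool,
                        min_ha_nodes: int = 3) -> dict[str, int]:
    """Return the node count per zone for one node group.

    With high availability enabled, the group is raised to at least
    min_ha_nodes (3, per the option text) before being spread as evenly
    as possible across the given zones.
    """
    if not zones:
        raise ValueError("at least one zone is required")
    total = max(required_nodes, min_ha_nodes) if high_availability else required_nodes
    base, extra = divmod(total, len(zones))
    # The first `extra` zones receive one additional node each.
    return {zone: base + (1 if i < extra else 0) for i, zone in enumerate(zones)}


if __name__ == "__main__":
    zones = ["az-a", "az-b", "az-c"]  # hypothetical zone names
    print(spread_across_zones(1, zones, high_availability=True))
    print(spread_across_zones(5, zones, high_availability=True))
```

Under these assumptions, a group that would otherwise need a single node is raised to three, one per zone, so the loss of any one zone leaves two nodes running.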