Skip to content

Deployment Overview

Agentcy is a self-hostable platform. You can run it anywhere that supports Linux containers or a static binary. Pick the option that matches your operational comfort, scale, and compliance requirements.

Decision Matrix

OptionBest forEffortHAManaged databases
Local DevelopmentBuilding features, testing locallyLowestNoNo
Docker ComposeSingle VM, small teams, demosLowNoNo
Bare MetalSingle Linux host, no Docker, air-gapped sitesLow–MediumNoOptional
Desktop AppIndividual users on macOSLowestNoNo
RailwayFastest cloud path, hobby–small teamLowLimitedYes (PG, Redis)
AWS CloudFormationSingle-account AWS, push-button stackMediumYesYes (RDS, ElastiCache)
AWS ECSAWS shops on Fargate, no k8sMediumYesYes
AWS EKSAWS shops standardized on KubernetesHighYesYes
Kubernetes (any)GKE / AKS / on-prem k8s, GitOps shopsHighYesOptional
GCPCloud Run for stateless tier, GKE for full HAMedium–HighYesYes

Architecture Recap

Every deployment runs the same components:

ComponentProcessDefault portNotes
agentcy-apiRust/Axum backend8080Single binary, scales horizontally
agentcy-frontendNext.js 16 static + server3000Stateless, scales horizontally
PostgreSQLManaged or self-hosted5432Source of truth — relational data
Context EngineEmbedded graph store (Basic) OR kyma sidecar (Advanced)7687 (Basic, Bolt) / 8080 (Advanced, HTTP + Flight gRPC)Knowledge graph — see Architecture & Tech Stack
RedisManaged or self-hosted6379Job queues, caching
OpenFang (optional)Sidecar4200Sub-agents and orchestration

The backend runs database migrations on startup. There is no separate migration step.

Sizing Guidance

Starting points — adjust based on your workload (tools/min, agent concurrency, graph size):

TierUsersAPIFrontendPostgreSQLNeo4jRedis
Small1–101 × 0.5 vCPU / 512 MB1 × 0.25 vCPU / 256 MB1 vCPU / 1 GB1 vCPU / 2 GB0.25 vCPU / 256 MB
Team10–502 × 1 vCPU / 1 GB2 × 0.5 vCPU / 512 MB2 vCPU / 4 GB2 vCPU / 4 GB0.5 vCPU / 512 MB
Org50–5003+ × 2 vCPU / 2 GB2 × 1 vCPU / 1 GB4 vCPU / 16 GB4 vCPU / 16 GB1 vCPU / 1 GB

The Rust backend is memory-light. Most workloads stay well under 1 GB per replica.

Choosing the Context Engine

Agentcy ships two interchangeable Context Engine providers behind one agentcy-graph interface — Cypher works on both, application code is provider-agnostic.

ProviderBacked byQuery languagesWhere it runs
Basic (default)Embedded graph store, Neo4j-compatibleCypherBundled with every self-hosted install. Optionally swap in Neo4j AuraDB or self-managed Neo4j Enterprise for HA.
Advancedkyma — Rust columnar engine on S3 + pgvectorKQL / SQL / CypherAgentcy Cloud, on-premise Enterprise, and self-host for evaluation. Set CONTEXT_ENGINE=advanced and point at a kyma sidecar.

The Advanced provider unlocks structured KQL/SQL, time-series queries, schema evolution, multi-billion-node scale, and Apache Arrow Flight streaming. On Agentcy Cloud the engine, S3 bucket, and pgvector are provisioned for you — bring-your-own-bucket (AWS S3, GCS, Cloudflare R2, MinIO) is supported for data residency. See Architecture & Tech Stack → Context Engine for the full breakdown and the kyma project at getkyma.dev.

Production Checklist

Before going live on any platform:

  • [ ] All default passwords replaced (POSTGRES_PASSWORD, JWT_SECRET, plus context-engine creds — NEO4J_PASSWORD on Basic or KYMA_TOKEN on Advanced)
  • [ ] TLS terminated in front of the API and frontend
  • [ ] CORS_ORIGINS restricted to your real domains
  • [ ] Database backups scheduled (PG: pg_dump nightly; Neo4j: dump before upgrades)
  • [ ] Logs shipped off the host (CloudWatch, GCP Logging, Loki, Datadog)
  • [ ] Metrics scraped (/metrics Prometheus endpoint on the API)
  • [ ] Auth provider chosen — local for solo, oidc for teams (see OIDC)
  • [ ] Resource limits set on every container/pod
  • [ ] Zero-trust policies enabled if needed (see Policies)

Getting Help

Built by AgentcyLabs. For in-house deployment or Agentcy Cloud (PaaS) access, visit agentcylabs.com.