Skip to content

What is Agentcy?

Agentcy is an AI-powered agent orchestration platform that connects to your infrastructure, builds a knowledge graph of everything, and gives you a conversational AI that can see and act across all your systems — from a single interface.

The Problem

Modern engineering teams rely on dozens of tools: GitHub for code, AWS for compute, Kubernetes for orchestration, PostgreSQL for data, Vercel for deploys, and on and on. Each tool has its own dashboard, its own mental model, and its own API. When an incident fires at 3 AM, you flip between 10 tabs trying to piece together what went wrong. When a new engineer joins, they spend weeks just mapping out how things connect.

There is no single place to see, search, and act on your entire stack.

The Solution

Agentcy ingests data from all your systems into a unified Context Engine — a pluggable graph + search + structured-query layer — then exposes an AI agent that can:

  • Search across repositories, cloud resources, database schemas, deployments, and teams in one query
  • Visualize the relationships between services, repos, clusters, and infrastructure
  • Execute live actions on any connected system through real-time connector tools
  • Orchestrate multi-step workflows with sub-agents, triggers, and schedules
  • Enforce zero-trust policies on every action with OPA/Rego and full audit logging

Architecture at a Glance

Agentcy at a glance A high-level view of Agentcy: the Next.js 16 frontend (chat, graph explorer, connectors, workflows) talks to the Rust/Axum backend, which is composed of focused crates (chat, rag, ingest, auth, policy) and runs on top of three data stores (PostgreSQL, Neo4j, Redis) plus an optional OpenFang sub-agent orchestrator. You see this Frontend (Next.js 16) React 19 · streaming UI · TanStack Query · shadcn/ui · Tailwind v4 Chat Graph Explorer Connectors Workflows REST · SSE · WebSocket Backend · Rust workspace agentcy-api Axum 0.8 · 29-crate workspace · streaming · tools · RAG agentcy-chat agentcy-rag agentcy-ingest agentcy-auth agentcy-policy agentcy-agent agentcy-graph + 22 more crates: connectors, channels, workers, sandbox, tasks, voice, SDK … Data stores Optional PostgreSQL SQL Org · auth · config Conversation history Workflow runs · audit Neo4j Graph + vectors Knowledge graph Realm-partitioned HNSW vector index Redis Cache · queue Job queues · pub/sub Rate limits · cache Sync state OpenFang Sub-agents · sidecar 40+ channel adapters Workflow orchestration Disabled by default Self-hostable · Docker Compose · Railway · macOS desktop
Frontend, Rust backend, three data stores, and an optional OpenFang sub-agent sidecar.

Key Capabilities

Universal Data Connectors

Connect to 15+ data sources, each implementing a dual-trait architecture: IngestionSource for ETL into the knowledge graph, and ConnectorToolProvider for live, real-time tool execution by the AI agent.

CategoryConnectorsLive Tools
Code & DevOpsGitHub (PAT, OAuth, App)12 tools (repos, PRs, issues, commits, files, actions)
CloudAWS, GCP8 tools each (EC2, S3, Lambda, IAM / GCE, GCS, GKE, IAM)
PlatformVercel, Supabase8-10 tools each (deployments, projects, domains / tables, auth, storage)
DatabasesPostgreSQL, MySQL, SQL Server, MongoDB8-10 tools each (query, schema, stats, indexes)
InfrastructureKubernetes10 tools (pods, deployments, services, logs, namespaces)
APIOpenAPI, MCP (Model Context Protocol)Dynamic (auto-discovered from spec / server)
FilesCSV, JSONSchema inference, bulk import
ExecutionRemote ExecutionCommand execution on remote hosts
BIPower BIDataset queries, report metadata

Context Engine

All ingested data lives in a pluggable Context Engine with two providers:

  • Basic — embedded graph store, default for self-hosted installs, zero external dependencies
  • Advanced — columnar engine + S3-compatible object store + pgvector. Available on Agentcy Cloud and on-premise Enterprise deployments. Adds structured KQL/SQL, time-series queries, and multi-billion-node scale.

Both providers expose the same surface:

  • Full-text search across all nodes and properties
  • Vector embeddings for semantic search via fastembed (local, no API calls)
  • Interactive visualization — explore your context with the Explore page (Graph / Search / Query / Pipelines tabs)
  • Structured queries — Cypher on Basic, KQL/SQL on Advanced — auto-charted in the Query tab
  • Multi-tenant isolation — every node carries org_id and realm; queries are scoped by default

AI Chat with Tool Calling

A streaming conversational AI with access to every connected system:

  • Context tools — search nodes, traverse relationships, run structured queries (KQL/SQL on Advanced, Cypher on Basic)
  • Connector tools — execute live API calls to GitHub, AWS, K8s, databases, and more
  • Tool catalog — meta-tools that let the LLM discover and invoke connector tools on demand, instead of loading all tools upfront
  • Approval flow — the agent requests permission before executing sensitive operations, with configurable timeouts
  • Policy enforcement — every tool invocation is checked against zero-trust policies before execution
  • Streaming SSE — real-time token streaming with reasoning blocks, tool calls, and status updates

Supports multiple LLM providers:

ProviderModelsNotes
AnthropicClaude Sonnet 4, Claude Opus 4Recommended. Best tool-calling performance
OpenAIGPT-4o, GPT-4.1Full support including streaming
OllamaLlama 3, Mistral, Qwen, etc.Local inference, no API key required
vLLMAny supported modelSelf-hosted, OpenAI-compatible
LM StudioAny GGUF modelDesktop local inference
llama.cppAny GGUF modelLightweight local inference
Vercel AI GatewayMultiple providersUnified gateway

Sub-Agent Orchestration

Build complex automation with the OpenFang sidecar:

  • Visual workflow editor — drag-and-drop canvas with conditional branching, loops, and parallel execution
  • Sub-agent spawning — create specialized agents with isolated tool sets and contexts
  • Triggers — cron schedules, webhooks, and lifecycle events
  • Gateway management — route requests to different AI providers with load balancing
  • Templates — pre-built workflow templates for common automation patterns

Zero-Trust Security

Enterprise-grade policy enforcement powered by OPA/Rego:

  • In-process policy engine — uses regorus (Rego in Rust), no OPA sidecar needed
  • Default-allow model — policies define deny[msg] rules; any denial blocks the action
  • 15 granular permissions — from connectors:read to policies:manage
  • Role-based access control — define custom roles mapping to permission sets
  • Audit logging — every policy evaluation recorded with full context
  • Policy sources — sync policies from GitHub repos or HTTP endpoints
  • Policy simulator — test rules against sample inputs before deployment
  • API and tool enforcement — middleware checks on every HTTP request, plus pre-execution checks in the agent loop

Distributed Workers

Run long-running jobs and remote execution across your infrastructure:

  • Redis-based job queue — reliable task distribution with at-least-once delivery
  • Heartbeat monitoring — track worker health and auto-reassign stalled jobs
  • Remote execution — run commands on remote hosts through registered workers
  • Horizontal scaling — add workers to handle increased load

Deployment Options

Agentcy runs wherever you need it:

OptionBest ForSetup Time
Docker ComposeDevelopment, small teams5 minutes
Desktop App (macOS)Individual use, demos2 minutes
RailwayQuick cloud deployment10 minutes
Fly.ioEdge deployment10 minutes
AWS (ECS/EKS)Production, enterprise30 minutes
Kubernetes / HelmLarge-scale, self-hosted30 minutes

Feature Comparison

How Agentcy compares to alternative approaches:

CapabilityAgentcyBackstagePortKubecostCustom Scripts
Context Engine (graph + search + structured query)Yes (pluggable; Cloud-grade Advanced engine optional)NoNoNoNo
AI chat with tool callingYes (multi-LLM)NoLimitedNoNo
Live connector tools15+ sources, 100+ toolsPlugins (read-only)IntegrationsK8s onlyManual
Visual graph explorerYesNoNoNoNo
Sub-agent orchestrationYes (OpenFang)NoNoNoNo
Zero-trust policiesOPA/Rego, 15 permissionsBasic RBACRBACN/AN/A
Semantic search (RAG)Local embeddingsNoNoNoNo
Desktop appTauri v2 (macOS)NoNoNoNo
Self-hosted LLM supportOllama, vLLM, llama.cppN/ANoN/AN/A
Deployment modelSelf-hosted or Agentcy Cloud (PaaS)Self-hostedSaaSSelf-hostedDIY

Technology Stack

LayerTechnology
BackendRust, Axum, Tokio, sqlx
FrontendNext.js 16, React 19, Tailwind CSS v4, shadcn/ui
Context EnginePluggable. Basic = embedded graph store; Advanced (Cloud / on-prem) = columnar engine + S3 + pgvector
SQL DBPostgreSQL 16
Cache / QueueRedis 7
DesktopTauri v2 (macOS, WKWebView)
AIClaude, GPT-4o, Ollama, vLLM, LM Studio, llama.cpp
PoliciesOPA/Rego via regorus
Embeddingsfastembed (local, no API)
OrchestrationOpenFang (optional sidecar)

Who Is It For?

  • Platform Engineering teams managing complex multi-cloud infrastructure
  • DevOps / SRE teams needing unified visibility and incident response across tools
  • Security teams enforcing access policies and auditing actions across systems
  • Developers building AI-powered internal tools and automations
  • Data Engineers exploring relationships across databases, APIs, and services
  • Startup CTOs who want one tool instead of ten dashboards

Next Steps

Built by AgentcyLabs. For in-house deployment or Agentcy Cloud (PaaS) access, visit agentcylabs.com.