// What happens when you let AI agents run the factory

We Gave AI Agents
the Keys to
Everything.

90 days. 3 agents. 1 mission: build, deploy, and ship production software — autonomously. Here's what we unlocked at each stage.

MCP PROTOCOL AGENT-TO-AGENT RAG MEMORY MULTI-MODEL ROUTING
SCROLL TO BEGIN ↓
STAGE 01
The Awakening — One Agent, One Terminal

It started with a single AI agent in a terminal. Claude Code on a VPS. It could read files, write code, run commands. But it had no memory. No persistence. Every session was a blank slate. We needed to teach it who it was.

🧠 Agent Identity (SOUL.md)

A markdown file that defines who the agent is, what it can do, and what it must never do. Loaded at every session start. The agent's constitution.

UNLOCKED
📝 Persistent Memory (MEMORY.md)

Agents write what they learn to a memory file. Next session, they read it first. Context survives reboots. No more blank slates.

UNLOCKED
Bash + Filesystem Access

Full shell access. Create files, install packages, configure servers, run builds. The agent operates the machine like a senior engineer.

UNLOCKED
🔐 SSH Hardening

Ed25519 keys only. No passwords. Fail2ban. UFW firewall. The agent secured its own infrastructure before we asked it to.

UNLOCKED
STAGE 02
The Protocol — MCP, Tools, and Standards

Raw terminal access wasn't enough. We plugged into the Model Context Protocol — the universal standard for AI-tool communication. Now the agent could talk to databases, browsers, git repos, and external APIs through a standardised interface.

🔌 MCP Integration

Model Context Protocol — the USB-C of AI. One standard protocol connecting agents to any tool: filesystems, databases, browsers, APIs.

UNLOCKED
🎯 Agent Skills System

Reusable skill modules (SKILL.md) with YAML frontmatter. Progressive disclosure: metadata → instructions → full context. 17 skills deployed.

UNLOCKED
🪝 Hook System

PreToolUse, PostToolUse, Stop hooks. Gate dangerous actions, log everything, enforce policies. The agent's immune system.

UNLOCKED
🌐 ACP (Agent Control Protocol)

OpenClaw's native protocol for agent orchestration. Device pairing, challenge-response auth, gateway routing, session management.

UNLOCKED
STAGE 03
The Civilisation — Multiple Agents, One Mission

One agent was powerful. Three agents changed everything. A Guardian that monitors and deploys. An Intelligence that codes and builds. A Grace that designs and tests. Each with its own identity, memory, and model — but sharing one mission.

Guardian Agent

Always-on orchestrator. Receives instructions via Telegram. Decomposes tasks. Routes to the right agent. Monitors health 24/7. Ships to production.

UNLOCKED
🕉 Intelligence Agent

The builder. Claude Code with full filesystem access. Writes APIs, frontends, scripts. Commits to git. Creates PRs. Debugs production issues.

UNLOCKED
🐠 Grace Agent

The artisan. Runs on-demand on laptops. Mobile builds (iOS + Android). Browser testing. UI/UX refinement. Cross-platform validation.

UNLOCKED
🤝 Agent-to-Agent (A2A)

Agents discover each other's capabilities, delegate tasks, and track cross-agent work. Standard inter-agent communication protocol.

EXPERIMENTAL
STAGE 04
The Brain — RAG, Knowledge, and Context Engineering

Agents needed to learn from the past without re-reading everything. We built a modular knowledge system — 6 knowledge modules compiled into agent context. RAG for retrieval. Embeddings for search. The agents became domain experts overnight.

📚 Knowledge Modules

6 structured modules: identity, security patterns, infrastructure, distillation strategies, operational history, agent lifecycle. Compiled at boot.

UNLOCKED
🔍 RAG Pipeline

Retrieval-Augmented Generation. Agents search their knowledge base before answering. No hallucination on known topics. Grounded responses.

EXPERIMENTAL
🧬 Context Compaction

30+ hours of continuous operation without losing context. Server-side summarisation of old turns. The agent never forgets what matters.

UNLOCKED
📊 Decision Logging

Every architectural decision, model choice, and cost trade-off is logged. Full audit trail. The agent can explain why it did what it did.

UNLOCKED
STAGE 05
The Engine — Multi-Model Routing and Cost Control

Not every task needs a $75/M-token model. We built a 3-tier routing system: fast models for simple tasks, balanced models for coding, deep models for architecture. Budget enforcement triggers automatic downgrades. API costs dropped 60%.

🔀 3-Tier Model Routing

Fast (Haiku): status, health, lookups. Everyday (Sonnet): coding, debugging. Deep (Opus): architecture, strategy. Auto-selected per task.

UNLOCKED
💰 Budget Guardian

Daily budget cap with automatic downgrade at 80% threshold. Per-agent cost ceilings. Fallback to local models (Ollama) when budget exhausted.

UNLOCKED
🏠 Local Model Fallback

Ollama runs on VPS with custom-tuned models. When cloud APIs are expensive or slow, agents fall back to local inference. Zero cost for routine tasks.

UNLOCKED
🔄 Cross-Provider Routing

OpenRouter as universal gateway. Anthropic, OpenAI, Google, Meta — route to the best model for each task. Never locked to one provider.

UNLOCKED
STAGE 06
The Factory — Auto-Build, Auto-Deploy, Auto-Heal

The final form. Agents don't just write code — they containerise it, deploy it, configure SSL, set up domains, monitor uptime, and auto-heal when things break. A message on Telegram becomes a live product with a payment link. Autonomously.

🐳 Docker Containerisation

Every app is containerised. PostgreSQL, Redis, n8n, APIs — all orchestrated via Docker Compose. Reproducible deployments, zero "works on my machine".

UNLOCKED
🔁 CI/CD Pipeline

GitHub Actions: lint, test, security review (by Claude), Docker build, auto-deploy on merge. Every push is validated. Every deploy is tracked.

UNLOCKED
🩺 Self-Healing Validator

Runs every 5 minutes. Checks all services. Restarts what's down. Fixes misconfigurations. Kills zombie processes. Reports issues to the owner.

UNLOCKED
🔄 n8n Workflow Automation

Visual workflow engine. Git push triggers deploy. Payment triggers onboarding. Alert triggers incident response. No code needed for automation.

UNLOCKED
🔒 SSL + Security Headers

Let's Encrypt auto-renewal. HSTS, CSP, X-Frame. Every subdomain secured. Every API endpoint hardened. Security isn't optional — it's default.

UNLOCKED
📡 Lead Capture + CRM

Website contact forms → API → database → Telegram notification. Every lead tracked. Every interaction logged. Ready for n8n-powered follow-ups.

UNLOCKED
BOSS LEVEL

What Would You Build
If Your Agents Never Slept?

This isn't a demo. This is a production system that builds, deploys, and earns — 24/7. We're looking for founders, builders, and operators who want to harness the same infrastructure.

REQUEST EARLY ACCESS
OMMILO
"The universal force that guards with grace"