Infrastructure ActiveActive Infrastructure

OS-NEXUS

Architecture multi-agents souveraine pour Umbra Labs — 8 agents spécialisés, infrastructure hybride VPS/Local, données 100% sous contrôle. Sovereign multi-agent architecture for Umbra Labs — 8 specialized agents, hybrid VPS/Local infrastructure, 100% data control.

CrewAI n8n WireGuard gVisor Ollama ChromaDB

Topologie Réseau SécuriséeSecure Network Topology

Zone SécuriséeSecure Zone

Machine LocaleLocal Machine

IsoléeIsolated

Orchestrateur PythonPython Orchestrator

Pull Redis → Agent CrewAIPull Redis → CrewAI Agent

Ollama Local

qwen2.5-coder:32b / :72b

gVisor Sandbox

runtime runsc — réseau=noneruntime runsc — network=none

ChromaDB

Base vectorielle RAGRAG vector database

▸ Connexion sortante uniquementOutbound connection only

Tunnel ChiffréEncrypted Tunnel

WireGuard VPN

Headscale (souverain)Headscale (sovereign)

← BLPOP tasks (pull)

→ SET results

Zone PubliquePublic Zone

VPS Offshore

Suisse / IslandeSwitzerland / Iceland

n8n Orchestrateurn8n Orchestrator

Webhooks + Queue ModeWebhooks + Queue Mode

:5678

Redis Queue

File d'attente des tâchesTask queue

:6379

AI Gateway (FastAPI)

POST /task → ECHO → Router

:8080

Ollama VPS

deepseek-v2:16b (rapide)

:11434

▲ Ports exposés: 80, 443, 51820/UDPExposed ports: 80, 443, 51820/UDP

Stack LogicielleSoftware Stack

n8n

Orchestrateur centralCentral orchestrator

Gère webhooks (Shopify, GitHub), polling IMAP, déclencheurs cron. Le patron ECHO (risk_validator.js) filtre toutes les intentions avant envoi à l'AI Gateway.Manages webhooks (Shopify, GitHub), IMAP polling, cron triggers. The ECHO pattern (risk_validator.js) filters all intents before sending to the AI Gateway.

VPS

CrewAI

Framework Multi-AgentsMulti-Agent Framework

8 agents spécialisés avec leurs tools. Processus séquentiels et hiérarchiques. Context passé entre agents via Task.context.8 specialized agents with their tools. Sequential and hierarchical processes. Context passed between agents via Task.context.

Local

Ollama + FastAPI

Inférence + GatewayInference + Gateway

Ollama expose une API OpenAI-compatible. Le Gateway FastAPI route les tâches vers Redis. Chaque agent sélectionne son LLM selon la complexité.Ollama exposes an OpenAI-compatible API. The FastAPI Gateway routes tasks to Redis. Each agent selects its LLM based on task complexity.

Hybride

gVisor + ChromaDB

Sécurité + MémoireSecurity + Memory

gVisor isole l'exécution du code généré (réseau coupé, read-only). ChromaDB stocke la base de connaissances vectorielle RAG des agents.gVisor isolates generated code execution (network off, read-only). ChromaDB stores the agents' RAG vector knowledge base.

Local

Sélection des ModèlesModel Selection

Capacité de raisonnement vs VRAM requise (taille de bulle)Reasoning capability vs required VRAM (bubble size)

Reasoning: qwen2.5:72b — Marketing, OPSEC, UX, Designer

Coding: qwen2.5-coder:32b — Dev/Ops (local)

Fast: deepseek-v2:16b — Support, Social (VPS)

Escouade d'Agents Umbra LabsUmbra Labs Agent Squad

Sélectionner une unitéSelect a unit

▶ Sélectionner une unité pour afficher ses paramètres▶ Select a unit to display its parameters

Patron ECHO — Flux de SécuritéECHO Pattern — Security Flow

Trigger

Événement externeExternal event

GitHub, IMAP, Cron, Webhook

→

n8n ECHO

risk_validator.js

LOW / MEDIUM / HIGH

→

LOW → AUTO_RUN

MEDIUM → gVisor Sandbox

HIGH → Human Gate

→

AI Gateway

POST /task

FastAPI → Redis → Agent