Every node fires through DEMAS.
Six gates, every block, fail-closed. The lines marked demas are the JIT audit layer intercepting at every node boundary.
11 tenants · 5 countries · 234 surfaces · $0.0006 per page.
The Writer node (Node 5, the only sustained inference path) runs on a self-hosted Gemma 4 26B-A4B MoE on a single A100 80GB. Fixed monthly VM cost ÷ pages served. Below: cost derivation, then the volume that produces it.
Anonymized sample from a recent pipeline run. Scroll to walk through what each line means.
Six gates, every block, fail-closed. The lines marked demas are the JIT audit layer intercepting at every node boundary.
~11 seconds of generation, ~1 millisecond of validation. The probabilistic surface is small and pinned at one node — every other node is deterministic.
The line marked FAIL halts downstream. Firestore captures the trace; the orchestrator retries with a corrected prompt. Nothing below threshold reaches production.
Eleven tenants, seven nodes, ~10.5M product-location pages — a full run writes about 73.5M telemetry events to Langfuse. The pipeline is silent everywhere else.
Node 1 (City DNA) injects language, regulation, and timezone before any downstream node fires. City data is cached and reused across all products in that geography.
From self-hosted Gemma on A100 to cloud-hosted Gemini. Langfuse-traced executions per model.
The structural details — node anatomy, MoE routing, the deterministic gate engine — live one page over.