The autonomous pipeline
that scales organic traffic.

11 tenants · 5 countries · 234 surfaces · $0.0006 per page.

Running Active since 2024 Last execution 
Daniel Manzela
PDPs per cycle
~10.5M product-location pages
Cost per PDP
Self-hosted Gemma 4 MoE on A100
Enterprise tenants
Countries
Surfaces
DEMAS pass
Intentional fail-closed gate
Unit Economics

$0.0006 per page. Here’s the math.

The Writer node (Node 5, the only sustained inference path) runs on a self-hosted Gemma 4 26B-A4B MoE on a single A100 80GB. Fixed monthly VM cost ÷ pages served. Below: cost derivation, then the volume that produces it.

$5.06/hr × 730 hrs/mo = ~$3,694/month (a2-ultragpu-1g · 1× A100 80GB)
~$3,694/month ÷ ~6.16M PDPs/month = ~$0.0006/PDP
11 tenants × ~45K products × ~21 locations = ~10.5M PDPs per full run
~10.5M PDPs × 7 DAG nodes = ~73.5M ops per full run
80%
Deterministic compute
Near-zero marginal cost
$0
Per-token API fees
Self-hosted Gemma 4 MoE
~20%
Actual LLM calls
City + client caching
Execution Telemetry

Reading the trace.

Anonymized sample from a recent pipeline run. Scroll to walk through what each line means.

Beat 1 of 5

Every node fires through DEMAS.

Six gates, every block, fail-closed. The lines marked demas are the JIT audit layer intercepting at every node boundary.

Beat 2 of 5

Node 5 is the only inference call.

~11 seconds of generation, ~1 millisecond of validation. The probabilistic surface is small and pinned at one node — every other node is deterministic.

Beat 3 of 5

Failures don’t propagate.

The line marked FAIL halts downstream. Firestore captures the trace; the orchestrator retries with a corrected prompt. Nothing below threshold reaches production.

Beat 4 of 5

~73.5M of these lines per cycle.

Eleven tenants, seven nodes, ~10.5M product-location pages — a full run writes about 73.5M telemetry events to Langfuse. The pipeline is silent everywhere else.

Beat 5 of 5

Last full execution.

Trace log · sample
Geography & Deployment

5 countries. 234 surfaces.

Node 1 (City DNA) injects language, regulation, and timezone before any downstream node fires. City data is cached and reused across all products in that geography.

ES
Spain
Spanish
PT
Portugal
Portuguese
IL
Israel
Hebrew
US
United States
English
MX
Mexico
Spanish (MX)
234
Managed surfaces
11
Domains
5
Countries
~230
Locations served
Model Evolution

4 models in production.

From self-hosted Gemma on A100 to cloud-hosted Gemini. Langfuse-traced executions per model.

Langfuse traced executions by model

The structural details — node anatomy, MoE routing, the deterministic gate engine — live one page over.

Architecture Deep Dive
7-node execution DAG · O-R-A-V · DPO flywheel
Node anatomy, MoE sparse routing, deterministic quality gates, closed-loop model improvement.