Builder of Autonomous AI Systems · 2016 → Now

AI systems
that run themselves.

Ten years, six systems shipped. Today: a pipeline serving 11 enterprise clients across 5 countries — every step has a safety check before the AI runs.

Active 2016 → 2026 TNG Shopper · current 5 prior shipped systems

Now · 2024 → Present

What's running right now

~10.5M
product pages under autonomous management

I lead the autonomous content pipeline at TNG Shopper — seven steps, each one a safety check before the AI runs. The model is Google's Gemma 4 (a 26-billion-parameter Mixture-of-Experts model), self-hosted on our own infrastructure. Every full run does about 73.5 million sub-tasks. Outputs are scored on four things — Originality, Relevance, Accuracy, Value — and only 68.9% pass. The rest get rejected.


The work · 2016 → 2026

Six projects. The pattern: how to unblock human-dependencies.

Each one taught me something I still use today. The earliest taught the reps; the rest compounded.

  1. Junior Years

    Three years of freelance work for startups and small companies — web development, inventory imports, spreadsheet automation, market research.

    3 startups, 2 low-techs, side-projects · Solo

    Three years of contractor work nobody else wanted — the data-transformation reps that every later pipeline compounded on.
    • 6 categories of work
    • 3 startups, 2 low-techs, side-projects
    • ETL by hand · ERP → website

    LessonDoing data transformation, normalization, enrichment, and formatting by hand for three years builds the reflex that makes every later pipeline feel like the same job at scale.

    BridgeThe same extract → normalize → enrich → format pattern, scaled up: Data Mining.

    Read the case study →
  2. Data Mining

    Automated lead-finding pipeline for an Israeli wealth-management firm.

    Israeli Financial Services Firm · Solo

    A pipeline is a series of filters, not a series of steps.
    • 17 mo in production
    • ₪50M+ in new Assets Under Management
    • 5-stage manually orchestrated pipeline

    LessonThe five-stage filter is the architectural primitive everything later compounds. Frame any system as a chain of deterministic filters before adding probabilistic compute.

    BridgeEleven months in, a parallel co-founded venture began — a camera-first retail computer-vision tool: Seller App.

    Read the case study →
  3. Seller App

    Camera-first onboarding tool that turned retail stockrooms into online storefronts.

    Co-founded with Dr. Eli Osherovich · Bootstrapped 4 yrs

    Three computer-vision generations — barcode, then image, then video — against a canonical catalog of 60 million Stock-Keeping Units. Bootstrapped four years.
    • 3 computer-vision generations: Barcode → Image → Video + Retrieval-Augmented Generation
    • 60M+ canonical Stock-Keeping Units
    • $10K Monthly Recurring Revenue plateau

    LessonVision-language pivot is when I learned that Product-Market Fit without unit economics is a research project, not a company.

    BridgeNine months in, a third concurrent track started — an AI assistant on WhatsApp: Tasko AI.

    Read the case study →
  4. Tasko AI

    AI assistant on WhatsApp that handled real-world errands for paying clients.

    Hasherut · Production agentic system

    From 21 million WhatsApp messages, 1,561 unique intents — classified, routed, executed, and verified across four production layers.
    • 21,102 labeled tasks
    • 153 unique clients
    • 1,561 intent patterns
    • 4-layer Classify / Retrieve / Execute / Verify

    LessonThe four-layer pattern is what every modern agent framework now calls "agentic with tool use." It shipped in production in 2020.

    BridgeAfter Tasko AI wound down, the next retail-AI investment moved to physical-context intelligence: Elysium.

    Read the case study →
  5. Elysium

    Physical-context AI platform for retail demand prediction — 13 brands signed before pause.

    Physical-Context AI for Retail · Paused-pending-Pipeline

    A platform fusing search intent, geo-coordinates, and foot-traffic data into store-level demand signals — eleven specialized agents, fifteen patent claims.
    • 13 brands validated
    • 15,600+ store locations
    • 15 patent claims (3 independent + 12 dependent)
    • 11 specialized agents

    LessonThe right product can be the wrong order of operations. Pause is a routing decision, not a failure mode.

    BridgeSequenced behind a prerequisite layer: the Pipeline ships first; Elysium re-activates downstream once it's operating across enough tenants to feed it.

    Read the case study →
  6. Pipeline / TNG Shopper

    Autonomous content pipeline serving 11 retailers across 5 countries — running today.

    Production · Live observatory

    The synthesis. A seven-node directed acyclic graph, deterministic gates first, fail-closed by default.
    • ~10.5M product detail pages / cycle
    • ~73.5M agent operations / run
    • 68.9% pass rate · Originality, Relevance, Accuracy, Value
    • $0.0006 / product detail page
    • Gemma 4 26B-A4B Mixture-of-Experts on self-hosted vLLM

    LessonThe five-stage filter, the four-layer agent, the fail-closed governance — same architecture, finally fused.


What it proved · Four claims

What ten years taught me

  1. Build the boring parts first.

    Frame the system as a chain of safety checks before adding any AI. Saves cost. Prevents disasters.

    Source: Junior Years (2016)Data Mining (2019).

  2. I shipped "agentic AI" in 2020 — before the industry had a name for it.

    Classify, Retrieve, Execute, Verify — running on AWS Lambda + a fine-tuned OpenAI model for 153 paying clients on WhatsApp.

    Source: Tasko AI (2020 — 23).

  3. A great product can be the wrong order of operations.

    13 brands wanted it. 15 patent claims filed. Pausing was the right call — the underlying system needed to ship first.

    Source: Elysium (2024 — 25).

  4. Always check before the AI runs.

    Every step has a deterministic safety gate. The AI activates only if the gate passes. 80% deterministic, 20% AI.

    Source: Pipeline (2024 — present).


Stack · Production system

What I run, where

Inference & serving
Vertex AI · self-hosted Gemma 4 26B-A4B Mixture-of-Experts on vLLM 0.17.2rc1 · PagedAttention · Google Cloud Storage FUSE · NVIDIA A100 80GB
Orchestration & data
Google Agent Development Kit · Python · FastAPI · Pydantic · Jinja2 · BigQuery · Google Cloud Storage · Firestore · Redis · Cloud Functions
Observability & governance
Langfuse · OpenTelemetry · Model Context Protocol · Deterministic Evaluation and Monitoring Audit System for fail-closed safety · Four-axis evaluation across Originality, Relevance, Accuracy, Value · Direct Preference Optimization via Reinforcement Learning from AI Feedback

Parallel track · Open-source distillations

Code you can read


Writing · Archive

Recent essays


Contact

Reach out

danielq1603@gmail.com LinkedIn GitHub