SignalPilot is now #1 on Spider 2.0-DBT with a score of 65.63.

// blog

Latest

Engineering deep-dives, benchmark breakdowns, and updates from the team.

Plausibly Wrong Is Worse Than No Agent at All

trending

In production, an AI data agent that's plausibly wrong is worse than no agent at all: inflated revenue, dropped customers, and double-counted metrics pass review and surface in a board deck weeks later. SignalPilot produces data that's correct, not just plausible, and stays safe on a real warehouse because AutoFyn, the security agent behind bug disclosures in Next.js and MetaMask, audits it continuously. The benchmarks (#1 on Spider 2.0-DBT, 96.9% on ADE-Bench) back this up.

Daniel Schaffield//8 min read

SignalPilot ADE-Bench Report: 96.9%, Highest Score Ever

trending

SignalPilot resolves 62 of 64 ADE-Bench tasks, the highest score on dbt Labs' analytics engineering benchmark. Three skills and one sentence took us from 71% to 96.9%.

Tarik Moon//8 min read

How We Beat JetBrains to #1 on the World's Hardest Data Benchmark

trending

Today, we're thrilled to announce that SignalPilot has claimed the #1 spot on the Spider 2.0-DBT leaderboard — beating JetBrains' Databao by over 7 points. By solving dbt, the hardest problem in the transformation layer, we proved AI agents can be trusted with enterprise data pipelines when they have the right guardrails.

Tarik Moon//5 min read

How Claude Code + SignalPilot Builds Production-Grade dbt Models

From empty directory to 16 models, 25 passing tests, and full documentation — in a single conversation. A step-by-step trace of how Claude Code and SignalPilot MCP built a complete dbt project.

Luiz Fernando//7 min read

Benefits of Using SignalPilot Governed MCP to Build dbt Models

Why governed database access through MCP changes the AI-assisted analytics engineering workflow — from AST-level query parsing to automatic LIMIT injection and full audit trails.

Luiz Fernando//5 min read