Founder Insight

Why 2% Accuracy Is the Only Real AI Moat

Alex Reichenbach, CEO at Structify

Listen on TL;Listen Prefer to listen? Hear this article read aloud.

Most AI startups talk about moats in the language of features. Better UX. More integrations. A faster onboarding flow. The trouble with feature moats is that they collapse the moment a competitor copies the feature, which usually happens in a quarter.

Alex Reichenbach, CEO of Structify — the AI data team for enterprises — has a different answer. The only moat he believes in is accuracy, and not by a lot. By 2%.

“If our pipelines are 2% more accurate than our competitors,” Alex says, “it makes any bank that doesn’t use us, use our competitors instead, negligent.”

That’s the entire competitive thesis in one sentence. And it explains a series of decisions Structify has made — about pricing, architecture, and which capabilities to commoditize away — that look strange until you see them through this lens.

Why feature moats collapse but accuracy doesn’t

Features are visible. Anyone can see them. A competitor watches your demo, lists the features, and ships a clone within a quarter. The moat erodes the moment the competitor’s clone hits 80% feature parity, because the buyer’s perceived risk drops below the threshold required to switch.

Accuracy moats are different. Accuracy isn’t visible from a demo. It only shows up in production, after the customer has run thousands of pipelines against thousands of edge cases. The competitor watching your demo can’t replicate the accuracy by watching — they have to do the work themselves, against the same edge cases, and that takes years.

Even better, accuracy compounds with usage. Every customer query that produces a wrong result is a learning signal. Every nightly functional test against historical queries catches a regression. The vendor who’s been running the system longest, against the most diverse set of customer pipelines, has the strongest accuracy lead — and that lead grows, not shrinks, with time.

The 2% threshold and why it matters at the bank

The 2% number isn’t arbitrary. It’s the threshold above which the buyer’s choice becomes a fiduciary issue.

Imagine you’re a head of data at a large bank. You’re choosing between two AI data infrastructure providers. Provider A has slightly better UX and a fancier dashboard. Provider B has 2% more accurate pipelines on your specific workflows. The board asks why you went with A. Your answer is “the dashboard was nicer.” That’s a career-ending answer.

The board’s actual question, when accuracy is on the table, is “did you know there was a 2% more accurate option, and if so, why didn’t you choose it?” The answer “it’s a 2% difference, doesn’t matter” doesn’t survive scrutiny. Two percent more accuracy on a million-row pipeline is twenty thousand correct answers instead of wrong ones. Twenty thousand wrong answers in a financial pipeline is a regulatory issue.

Alex’s framing — that not using the more accurate provider becomes “negligent” — captures this perfectly. At enterprise scale, accuracy isn’t a preference. It’s a duty.

Why text-to-SQL is commoditized but pipeline accuracy isn’t

The companion insight Alex offers is which AI capabilities are commoditized and which still have moats. Text-to-SQL — converting natural language to a SQL query against a single database — is now commoditized. Multiple frontier models can do it well. Building a startup whose core value prop is text-to-SQL means competing against OpenAI, Anthropic, and a dozen open-source equivalents.

“Text-to-SQL is now sufficiently easy to do that it has basically become a commoditized thing,” Alex says. “Doing large-scale data pipelines is a next step that we need to make sure doesn’t become commoditized.”

Why is large-scale pipeline accuracy harder to commoditize? Because the work isn’t language understanding — it’s the long tail of edge cases. Source connectivity. Schema drift. Compounding errors across multi-step transformations. Metadata management across hundreds of tables. Each of these is solved through customer-specific work that doesn’t generalize cleanly into a foundation model. The accuracy moat lives in that long tail.

What this means for pricing

The pricing model Structify uses — per-seat with cost passthrough on LLM and infrastructure — is the financial expression of the accuracy thesis.

If accuracy is the moat, then revenue should come from the value of the answer, not from the cost of the compute. Structify makes no money on the LLM calls or the scrape jobs. Customers pay them at the same rate Structify pays cloud providers. The Structify margin lives entirely in the per-seat fee, which is justified by the accuracy advantage.

“We pass our costs on directly,” Alex explains, “and then we just want to expand as much as possible to getting more people to use us. I orchestrated that way because I think it really aligns price or incentives. It means that you’re not thinking that you are getting shortchanged for using LLM calls. We’re able to be incentivized to try to make you have a good experience so that you want a coworker to have a good experience.”

The pricing model only works because accuracy is real. If Structify weren’t actually 2% more accurate, customers would simply choose the cheaper alternative. The accuracy is what justifies the per-seat premium.

What this means for builders

If you’re building an AI infrastructure product, the takeaway runs against most startup advice. Don’t compete on features. Don’t try to win on UX alone. Find the accuracy threshold for your category — the percentage above which the buyer’s choice becomes fiduciary, regulatory, or career-ending — and engineer toward it.

Pour your engineering investment into the long tail of edge cases that don’t generalize. Build the nightly functional test suite that catches regressions. Build the metadata layer that handles schema drift. Build the deterministic execution layer that prevents compound errors. None of this is glamorous. None of it shows up in a demo. All of it compounds into a moat.

And once the accuracy is real, charge for it. Charge enough that the customer sees the moat reflected in the price. Pass through the variable costs. Make the value clear: you’re paying for the answer being right, not for the LLM call that happened on the way there.

FAQ

What does it mean for AI accuracy to be a moat?

A moat is a defensive advantage competitors can’t easily replicate. Most AI startups try to moat on features (which copy quickly) or scale (which requires capital). Accuracy is harder to copy because it’s not visible in demos — it only shows up after thousands of production runs against diverse edge cases. The vendor with the longest deployment history and the most diverse customer base has the strongest accuracy lead, and that lead compounds with usage.

Why does a 2% accuracy advantage matter to enterprise customers?

Because at enterprise scale, accuracy choices become fiduciary. A bank choosing between two data providers with a 2% accuracy gap can’t justify picking the less accurate option to a board or regulator. Alex Reichenbach calls this the “negligent” threshold — once a more accurate alternative exists, choosing the less accurate one becomes hard to defend. Two percent on a million-row pipeline is twenty thousand correct answers instead of wrong ones.

How does Structify build and maintain its accuracy lead?

Through three mechanisms. First, the architectural choice to separate LLM-driven interpretation from deterministic code execution — pipelines run the same way every time once written. Second, hundreds of nightly functional tests that catch regressions against historical queries. Third, a metadata layer (data hub integration) that handles schema drift, governance flags, and column-level annotations across enterprise data sources.

What’s commoditized in AI infrastructure and what isn’t?

Text-to-SQL is commoditized — frontier LLMs can do it competently, and the value differentiation is small. Single-database querying, basic document extraction, and conversational interfaces are also approaching commoditization. What’s not commoditized: large-scale pipeline accuracy, multi-source data integration, schema drift handling, metadata governance, and compound-error mitigation. The moat lives in the long tail, not the surface features.

Why does Structify pass through LLM and infrastructure costs at zero margin?

Because the company’s value proposition is the accuracy of the answer, not the cost of the compute. Charging margin on LLM costs would create misaligned incentives — Structify would be incentivized to slow queries down, use more expensive models, or run unnecessary calls. By passing costs through, Structify is incentivized to make the user experience good enough that customers expand seat usage, which is where the margin lives.

What outcomes do Structify customers see from accuracy improvements?

M&A teams reduce manual document classification work from days of human effort to fully automated pipelines. Finance teams stop chasing Excel formula errors across passed-around files because the entire pipeline is reproducible. Banks cut data engineering backlogs that previously consumed entire quarters of capacity. The common thread: latency on insight collapses from weeks to minutes when accuracy is high enough to trust the output without manual review.

What does it cost to compete with Structify on accuracy?

Years and a lot of customer-specific engineering. Accuracy comes from the long tail of edge cases — source-specific connectors, schema-specific governance handling, deterministic execution against compound transformations. None of these scale through prompt engineering or foundation model upgrades. They scale through customer-paid engineering work that doesn’t transfer cleanly. That’s why the accuracy lead compounds for the vendor with the longest deployment history.

Should AI startups always compete on accuracy instead of features?

Only if the buyer’s category has an accuracy threshold above which the choice becomes fiduciary. For consumer AI, accuracy is one factor among many — convenience, brand, and price often matter more. For enterprise AI in regulated or high-stakes domains (financial, healthcare, legal), accuracy is the dominant factor. Identify whether your category has the threshold. If yes, engineer toward it. If no, find a different moat.

How does accuracy interact with pricing power?

Accuracy creates pricing power because it changes the comparison the buyer is making. Without accuracy advantage, the comparison is “your tool vs. their tool” — features and price compete head-on. With clear accuracy advantage, the comparison becomes “the right answer vs. the wrong answer” — and price elasticity collapses. Buyers pay multiples more for the right answer in regulated or high-stakes work, which is why Structify’s per-seat pricing can sustain a premium against cheaper alternatives.

Full episode coming soon

This conversation with Alex Reichenbach is on its way. Check out other episodes in the meantime.

Visit the Channel

More from Alex Reichenbach

Founder Archetype

Read Alex Reichenbach's archetype profile

The Magician · Classical: Daedalus · Tests & Allies

Related Insights