Systems thinking ·AI · May 27, 2026 · 6 min read

Is AI Hiding Our Failure Modes?

By John Irving

AI is making it easier than ever to build things that look like they work. The failure modes are still there. We just can't see them anymore.

Understanding Failure Modes

When most people hear "failure mode," they take it literally. A setup that is bound to fail. A situation with the conditions for failure already baked in, even if nothing has broken yet. That reading is not wrong. It is actually a useful place to start.

The engineering definition adds one layer of precision: a failure mode is not just the fact that something will fail, but the specific way it will. The mechanism. The route. "This will fail" is an observation. "This will fail at the point where we need to support a second customer's workflow, because every product decision was made around the first one" is a failure mode. The difference is that you can design against the second one.

That idea is not new. The U.S. military built a formal methodology around it in 1949 after soldiers were dying because ammunition misfired in the field. NASA used it for the Apollo program. Ford adopted it in the 1970s after the Pinto fires made clear what happens when failure modes go unexamined. The industries changed but the principle stayed the same: you can only design against a failure you can name.

Which brings us to what AI is quietly doing to that.

How AI Is Baking In Hidden Failure Modes

AI makes it possible to build things that look like they work without fully understanding why they work. A solo founder can use AI to write a codebase they could not have written alone. A small team can ship a product without the domain expertise the product would traditionally require. A startup can generate marketing that converts, customer support that scales, and a pitch deck that sounds authoritative — all without the institutional depth those things usually signal.

None of that is inherently wrong. But each of those shortcuts has a failure mode attached to it that may not surface for a long time.

The infrastructure holds at a hundred users but was never architected for the load, compliance requirements, or security standards that come when a real enterprise customer shows up. The marketing that sounds authoritative sets expectations the team cannot meet once the product has to perform under pressure.

And then there is the customer problem, which is the subtlest one. AI helps teams move fast enough to generate revenue before they have had enough conversations to know who they are actually selling to. Early adopters show up, pay, and everything reads as validation. But paying for something does not make someone the right customer for where the business needs to go. Early adopters are a specific type: they tolerate rough edges, they figure things out, they pay because they are excited about the possibility. They are not the mainstream buyer who needs something that just works, or the enterprise buyer with procurement requirements and a team to convince. The product gets quietly shaped around whoever is using it. Features, onboarding, messaging — all of it drifts toward the customer you have. By the time you go after the customer you actually need, the product has been optimized for someone else. Payment got treated as validation of the whole thesis when it only confirmed that one type of person found value in this version of the product. Those are not the same thing.

These are not accidents waiting to happen. They are structural decisions made at the beginning, before there is any pressure to examine them, and then buried under layer after layer of outputs that look like progress. By the time the product hits the wall it was always heading toward, the original decision is invisible. All anyone can see is the wall.

So what do you do with this?

The first step is to look for the failure modes deliberately. They will not surface on their own. AI will not flag them. The metrics that tell you things are working will not tell you about the assumptions underneath that are not. Someone has to ask the question: how could this break, and when?

Once you can name them, you can plan for them. A team that ships fast with AI assistance and gets to revenue has not necessarily done something wrong. But they need to be honest about what they have: a product with real signal and known structural limits underneath it. The revenue they generated can fund addressing exactly those limits. Refactor the architecture before the enterprise deal requires it. Fix the security gaps before a customer audit finds them. Sharpen the positioning before the wrong customer segment becomes the majority of the base. A named failure mode can be designed against. That is the whole point.

The problem is the team that ships, hits the numbers, and concludes the approach was sound. What they actually learned is that they got through this one. Not the same thing. And when you build your next product on that conclusion, you are not building on what worked. You are building on what you did not notice.

There is a reason experienced investors spend more time on what a founder does not know than on what they do. A founder who can walk through their failure modes clearly — the infrastructure that will need rebuilding at scale, the customer segment that may churn, the security gaps that will need closing before an enterprise deal closes — is a founder who has priced those risks into their plan. The investor can work with that. What makes a deal genuinely risky is not the presence of failure modes. Every startup has them. It is the team that does not know theirs. Because those unknown risks are not just a strategic problem. They are a valuation problem. A startup carrying hidden failure modes is mispriced relative to its actual risk profile. The revenue is real, the growth may be real, but the number on the term sheet is being held up by assumptions that have not been tested yet. An investor who finds them discounts accordingly. One who does not finds out later. And a founder who never looked is operating as if the discount does not exist, right up until it does.

Why This Matters Beyond Your Product

This is not just a product question. It is a question about how an entire generation of builders is learning to build.

AI is making it possible to move faster, with less expertise, and with fewer of the natural friction points that used to slow things down and, in slowing them down, force some amount of examination. That friction was often inefficient. It was also, sometimes, the thing that caught the failure mode before it got baked in.

If the pattern becomes: ship with AI, survive the early stage, call it success, repeat — then the organizational knowledge being built is not knowledge at all. It is a library of outcomes with no understanding of why they happened. And the next product gets built on that library.

The tools are not the problem. Moving fast is not the problem. The problem is mistaking the absence of visible failure for the presence of sound design. Those are not the same thing, and the gap between them is exactly where the next failure is being built.

The question worth asking, before you ship, is not just whether it works. It is: what is this built on, and what happens when that assumption gets tested?

References

American Society for Quality. (n.d.). Failure mode and effects analysis (FMEA). ASQ. https://asq.org/quality-resources/fmea

Brain Wire Studio. (n.d.). The history and evolution of failure mode and effects analysis (FMEA). Brain Wire Studio. https://brainwirestudio.com/the-history-and-evolution-of-failure-mode-and-effects-analysis-fmea/

Super Engineer. (n.d.). FMEA history. Super Engineer. https://www.superengineer.net/blog/fmea-history

Visure Solutions. (n.d.). History and evolution of FMEA. Visure Solutions. https://visuresolutions.com/risk-management-fmea-guide/history-and-evolution/

Wikipedia contributors. (n.d.). Failure mode and effects analysis. Wikipedia. https://en.wikipedia.org/wiki/Failure_mode_and_effects_analysis