Many AI Applications Are Overvalued—and Blind to the Disruption Risk of Foundation Models
In the current AI gold rush, startups are racing to deliver applications built atop foundation models—offering everything from legal brief drafting to sales emails, AI tutors to coding assistants. Yet a harsh reality is beginning to set in: many of these companies are overvalued, chasing short-term hype while ignoring a long-term risk that could erode their entire business model.
That risk? Foundation model disruption.
Large language models like OpenAI’s ChatGPT, Anthropic’s Claude, or Meta’s LLaMA are evolving rapidly—eating up downstream markets by becoming smarter, more versatile, and increasingly integrated with agents and tools. The same general-purpose intelligence that powers these applications today could make many of them obsolete tomorrow.
To help evaluate this risk, we propose a simple 2×2 framework to classify AI applications and their vulnerability to foundation model disruption.
The AI Application Disruption Risk Framework
Our 2×2 matrix is defined by two key dimensions:
-
Horizontal Axis: Specialization vs. Generalization
-
Specialized applications are tailored for a narrow vertical (e.g., legal contracts, medical imaging).
-
General applications operate across many use cases or user groups (e.g., writing tools, customer support bots).
-
-
Vertical Axis: Added Value Layer vs. Thin Wrapper
-
High-value layers involve proprietary data, workflow integration, strong UX, regulatory insight, or domain-specific reasoning.
-
Thin wrappers are applications that merely repackage foundation models with a prompt and UI.
-
| General | Specialized | |
|---|---|---|
| Thin Wrapper | 🚨 High Risk – Easily replaced by foundation model APIs or plugins (e.g., AI email assistant with no added data or UX innovation). | ⚠️ Medium Risk – Might survive short term with niche language or formats, but long-term viability is weak. |
| Value-Added Layer | 🛡️ Defensible – Broad apps that aggregate data, fine-tune UX, or build community/moats (e.g., Notion, Grammarly). | ✅ Best Positioned – Deep IP, domain knowledge, regulatory integration, or workflow fit make these apps hard to displace. |
Examples Across the Matrix
-
High Risk (Thin + General): Tools that let users write poems, generate basic emails, or summarize content—easily replicated by ChatGPT plugins or agents.
-
Medium Risk (Thin + Specialized): A platform that only reformats medical papers using GPT with little else added.
-
Defensible (Value + General): A collaborative productivity app that incorporates retrieval-augmented generation (RAG), version control, and a social layer.
-
Best Positioned (Value + Specialized): AI-driven legal platforms with access to private case law databases, client-attorney communication workflows, and compliance tooling.
The Illusion of Moats
Many AI application founders pitch defensibility based on brand, first-mover advantage, or a slick user interface. But in a world where foundation models become platforms with plugins, tools, and agent orchestration, those moats can dry up quickly.
What truly differentiates a long-term, disruption-resilient AI application is deep integration with proprietary data, workflows, domain knowledge, and compliance. Without that, you’re just renting attention from a foundation model that could one day rent it back—without you.
If your product can be described as “an easy way to prompt ChatGPT to do X,” chances are it will be replaced—by ChatGPT itself. As foundation models become multimodal, memory-enhanced, and agentic, their scope will stretch even further.
Founders, investors, and product leaders must ask themselves:
-
What is our true moat beyond the model?
-
Are we adding unique value, or just reselling intelligence?
-
What happens when foundation models offer our exact feature as a native plugin?
AI disruption is not just coming from outside your market—it’s coming from beneath it.
You might enjoy listening to AI World Deep Dive Podcast: