The gap between the AI capabilities the tech industry experiences daily and what the general public perceives. We show you with real data why this gap is dangerous.
Perception Gap
Most people use the free version of AI. Matt Shumer describes this as "judging the smartphone era with a flip phone." Free AI is more than a year behind the latest paid models.
"When you ask AI to build an entire app, it tests itself, iterates, and completes it. It shows not just coding ability, but judgment."
"It gives useful answers sometimes, but gets a lot wrong too. Isn't it just a chatbot that can't handle complex tasks?"
How accurately each group understands AI's current level
Exponential Growth
According to METR benchmarks, AI autonomous task capability doubles every ~6.5 months overall, and has accelerated to every ~89 days (~3 months) since 2024. GPT-5.3-Codex and Claude Opus 4.6, released simultaneously on Feb 5, show this acceleration intensifying.
Task duration AI can complete with 50% probability without human expert help (50% time horizon)
OpenAI's GPT-5.3-Codex and Anthropic's Claude Opus 4.6 were released on the same day. GPT-5.3-Codex was announced as "the first model to contribute to building itself," while Opus 4.6 nearly doubled its predecessor's score on ARC-AGI-2. Official METR evaluations haven't been published yet, but benchmark improvements show the curve steepening.
| Benchmark | GPT-5.2 | GPT-5.3-CodexNew | Opus 4.5 | Opus 4.6New |
|---|---|---|---|---|
| SWE-bench Verified | 80.0% | — | 80.9% | 80.8% |
| SWE-Bench Pro | 56.4% | 56.8% | — | — |
| Terminal-Bench 2.0 | 64.0% | 77.3% | 59.8% | 65.4% |
| OSWorld | 38.2% | 64.7% | 66.3% | 72.7% |
| ARC-AGI-2 | — | — | 37.6% | 68.8% |
| GPQA Diamond | — | — | 87.0% | 91.3% |
| HLE (with tools) | — | — | 43.4% | 53.1% |
| GDPval-AA Elo | ~1462 | — | 1416 | 1606 |
| BrowseComp | — | — | 67.8% | 84.0% |
Source: OpenAI, Anthropic official announcements (2026.02.05). GPT-5.3-Codex: 25% faster + 400K context. Opus 4.6: 1M context + adaptive thinking.
How much they improved over the previous generation on the same benchmarks
50% time horizon change by model release date (log scale). The curve has steepened since 2024.
Timeline
From AI that couldn't do basic arithmetic to autonomous expert-level complex tasks.
Interactive
The Dangerous Gap
Actual AI capability is rising exponentially, but public perception barely moves. Society faces shocks it's not prepared for.
The wider the red area, the greater the societal shock
"Within the next 1–5 years, 50% of entry-level office jobs could be eliminated. We are only 1–2 years away from AI being able to build models that are fundamentally far superior to the current generation."
Impact
Click an item to see details.
What You Can Do
Recognizing this gap already puts you one step ahead.
A $20/month subscription can close a 1+ year technology gap.
Integrate AI into daily tasks like writing, analysis, and coding.
One hour a day experimenting with AI tools. In a month, you'll be a completely different person.
The ability to learn itself — not any specific skill — becomes your most valuable asset.
The fastest way to close the gap is to share this information.