A recent study by Apple researchers has raised serious doubts about the capabilities of today’s most advanced AI systems, particularly those designed for complex reasoning. The findings suggest that current-generation large reasoning models (LRMs) are still far from achieving the kind of general intelligence often associated with artificial general intelligence (AGI).
In their technical paper, titled “The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity,” Apple's researchers tested leading LRMs such as OpenAI’s o1/o3, Claude 3.7 Sonnet Thinking, DeepSeek-R1, and Google’s Gemini Thinking. Rather than relying on traditional AI benchmarks, the researchers subjected these models to a progression of increasingly complex, non-standard problems designed to test their reasoning ability in unfamiliar contexts.
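To make the idea of a “progression of increasingly complex problems” concrete, the sketch below shows one way such a complexity-scaling evaluation could be set up, using the Tower of Hanoi puzzle as an example. This is only an illustration of the general approach, not Apple’s actual evaluation harness; the `query_model` function is a hypothetical placeholder for a call to a reasoning model, and here it simply returns the reference solution so the script runs end to end.

```python
"""Minimal sketch of a complexity-scaling evaluation: generate puzzle
instances of increasing size, ask a model for a solution, and record
accuracy at each complexity level. Assumed/hypothetical: query_model."""

def solve_hanoi(n):
    """Reference solution: list of (disk, source, target) moves for n disks."""
    moves = []
    def recurse(k, src, aux, dst):
        if k == 0:
            return
        recurse(k - 1, src, dst, aux)
        moves.append((k, src, dst))
        recurse(k - 1, aux, src, dst)
    recurse(n, "A", "B", "C")
    return moves

def is_valid_solution(n, moves):
    """Simulate the proposed moves and check legality plus the final state."""
    pegs = {"A": list(range(n, 0, -1)), "B": [], "C": []}
    for disk, src, dst in moves:
        if not pegs[src] or pegs[src][-1] != disk:
            return False                       # disk is not on top of the source peg
        if pegs[dst] and pegs[dst][-1] < disk:
            return False                       # larger disk placed on a smaller one
        pegs[dst].append(pegs[src].pop())
    return pegs["C"] == list(range(n, 0, -1))  # all disks ended up on the target peg

def query_model(n):
    """Hypothetical placeholder for a reasoning-model call; returns the
    reference solution so this sketch is runnable as-is."""
    return solve_hanoi(n)

if __name__ == "__main__":
    for n in range(3, 11):                     # complexity = number of disks
        attempts = [query_model(n) for _ in range(5)]
        accuracy = sum(is_valid_solution(n, m) for m in attempts) / len(attempts)
        print(f"disks={n:2d}  accuracy={accuracy:.2f}")
```

The point of a setup like this is that difficulty can be dialed up mechanically (more disks means exponentially more required moves), so a model’s accuracy can be plotted against complexity rather than against a fixed benchmark it may have memorized.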
The results were stark: although LRMs performed reasonably well on simpler tasks, their accuracy collapsed as problem complexity grew. Apple’s team concluded that these systems—despite being branded as “thinking” models—fail to develop true general problem-solving skills. Instead, they tend to reproduce learned patterns rather than generalizing their reasoning strategies to new scenarios.
“Accuracy ultimately collapses to zero beyond certain complexities,” the researchers noted, arguing that these models merely mimic reasoning rather than engaging in it. This finding runs counter to recent claims from leaders at some AI companies, including OpenAI and Anthropic, who have publicly stated that AGI could be achieved within the next few years.
From a technological perspective, the implications are significant. The study points to a structural limitation in how LRMs are built and trained, raising concerns about over-reliance on large language models for tasks requiring robust, adaptive reasoning.
Critics within the AI research community have echoed Apple’s conclusions, warning that LRMs may not be the pathway to AGI that some have envisioned. For now, it appears that the leap from impressive pattern recognition to true intelligence remains a work in progress—reminding technologists and policymakers alike that claims of near-term AGI may be premature.