Summary of "Yann LeCun on What Comes After LLMs"

Overview

Yann LeCun argues that the current AI boom—centered on large language models (LLMs)—is a major commercial success but an incorrect “endgame” for building human- or animal-like intelligence.


Why LLMs Aren’t the Route to Intelligence

LeCun says LLMs are useful and “great” for language manipulation and many products, but they do not model the world well enough to achieve intelligence or generally capable agent behavior.

He highlights a fundamental limitation:


What Comes After LLMs: World Models + Planning

LeCun proposes that progress should shift toward:

He emphasizes two key capabilities:

  1. Predicting consequences of actions
  2. Planning by search/optimization to achieve a goal

He contrasts this approach with primarily reactive, autoregressive behavior.


Critique of Vision-Language-Action (VAs)

LeCun criticizes “vision-language-action” models as a dead end for now, arguing that:


Why His Company Exists (Emmy/Amy Labs; AMI Labs)

LeCun says he launched a startup focused on advanced machine intelligence with the explicit goal of scaling world-model learning using JEPA-style (joint embedding predictive architecture) methods he pioneered at Meta.

He also claims Meta was no longer the right environment for this direction.


World-Model Learning Should Be Non-Generative Representation Learning (JEPA vs Pixel Generation)

A central point in his argument is that effective representation learning for images/videos often relies on non-generative/self-supervised objectives, whereas pixel-level generative prediction tends to fail or be inefficient.

What JEPA Does (Conceptually)

Bottleneck: Representation Collapse

LeCun identifies representation collapse (learning trivial constant representations) as a key research bottleneck.

He connects anti-collapse approaches to:

He cites promising early results from applying these ideas to world-model training.


Robotics / Automation Implications

LeCun acknowledges impressive robotics demos from generative approaches, but argues they are often:

He claims this makes them expensive and brittle, and argues world models could improve:


Data-Efficiency vs Scaling Dynamics

LeCun argues industry competition increasingly rewards digging the “same trench” by scaling current methods with more compute and data instead of investing in more data-efficient approaches.

He concludes that robotics and generalization likely require a paradigm shift, not just more scaling.


Safety: Why He Is Particularly Bearish About LLM Reliability and Controllability

LeCun argues LLMs are intrinsically unsafe, mainly because:

“Objective-Driven AI” Alternative

He proposes objective-driven AI:

He notes failures can still occur if:

But he argues this is more controllable than relying on LLM prompting.


Healthcare Example

LeCun suggests LLMs may be limited to knowledge regurgitation, while major clinical breakthroughs require modeling dynamics—for example:

He positions world models as central to enabling this kind of approach.


Background at Meta/FAIR and Departure

LeCun describes himself as a long-time leader at FAIR, helping build it and emphasizing FAIR’s openness and its research-to-practice pipeline.

He claims that in the last year or two, FAIR’s direction shifted away from the world-model research he believed was necessary.

He also clarifies misconceptions about his role relative to Alex and internal LLM strategy, stating:


Why His Views Diverged Around 2023

LeCun says he didn’t change his mind—others did.

He points to:

LeCun rejects that claim and maintains that a different blueprint is required.


Timeline Claims (Presented Humorously, but with Conviction)

LeCun frames a forecast partly as a joke:

He also acknowledges that this doesn’t guarantee fully ready solutions by then.


Presenters / Contributors

Category ?

News and Commentary


Share this summary


Is the summary off?

If you think the summary is inaccurate, you can reprocess it with the latest model.

Video