Summary of "Terence Tao - Mathematics in the Age of AI"

Overview

This document summarizes Terence Tao’s talk “Mathematics in the Age of AI.” It outlines the main themes and lessons, barriers to large-scale open collaboration in mathematics, technological and workflow changes enabling broader participation, a detailed case study (the Equational Theories Project), an ongoing collaboration with DeepMind, practical takeaways about AI in mathematics, and recommended components for scalable mathematical collaboration.

Main themes and lessons

Mathematics is conservative and shows strong continuity with centuries-old practices (textbooks, notation, blackboards). That continuity is a strength but slows adoption of new workflows.
Historically, mathematics has lower average collaboration and higher barriers to entry compared with other sciences: many problems require deep, specialized knowledge and proofs must be fully correct.
New technologies and workflows (AI, collaboration platforms, formal verification) are beginning to change mathematical practice.

Barriers to large-scale, open collaboration

High technical entry cost: many problems require PhD-level background.
Requirement of fully correct proofs: a single error can invalidate a collaborative result.
Typical workflows (in-person blackboard work) do not scale to hundreds of remote contributors.

What is changing and why

New tools and workflows (GitHub, discussion boards, proof assistants, automated theorem provers, LLMs) enable larger-scale projects and broader participation, including non-professional contributors.
Formal verification (proof assistants) is a critical enabling technology: machine-checked correctness breaks the “trust barrier” and allows accepting submissions from anonymous or untrusted contributors.
AI is useful but must be combined with verification and appropriate workflows; AI alone is unreliable.

Detailed example: The Equational Theories Project

Goal and scale

Procedurally generated roughly 22 million algebraic implication problems (examples: “does commutativity imply associativity?”).
Each problem is often easy for a trained algebraist, but the sheer number makes the collection infeasible for traditional human-only methods.

Workflow and tools used

Centralized repository on GitHub to store all statements and proofs.
Formalization in the Lean proof assistant so proofs could be machine-checked.
Active online discussion board for collaboration and cross-checking.
Combination of human-generated and computer-generated proofs, with translations between human-readable proofs and Lean formalizations.
Use of automated theorem provers and other automated tools to attack many problems in bulk.

Key success factors

Modularity: the project decomposed naturally into many related but separable subproblems, letting participants claim and work on subsets independently.
Clear metric: tracking the count of unresolved statements enabled decentralized progress and friendly competition to “move the needle.”
Formal verification: every submission had to pass the proof assistant check, allowing trustless contributions and precise discussion of micro-steps.
Decentralized, role-specialized participation: some participants wrote informal proofs, others translated to Lean, others ran automatic searches.

Outcome

The project completed in a few months with proofs or disproofs for all statements, demonstrating the feasibility of large, distributed, formalized mathematics.

Half example (work in progress with Google DeepMind)

Large language models (LLMs) can already solve many problems but are error-prone (even on trivial arithmetic).
Combining LLM outputs with verifiers (automated checking) creates a useful loop: generate → verify → send error/feedback → revise.
DeepMind collaboration (tool named AlphaEvolve) and similar approaches use randomized generation plus feedback/evolution to improve numerical/optimization results.
Early successes include small improvements on packing/optimization benchmark problems; work is ongoing and some results are pending publication.

Practical takeaways about AI in mathematics

Current useful AI applications

Secondary and auxiliary tasks: writing code, literature review, and translating jargon for broader audiences.
Generating conjectures or patterns from datasets in isolated proofs-of-concept.
Running first-pass sweeps over large collections of medium-difficulty problems, escalating hard cases to human experts.

Essential cautions and requirements

Unverified AI output is often unreliable; it must be combined with rigorous, preferably formal, verification before being used for mathematical claims.
AI is not plug-and-play: effective use is situational and requires designing appropriate workflows and metrics.
AI will likely be most effective as part of broader collaborative ecosystems—filling gaps and expanding economically feasible objectives—rather than outright replacing mathematicians’ creative work.

Recommended components for scalable mathematical collaboration

Problem design that decomposes into many modular subproblems.
Public repository (e.g., GitHub) to store statements, proofs, and track progress.
Use of a proof assistant (formal verification language such as Lean) so results can be machine-checked.
Active discussion channels for fine-grained, atomized conversations about specific proof steps.
Clear, measurable progress metric (e.g., count of unresolved items) to coordinate decentralized effort.
Mixed workflows:
1. Humans produce intuition-level proofs and insights.
2. Automated theorem provers and LLMs attempt bulk or randomized generation of proofs.
3. People or translators formalize successful human/computer proofs in the proof assistant.
Feedback loop for AI: generate → verify → send failure/diagnostic back to model → iterate.
Role specialization to lower barriers to contribution (e.g., proof writers, formalizers, automated-tool engineers).

Entities, projects, and tools mentioned

Speaker: Terence Tao
Projects and tools:
- Equational Theories Project (Tao’s 22-million problem project)
- Lean (proof assistant / formal verification language)
- GitHub (collaboration platform)
- Automated theorem provers
- Large language models (LLMs)
- Google DeepMind
- AlphaEvolve (DeepMind-related tool)
People / references:
- Koshi (historic textbook author referenced as an example)
- Jessica Wyn (photographer who made a book of mathematicians’ chalkboards)

Key concepts

Formal verification / proof assistants
Citizen mathematics / broader participation
Modularity, metrics, and decentralization in collaborative workflows
Generate-and-verify loop for AI/LLMs

Share this summary

Is the summary off?

If you think the summary is inaccurate, you can reprocess it with the latest model.

Summarize another video

Summary of "Terence Tao - Mathematics in the Age of AI"

Overview

Main themes and lessons

Barriers to large-scale, open collaboration

What is changing and why