The Dwarkesh Reference
2026-05-15

Eric Jang

Building AlphaGo from scratch

Listen / read the full episode ↗

Predictions (5)

Where they disagreed

Dwarkesh
Eric Jang
Does understanding AlphaGo in detail make it less impressive?
Yes: explicit tree search and hand-tuned heuristics make it look engineered rather than emergent; simple RLVR on LLMs is more surprising.
No: the profundity is that a 10-layer network compresses an intractable search into one forward pass, which is genuinely mysterious.
Is a verifiable outer loop like Go win-rate enough to drive meaningful AI self-improvement?
Skeptical: a win-rate loop doesn't capture paradigm-shifting discoveries like scaling laws; we improve only what we measure.
More optimistic: Go encapsulates many research sub-problems and backstops reward hacking; skills should transfer to harder domains.

Mental models (4)

Claims (4)