The Dwarkesh Reference
← Back
Claim✓ Verified

AlphaGo Lee (2016) used two separate networks for policy and value; every later variant (Zero, AlphaZero, MuZero) merged them into one network with two heads.

Who
Eric Jang
Topic
Network architecture
Verification note
Well-documented in the published Nature papers.