The Dwarkesh Reference
Claim ✓ Verified

AlphaGo Lee evaluated leaf positions by mixing the value-network estimate with the outcome of a real Monte Carlo rollout, a technique dropped in AlphaGo Zero and its successors.

Who
Eric Jang
Topic
Value estimation
Verification note
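Described explicitly in the original AlphaGo paper (Silver et al., 2016), where leaf evaluation combines the value-network output with a rollout outcome; removed in AlphaGo Zero.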
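
Illustration
The mixing described in the claim can be sketched as a weighted average of the two estimates. This is a minimal sketch, not DeepMind's code: the names `mixed_leaf_value`, `value_net`, and `rollout` are hypothetical, and both evaluators are passed in as plain callables. The default weight of 0.5 is the mixing value reported in the 2016 paper.

```python
from typing import Callable


def mixed_leaf_value(
    state: object,
    value_net: Callable[[object], float],
    rollout: Callable[[object], float],
    lam: float = 0.5,
) -> float:
    """Blend a learned value estimate with the outcome of one rollout.

    value_net and rollout are hypothetical stand-ins: the first returns a
    score in [-1, 1], the second plays the position out with a fast policy
    and returns +1 (win) or -1 (loss).
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must be in [0, 1]")
    v = value_net(state)  # learned estimate of the position
    z = rollout(state)    # result of one simulated playout
    # lam = 0 trusts only the network; lam = 1 trusts only the rollout.
    return (1.0 - lam) * v + lam * z
```

Setting `lam=0` reduces this to the value network alone, which corresponds to the rollout-free evaluation used in AlphaGo Zero.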