Claim✓ Verified
AlphaGo Lee (2016) used two separate networks for policy and value; every later variant (Zero, AlphaZero, MuZero) merged them into one network with two heads.
- Who
- Eric Jang
- Topic
- Network architecture
- Verification note
- Well-documented in the published Nature papers.