DeepMind publishes self-taught AlphaGo Zero in Nature

frank

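DeepMind, the artificial intelligence company owned by Google, has announced AlphaGo Zero, an upgraded version of AlphaGo. The result shows that reinforcement learning without human supervision can reach state-of-the-art performance on problems such as Go. The AlphaGo that defeated Lee Sedol was trained on roughly 30 million positions from human games, whereas AlphaGo Zero used no human game records and learned from 4.9 million games of self-play. After three days of training, AlphaGo Zero beat that version of AlphaGo 100 to 0.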
Paper: https://deepmind.com/documents/119/agz_unformatted_nature.pdf
Official blog: https://deepmind.com/blog/alphago-zero-learning-scratch