AlphaGo Zero | Abstract, Sentence 6

AlphaGo Zero

AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo’s own move selections and also the winner of AlphaGo’s games. David Silver, et al., "Mastering the game of Go without human knowledge" Mastering the game of Go…

2020-01-19

AlphaGo Zero | Abstract, Sentence 5

AlphaGo Zero

Here we introduce an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules. David Silver, et al., "Mastering the game of Go without human knowledge" Mastering the game of Go wi…

2020-01-19

AlphaGo Zero | Abstract, Sentence 4

AlphaGo Zero

These neural networks were trained by supervised learning from human expert moves, and by reinforcement learning from self-play. David Silver, et al., "Mastering the game of Go without human knowledge" Mastering the game of Go without huma…

AI Paper English F.o.R.

A blog that reads papers on artificial intelligence (AI) by putting the Frame of Reference (F.o.R.) from 英語リーディング教本 (an English reading textbook) to full use.

Posts for the month starting 2020-01-01

AlphaGo Zero | Abstract, Sentence 6

AlphaGo Zero | Abstract, Sentence 5

AlphaGo Zero | Abstract, Sentence 4