2020-01-01から1ヶ月間の記事一覧
AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo’s own move selections and also the winner of AlphaGo’s games. David Silver, et al., "Mastering the game of Go without human knowledge" Mastering the game of Go…
Here we introduce an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules. David Silver, et al., "Mastering the game of Go without human knowledge" Mastering the game of Go wi…
These neural networks were trained by supervised learning from human expert moves, and by reinforcement learning from self-play. David Silver, et al., "Mastering the game of Go without human knowledge" Mastering the game of Go without huma…