AlphaZero is a general reinforcement learning algorithm that masters chess, shogi, and Go through self-play, outperforming specialized state-of-the-art programs.
The paper presents AlphaZero, a reinforcement learning algorithm that teaches itself to play and master three different board games: chess, shogi, and Go. Unlike previous programs that relied on game-specific knowledge and techniques, AlphaZero uses a generalized approach starting from random play and only requires knowledge of the game rules. Through self-play, AlphaZero achieved superhuman performance and defeated world champion programs in all three games, demonstrating the potential for a general game-playing system.
The algorithm's performance in other types of games or real-world applications has not been explored, and further research is needed to assess its generalizability beyond board games.
Human-level control through deep reinforcement learning
semantic_scholar
Playing Atari with Deep Reinforcement Learning
semantic_scholar
RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning
semantic_scholar
PyTorch: An Imperative Style, High-Performance Deep Learning Library
semantic_scholar
Omnilingual MT: Machine Translation for 1,600 Languages
arxiv