The MuZero artificial intelligence (AI) developed by Alphabet subsidiary DeepMind can master games without learning their rules beforehand.
The system attempts first one action, then another, learning what the game rules permit, and concurrently noting the rewards proffered.
MuZero then modifies its strategy until it finds a way to collect rewards more readily; this observational learning is perfect for any AI confronting problems that cannot be specified easily.
MuZero outperforms previous DeepMind AIs in its economical use of data, because it models only environmental parameters that matter in decision-making.
Once trained, MuZero processes so little for decision-making that its entire function might be managed on a smartphone.
From IEEE Spectrum
View Full Article
Abstracts Copyright © 2020 SmithBucklin, Washington, DC, USA
No entries found