OpenAI has developed a Minecraft-playing bot that uses a combination of imitation and reinforcement learning to build pixelated tools and buildings that require more than 20,000 consecutive actions.
The bot, trained on 70,000 hours of human gameplay, is the first to build "diamond tools," which take human players 20 minutes and 24,000 actions, on average, to construct.
Imitation learning normally requires each step of the training data to be hand-labeled, but the researchers used a separate neural network to label the gameplay automatically, an approach called Video Pre-Training.
The researchers said the use of imitation and reinforcement learning in combination could pave the way for advancements in self-driving vehicles and nuclear fusion research.
From Popular Science
Abstracts Copyright © 2022 SmithBucklin, Washington, DC, USA