Automated player learning by watching recorded gameplay
Reinforcement Learning

Behavioral Cloning Shootout

Neural networks have learned to play video games like Dota 2 via reinforcement learning by playing for the equivalent of thousands of years (compressed into far less time). In new work, an automated player learned not by playing for millennia but by watching a few days’ worth of recorded gameplay.
2 min read
A neural network wrote the blueprint for upcoming computer chips
Reinforcement Learning

Computers Making Computers

A neural network wrote the blueprint for upcoming computer chips that will accelerate deep learning itself. Google engineers used a reinforcement learning system to arrange the billions of minuscule transistors in an upcoming version of its Tensor Processing Unit (TPU) chips.
2 min read
Surgical robots performing different actions
Reinforcement Learning

Medical AI Gets a Grip

Surgical robots perform millions of delicate operations annually under human control. Now they’re getting ready to operate on their own.
2 min read
Video sequence showing military drones working
Reinforcement Learning

Drones For Defense

Drone startups are taking aim at military customers. As large tech companies have backed away from defense work, startups like Anduril, Shield AI, and Teal are picking up the slack. They’re developing autonomous fliers specifically for military operations.
2 min read
Graphs and data related to recurrent neural nets (RNNs)
Reinforcement Learning

Performance Guaranteed

Bayes-optimal algorithms always make the best decisions given their training and input, if certain assumptions hold true. New work shows that some neural networks can approach this kind of performance.
2 min read
Graphs related to world models
Reinforcement Learning

It’s a Small World Model After All

World models, which learn a compressed representation of a dynamic environment like, say, a video game, have delivered top results in reinforcement learning. A new method makes them much smaller.
2 min read
Ilya Sutskever
Reinforcement Learning

Ilya Sutskever: Fusion of Language and Vision

The past year was the first in which general-purpose models became economically useful. GPT-3, in particular, demonstrated that large language models have surprising linguistic competence and the ability to perform a wide variety of useful tasks.
2 min read
AI-driven balloon reaching high altitude
Reinforcement Learning

How to Drive a Balloon

Helium balloons that beam internet service to hard-to-serve areas are using AI to navigate amid high-altitude winds. Loon, the Alphabet division that provides wireless internet via polyethylene blimps.
2 min read
Fighter pilot in action
Reinforcement Learning

Phantom Menace

A fighter pilot battled a true-to-life virtual enemy in midair. In the skies over southern California, an airman pitted his dogfighting skills against an AI-controlled opponent that was projected onto his augmented-reality visor.
2 min read
Takes from Agence, an interactive VR project
Reinforcement Learning

RL Agents: SOS!

A new multimedia experience lets audience members help artificially intelligent creatures work together to survive. Agence, an interactive virtual reality (VR) project blends audience participation with reinforcement learning to create an experience that’s half film, half video game.
2 min read
Example of Occupancy Anticipation, a navigation system that predicts unseen obstacles, working
Reinforcement Learning

Guess What Happens Next

New research teaches robots to anticipate what’s coming rather than focusing on what’s right in front of them. Researchers developed Occupancy Anticipation (OA), a navigation system that predicts unseen obstacles in addition to observing those in its field of view.
2 min read
Different chess moves
Reinforcement Learning

Chess: The Next Move

AI has humbled human chess masters. Now it’s helping them take the game to the next level. DeepMind and retired chess champion Vladimir Kramnik trained AlphaZero, a reinforcement learning model that bested human experts in chess, Go, and Shogi, to play-test changes in the rules.
1 min read
Data related to experience replay
Reinforcement Learning

Experience Counts

If the world changes every second and you take a picture every 10 seconds, you won’t have enough pictures to observe the changes clearly, and storing a series of pictures won’t help. On the other hand, if you take a picture every tenth of a second, then storing a history will help model the world.
2 min read
Sequence of an autonomous fighter pilot
Reinforcement Learning

AI Versus Ace

An autonomous fighter pilot shot down a human aerial ace in virtual combat. Built by defense contractor Heron Systems, the system also defeated automated rivals from seven other companies to win the AlphaDogfight trial.
2 min read
Series of pictures of people smiling
Reinforcement Learning

Deepfakes for Good

A strategy manifesto from one of China’s biggest tech companies declares, amid familiar visions of ubiquitous AI, that deepfakes are more boon than bane.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox