Different chess moves
Reinforcement Learning

Chess: The Next Move

AI has humbled human chess masters. Now it’s helping them take the game to the next level. DeepMind and retired chess champion Vladimir Kramnik trained AlphaZero, a reinforcement learning model that bested human experts in chess, Go, and Shogi, to play-test changes in the rules.
Sequence of an autonomous fighter pilot
Reinforcement Learning

AI Versus Ace

An autonomous fighter pilot shot down a human aerial ace in virtual combat. Built by defense contractor Heron Systems, the system also defeated automated rivals from seven other companies to win the AlphaDogfight trial.
Data related to experience replay
Reinforcement Learning

Experience Counts

If the world changes every second and you take a picture every 10 seconds, you won’t have enough pictures to observe the changes clearly, and storing a series of pictures won’t help. On the other hand, if you take a picture every tenth of a second, then storing a history will help model the world.
Information related to Policy Adaptation during Deployment (PAD)
Reinforcement Learning

Same Job, Different Scenery

People who take driving lessons during daytime don’t need instruction in driving at night. They recognize that the difference doesn’t disturb their knowledge of how to drive. Similarly, a new reinforcement learning method manages superficial variations in the environment without re-training.
Series of pictures of people smiling
Reinforcement Learning

Deepfakes for Good

A strategy manifesto from one of China’s biggest tech companies declares, amid familiar visions of ubiquitous AI, that deepfakes are more boon than bane.
Man with prosthetic leg walking
Reinforcement Learning

AI Steps Up

A prosthetic leg that learns from the user’s motion could help amputees walk more naturally. Researchers from the University of Utah designed a robotic leg that uses machine learning to generate a human-like stride.
Data related to a new reinforcement learning approach
Reinforcement Learning

Eyes on the Prize

When the chips are down, humans can track critical details without being distracted by irrelevancies. New research helps reinforcement learning models similarly focus on the most important details.
Takes from the video game Source of Madness
Reinforcement Learning

Monsters in Motion

How do you control a video game that generates a host of unique monsters for every match? With machine learning, naturally. The otherworldly creatures in Source of Madness learn how to target players through reinforcement learning.
Graphs and data related to Plan2Vec
Reinforcement Learning

Visual Strategies for RL

Reinforcement learning can beat humans at video games, but humans are better at coming up with strategies to master more complex tasks. New work enables neural networks to connect the dots.
Data related to reinforcement learning and optimization of worker productivity and income equality
Reinforcement Learning

Taxation With Vector Representation

Governments have struggled to find a tax formula that promotes prosperity without creating extremes of wealth and poverty. Can machine learning show the way?
Data and information related to Contrastive Unsupervised Representations for Reinforcement Learning (CURL)
Reinforcement Learning

RL and Feature Extraction Combined

Which comes first, training a reinforcement learning model or extracting high-quality features? New work avoids this chicken-or-egg dilemma by doing both simultaneously.
Illustration of a patient in a hospital bed
Reinforcement Learning

Prognosis: Early Warning for Sepsis

An AI-driven alarm system helps rescue patients before infections become fatal. The problem: Machine learning can spot patterns in electronic health data indicating where a patient’s condition is headed that may be too subtle for doctors and nurses to catch.
Illustration of a syringe with red liquid inside
Reinforcement Learning

Treatment: The Elusive Molecule

Will deep learning discover new medicines? Startups — and big-pharma partners — are betting on it. The problem: In theory, there’s a pharmacological cure for just about any ailment. In practice, discovering those therapies takes years and billions of dollars.
Schematic of a typical deep learning workflow
Reinforcement Learning

(Science) Community Outreach

Are your scientist friends intimidated by machine learning? They might be inspired by a primer from one of the world’s premier tech titans. Former Google CEO Eric Schmidt and Cornell PhD candidate Maithra Raghu school scientists in machine learning in a sprawling overview.
Packing robot
Reinforcement Learning

Packing Robots Get a Grip

Robots are moving into a job that traditionally required the human touch. What’s new: A commercial warehouse that ships electrical supplies deployed AI-driven robotic arms from Covariant, a high-profile Silicon Valley robotics firm.
