Series of images showing some of the findings of the new study by researchers at Stanford’s Human AI Institute
Reinforcement Learning

Weak Foundations Make Weak Models: Foundation AI Models Pass Flaws to Fine-Tuned Variants

A new study examines a major strain of recent research: huge models pretrained on immense quantities of uncurated, unlabeled data and then fine-tuned on a smaller, curated corpus.
Sequence of famous arcade games' scenes
Reinforcement Learning

Solve RL With This One Weird Trick: How to get better performance from reinforcement learning.

The previous state-of-the-art model for playing vintage Atari games took advantage of a number of advances in reinforcement learning (RL). The new champion is a basic RL architecture plus a trick borrowed from image generation.
Forbidden sign over a robot's hand solving a Rubik's Cube
Reinforcement Learning

Bye Bye Bots

The independent research lab OpenAI wowed technology watchers in 2019 with a robotic hand that solved Rubik’s Cube. Now it has disbanded the team that built it. OpenAI cofounder Wojciech Zaremba revealed that OpenAI shuttered its robotics program last October.
A four-legged robot walking over difficult and changing terrain
Reinforcement Learning

Walking the Dog

A reinforcement learning system enabled a four-legged robot to amble over unfamiliar, rapidly changing terrain.
Automated player learning by watching recorded gameplay
Reinforcement Learning

Behavioral Cloning Shootout

Neural networks have learned to play video games like Dota 2 via reinforcement learning by playing for the equivalent of thousands of years (compressed into far less time). In new work, an automated player learned not by playing for millennia but by watching a few days’ worth of recorded gameplay.
A neural network wrote the blueprint for upcoming computer chips
Reinforcement Learning

Computers Making Computers

A neural network wrote the blueprint for upcoming computer chips that will accelerate deep learning itself. Google engineers used a reinforcement learning system to arrange the billions of minuscule transistors in an upcoming version of its Tensor Processing Unit (TPU) chips.
Surgical robots performing different actions
Reinforcement Learning

Medical AI Gets a Grip

Surgical robots perform millions of delicate operations annually under human control. Now they’re getting ready to operate on their own.
Video sequence showing military drones working
Reinforcement Learning

Drones For Defense

Drone startups are taking aim at military customers. As large tech companies have backed away from defense work, startups like Anduril, Shield AI, and Teal are picking up the slack. They’re developing autonomous fliers specifically for military operations.
Graphs and data related to recurrent neural nets (RNNs)
Reinforcement Learning

Performance Guaranteed

Bayes-optimal algorithms always make the best decisions given their training and input, if certain assumptions hold true. New work shows that some neural networks can approach this kind of performance.
Graphs related to world models
Reinforcement Learning

It’s a Small World Model After All

World models, which learn a compressed representation of a dynamic environment like, say, a video game, have delivered top results in reinforcement learning. A new method makes them much smaller.
Ilya Sutskever
Reinforcement Learning

Ilya Sutskever: Fusion of Language and Vision

The past year was the first in which general-purpose models became economically useful. GPT-3, in particular, demonstrated that large language models have surprising linguistic competence and the ability to perform a wide variety of useful tasks.
AI-driven balloon reaching high altitude
Reinforcement Learning

How to Drive a Balloon

Helium balloons that beam internet service to hard-to-serve areas are using AI to navigate amid high-altitude winds. Loon, the Alphabet division that provides wireless internet via polyethylene blimps.
Fighter pilot in action
Reinforcement Learning

Phantom Menace

A fighter pilot battled a true-to-life virtual enemy in midair. In the skies over southern California, an airman pitted his dogfighting skills against an AI-controlled opponent that was projected onto his augmented-reality visor.
Takes from Agence, an interactive VR project
Reinforcement Learning

RL Agents: SOS!

A new multimedia experience lets audience members help artificially intelligent creatures work together to survive. Agence, an interactive virtual reality (VR) project blends audience participation with reinforcement learning to create an experience that’s half film, half video game.
Example of Occupancy Anticipation, a navigation system that predicts unseen obstacles, working
Reinforcement Learning

Guess What Happens Next

New research teaches robots to anticipate what’s coming rather than focusing on what’s right in front of them. Researchers developed Occupancy Anticipation (OA), a navigation system that predicts unseen obstacles in addition to observing those in its field of view.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox