Illustration of a patient in a hospital bed
Reinforcement Learning

Prognosis — Early Warning for Sepsis: AI can provide an early warning for sepsis.

An AI-driven alarm system helps rescue patients before infections become fatal. The problem: Machine learning can spot patterns in electronic health data indicating where a patient’s condition is headed that may be too subtle for doctors and nurses to catch.
Illustration of a syring with red liquid inside
Reinforcement Learning

Treatment — The Elusive Molecule: How deep learning could speed up drug discovery

Will deep learning discover new medicines? Startups — and big-pharma partners — are betting on it. The problem: In theory, there’s a pharmacological cure for just about any ailment. In practice, discovering those therapies takes years and billions of dollars.
Schematic of a typical deep learning workflow
Reinforcement Learning

(Science) Community Outreach: A survey of machine learning from Eric Schmidt

Are your scientist friends intimidated by machine learning? They might be inspired by a primer from one of the world’s premier tech titans. Former Google CEO Eric Schmidt and Cornell PhD candidate Maithra Raghu school scientists in machine learning in a sprawling overview.
Packing robot
Reinforcement Learning

Packing Robots Get a Grip: This robot arm can handle over 10,000 different objects.

Robots are moving into a job that traditionally required the human touch.What’s new: A commercial warehouse that ships electrical supplies deployed AI-driven robotic arms from Covariant, a high-profile Silicon Valley robotics firm.
Maze action video game Pac-Man
Reinforcement Learning

Two-Way Winner: MuZero AI masters both video games and board games.

AlphaGo Zero demonstrates superhuman performance playing Go, chess, and shogi. Models like R2D2 do the same playing classic Atari titles. A new approach to deep reinforcement learning is the first to achieve state-of-the-art results playing both board and video games.
Yann LeCun
Reinforcement Learning

Yann LeCun — Learning From Observation: The power of self-supervised learning

How is it that many people learn to drive a car fairly safely in 20 hours of practice, while current imitation learning algorithms take hundreds of thousands of hours, and reinforcement learning algorithms take millions of hours? Clearly we’re missing something big.
Chelsea Finn
Reinforcement Learning

Chelsea Finn — Robots That Generalize: Generalization for robotics through reinforcement learning

Many people in the AI community focus on achieving flashy results, like building an agent that can win at Go or Jeopardy. This kind of work is impressive in terms of complexity.
Anima Anandkumar
Reinforcement Learning

Anima Anandkumar — The Power of Simulation: How simulation can be useful for supervised learning

We’ve had great success with supervised deep learning on labeled data. Now it’s time to explore other ways to learn: training on unlabeled data, lifelong learning, and especially letting models explore a simulated environment before transferring what they learn to the real world.
Illustration of a crystal snowball
Reinforcement Learning

Simulation Substitutes for Data: When simulation works wonders with deep learning

The future of machine learning may depend less on amassing ground-truth data than simulating the environment in which a model will operate. Deep learning works like magic with enough high-quality data. When examples are scarce, though, researchers are using simulation to fill the gap.
Information related to Implicit Reinforcement without Interaction at Scale (IRIS)
Reinforcement Learning

Different Skills From Different Demos: Implicit reinforcement without interaction at scale, explained

Reinforcement learning trains models by trial and error. In batch reinforcement learning (BRL), models learn by observing many demonstrations by a variety of actors. But what if one doctor is handier with a scalpel while another excels at suturing?
Observational dropout
Reinforcement Learning

Seeing the World Blindfolded: The observational dropout technique, explained

In reinforcement learning, if researchers want an agent to have an internal representation of its environment, they’ll build and train a world model that it can refer to. New research shows that world models can emerge from standard training, rather than needing to be built separately.
AlphaGo playing Go with Lee Sedol
Reinforcement Learning

Is AI Making Mastery Obsolete?: How human chess and go masters cope with being beaten by AI

Is there any reason to continue playing games that AI has mastered? Ask the former champions who have been toppled by machines.
Comparison between TrXL and GTrXL
Reinforcement Learning

Melding Transformers with RL

Large NLP models like BERT can answer questions about a document thanks to the transformer network, a sequence-processing architecture that retains information across much longer sequences than previous methods. But transformers have had little success in reinforcement learning — until now.
Bipedal robot crossing obstacles
Reinforcement Learning

Survival of the Overfittest

Neuroevolution, which combines neural networks with ideas drawn from Darwin, is gaining momentum. Its advocates claim that they can achieve faster, better results by generating a succession of new models, each slightly different than its predecessors, rather than relying on a purpose-built model.
StarCraft II videogame
Reinforcement Learning

Take That, Humans!

At the BlizzCon gaming convention last weekend, players of the strategy game StarCraft II stood in line to get walloped by DeepMind’s AI. After training for the better part of a year, the bot has become one of the world’s top players.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox