Reinforcement Learning

64 Posts

Google’s Rule-Respecting Chatbot: Research helps AI chatbots be more truthful and less hateful.
Reinforcement Learning

Google’s Rule-Respecting Chatbot: Research helps AI chatbots be more truthful and less hateful.

Amid speculation about the threat posed by OpenAI’s ChatGPT chatbot to Google’s search business, a paper shows how the search giant might address the tendency of such models to produce offensive, incoherent, or untruthful dialog.
Yoshua Bengio teaching
Reinforcement Learning

Yoshua Bengio - Models That Reason: Yoshua Bengio wants neural nets that reason.

Recent advances in deep learning largely have come by brute force: taking the latest architectures and scaling up compute power, data, and engineering. Do we have the architectures we need, and all that remains is to develop better hardware and datasets so we can keep...
Model that defeats KataGo, an open source Go-playing system
Reinforcement Learning

Champion Model Is No Go: Adversarial AI Beats Master KataGo Algorithm

A new algorithm defeated a championship-winning Go model using moves that even a middling human player could counter. Researchers trained a model to defeat KataGo, an open source Go-playing system that has beaten top human players.
Robot with an arm, camera, and gripper handing over a plastic bottle to a person
Reinforcement Learning

Parsing Commands Into Actions: NLP Helps Google Robot Understand Spoken Instructions

A new method enables robots to respond helpfully to verbal commands by pairing a natural language model with a repertoire of existing skills.
Humanized Training for Robot Arms
Reinforcement Learning

Humanized Training for Robot Arms: New Research Improves Robot Performance and Adaptability

Robots trained via reinforcement learning usually study videos of robots performing the task at hand. A new approach used videos of humans to pre-train robotic arms.
Explanation of Multi-Modal Delay Randomization (MMDR)
Reinforcement Learning

Coordinating Robot Limbs: Machine learning improves robot dog reaction time.

A dog doesn’t think twice about fetching a tennis ball, but an autonomous robot typically suffers from delays between perception and action. A new machine-learning model helped a quadruped robot coordinate its sensors and actuators.
Representation of the components of our controller design architecture
Reinforcement Learning

High-Energy Deep Learning: Machine learning helps stabilize nuclear fusion.

Nuclear fusion technology, long touted as an unlimited source of safe, clean energy, took a step toward reality with a machine learning algorithm that molds the fuel in a reactor’s core.
Gran Turismo Sport race
Reinforcement Learning

Fast and Daring Wins the Race: GT Sophy AI model beats humans at Gran Turismo Sport.

Armchair speed demons have a new nemesis. Peter Wurman and a team at Sony developed Gran Turismo Sophy (GT Sophy), a reinforcement learning model that defeated human champions of Gran Turismo Sport, a PlayStation game that simulates auto races right down to tire friction and air resistance.
Tax planning model AI Economist
Reinforcement Learning

Tax Relief the AI Way: AI Economist creates optimal tax rate.

Nothing is certain except death and taxes, the saying goes — but how to make taxes fair and beneficial remains an open question. New research aims to answer it.
Wolfram Burgard
Reinforcement Learning

Wolfram Burgard: Train robots in the real world.

Robots are tremendously useful machines, and I would like to see them applied to every task where they can do some good. Yet we don’t have enough programmers for all this hardware and all these tasks.
Google's Decision Transformer
Reinforcement Learning

Reinforcement Learning Transformed: Transformers succeed at reinforcemend learning tasks.

Transformers have matched or exceeded earlier architectures in language modeling and image classification. New work shows they can achieve state-of-the-art results in some reinforcement learning tasks as well.
Animated chart shows how AI can help robots locate key spatial coordinates.
Reinforcement Learning

Finding Useful Points in Space: Keypoint3D Helps Robots Locate Spatial Coordinates

A new machine learning method aims to improve a machine’s ability to determine and locate points of interest.
Animated video showing an image generator imitating brush strokes
Reinforcement Learning

Different Strokes for Robot Folks: Transformer-Based Image Generator Imitates Painters

A neural network can make a photo resemble a painting via neural style transfer, but it can also learn to reproduce an image by applying brush strokes. A new method taught a system this painterly skill without any training data.
Series of images explaining how the system Eva works
Reinforcement Learning

Who Needs a Covid Test? AI Decides: Greek Border Patrol Used AI to Screen for Covid

Greece’s border agents last year had enough Covid tests to swab only 17 percent of people who sought to enter the country. They managed the shortage by using an AI system to flag high-risk visitors.
Animation showing a simulated football team and how it works
Reinforcement Learning

Team Players: Football-Playing AI Blends Individual and Group Skills

Playing a team sport involves a fluid blend of individual and group skills. Researchers integrated both types of action into realistic humanoid agents that play football (known as soccer in the U.S.).

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox