Machine Learning Research

Weather Forecast by GAN

A new deep learning technique increased the precision of short-term rainfall forecasts. What's new: Suman Ravuri, Karel Lenc, Matthew Willson, and colleagues at DeepMind, UK Meteorological Office, University of Exeter,
3 min read

Fine-Tune Your Fine-Tuning

Let’s say you have a pretrained language model and a small amount of data to fine-tune it to answer yes-or-no questions. Should you fine-tune it to classify yes/no or to fill in missing words — both viable approaches that are likely to yield different
3 min read

To Flow or Not to Flow

Networked software is often built using a service-oriented architecture, but networked machine learning applications may be easier to manage using a different programming style.
2 min read

Competitive Coder

Programming is hard. Programming competitions are harder. Yet transformers proved themselves up to the task. What’s new: Yujia Li, David Choi, Junyoung Chung, and a team at DeepMind
2 min read

The Limits of Pretraining

The higher the accuracy of a pretrained model, the better its performance after fine-tuning, right? Not necessarily. What’s new: Samira Abnar and colleagues at Google Research conducted
2 min read

Fake Faces Are Good Training Data

Collecting and annotating a dataset of facial portraits is a big job. New research shows that synthetic data can work just as well. What's new: A team led by Erroll Wood and Tadas Baltrušaitis at Microsoft used
2 min read

Transformers See in 3D

Visual robots typically perceive the three-dimensional world through sequences of two-dimensional images, but they don’t always know what they’re looking at. For instance, Tesla’s self-driving system has been known to mistake a full moon for a
3 min read

High Accuracy at Low Power

Equipment that relies on computer vision while unplugged (mobile phones, drones, satellites, autonomous cars) needs power-efficient models. A new architecture set a record for accuracy per computation.
2 min read
Schematic of 8-bit optimizers via block-wise dynamic quantization

More Learning With Less Memory

Researchers discovered a new way to reduce memory requirements when training large machine learning models. Tim Dettmers and colleagues at the University of Washington released 8-bit optimizers that store gradient statistics as 8-bit values while maintaining the same accuracy.
2 min read
Tax planning model AI Economist

Tax Relief the AI Way

Nothing is certain except death and taxes, the saying goes — but how to make taxes fair and beneficial remains an open question. New research aims to answer it.
2 min read

Why Active Learning Fails

Where labeled training data is scarce, an algorithm can learn to request labels for key examples. While this practice, known as active learning, can supply labeled examples that improve performance in some tasks, it fails in others.
2 min read

Image Transformations Unmasked: CNNs for Vision that Aren't Fooled By Changing Backgrounds

If you change an image by moving its subject within the frame, a well-trained convolutional neural network may not recognize the fundamental similarity between the two versions. New research aims to make CNNs wise to such alterations.
2 min read
Two images showing RETRO Architecture and Gopher (280B) vs State of the Art

Large Language Models Shrink: Gopher and RETRO Prove Lean Language Models Can Push Boundaries

DeepMind released three papers that push the boundaries — and examine the issues — of large language models.
2 min read
Animated illustration shows the model architecture of a graph neural network.

A Deeper Look at Graphs: Graph Neural Networks Work Better With More Layers

New research shows that drastically increasing the number of layers in a graph neural network improves its performance on large datasets.
2 min read
A conversation between a human and an open-domain chatbot.

Long-Haul Chatbot: Facebook Chatbot is Able to Carry on Long Conversations

Facebook released a chatbot that summarizes dialog on the fly and uses the summary to generate further repartee.
2 min read
