A living room made out of cups of coffee: the people, the seats, the chimney, the lamp, all gather around a cozy fire.
Language

One Architecture to Do Them All: Transformer: The AI architecture that can do it all.

The transformer architecture extended its reach to a variety of new domains.What happened: Originally developed for natural language processing, transformers are becoming the Swiss Army Knife of deep learning.
An illustration shows a cozy cabin where all the furniture is made out of coffee mugs.
Language

Transformers Take Over: Transformers Applied to Vision, Language, Video, and More

In 2021, transformers were harnessed to discover drugs, recognize speech, and paint pictures — and much more.
Illustration of a woman riding a sled
Language

Multimodal AI Takes Off: Multimodal Models, such as CLIP and DALL·E, are taking over AI.

While models like GPT-3 and EfficientNet, which work on text and images respectively, are responsible for some of deep learning’s highest-profile successes, approaches that find relationships between text and images made impressive
Illustration of giant Christmas tree in a town plaza
Language

Trillions of Parameters: Are AI models with trillions of parameters the new normal?

The trend toward ever-larger models crossed the threshold from immense to ginormous. Google kicked off 2021 with Switch Transformer, the first published work to exceed a trillion parameters, weighing in at 1.6 trillion.
Two images showing RETRO Architecture and Gopher (280B) vs State of the Art
Language

Large Language Models Shrink: Gopher and RETRO prove lean language models can push boundaries.

DeepMind released three papers that push the boundaries — and examine the issues — of large language models.
A conversation between a human and an open-domain chatbot.
Language

Long-Haul Chatbot: Facebook Chatbot is Able to Carry on Long Conversations

Facebook released a chatbot that summarizes dialog on the fly and uses the summary to generate further repartee.
Animated image showing different Zillow listings
Language

Price Prediction Turns Perilous: How Covid Broke Zillow's Pricing Algorithm

The real-estate website Zillow bought and sold homes based on prices estimated by an algorithm — until Covid-19 confounded the model’s predictive power. Zillow, whose core business is providing real-estate information for prospective buyers, shut down its house-flipping division after...
A graph shows the cost in dollars of training large natural language processing models.
Language

Who Can Afford to Train AI?: Cost of AI is Too Expensive for Many Small Companies

The cost of training top-performing machine learning models has grown beyond the reach of smaller companies.
Example comparing a nonaugmented model (left) to a model with internet-augmentation (right)
Language

This Chatbot Does Its Research: Facebook Chatbot Uses the Internet to Inform its Answers

Chatbots often respond to human input with incorrect or nonsensical answers. Why not enable them to search for helpful information?
Animation showing GPT-3 in full action
Language

GPT-3 for All: GPT-3 NLP Model is Available for Select Azure Users

Microsoft is making GPT-3 available to selected customers through its Azure cloud service.
Animation showing how MERLOT is able to match contextualized captions with their corresponding video frames
Language

Richer Video Representations: Pretraining Method Improves AI's Ability to Understand Video

To understand a movie scene, viewers often must remember or infer previous events and extrapolate potential consequences. New work improved a model’s ability to do the same.
First image showing the Google Tensor chip. Second image showing the Google Pixel 6 phone
Language

Competition Heats Up in Mobile AI: Google Designed Its Own Tensor AI Chip for Smartphones

Google designed its own AI chip for its new smartphone — a snub to Qualcomm, the dominant chip vendor in Android phones. What’s new: Google debuted the Tensor chip last week
Illustration showing a witch cooking a copy of the Mona Lisa wearing a witch hat)
Language

Artistry Is Obsolete: Is AI Making Human Artists Obsolete?

Is human creativity being replaced by the synthetic equivalent? The fear: AI is cranking out increasingly sophisticated visual, musical, and literary works. AI-generated media will flood the market, squeezing out human artists and depriving the world of their creativity.
Halloween family portrait showing the inheritance of some spooky characteristics
Language

New Models Inherit Old Flaws: AI Models May Inherit Flaws From Previous Systems

Is AI becoming inbred? The fear: The best models increasingly are fine-tuned versions of a small number of so-called foundation models that were pretrained on immense quantities of data scraped from the web.
Illustration of Thumbzilla destroying a city and shooting lightning from its mouth (T-Rex with Facebook thumbs up)
Language

Don’t Be Evil: What if AI Enables Corporations to Become Truly Evil?

Tech companies generally try to be (or to appear to be) socially responsible. Would some rather let AI’s negative impacts slide?

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox