Model predicting ingredients in a recipe and woman cooking
Language

Cake + Cookie = Cakie

AI may help revolutionize the human diet – or dessert, at least.What’s new: Google applied AI engineer Dale Markowitz and developer advocate Sara Robinson trained a model to predict whether a recipe is a
1 min read
System Oscar+ working
Language

Sharper Eyes For Vision+Language

Models that interpret the interplay of words and images tend to be trained on richer bodies of text than images. Recent research worked toward giving such models a more balanced knowledge of the two domains.
2 min read
Different graphs showing switch transformer data
Language

Bigger, Faster Transformers

Performance in language tasks rises with the size of the model — yet, as a model’s parameter count rises, so does the time it takes to render output. New work pumps up the number of parameters without slowing down the network.
2 min read
Art pieces with subjective commentary regarding their emotional impact
Language

How Art Makes AI Feel

An automated art critic spells out the emotional impact of images. Led by Panos Achlioptas, researchers at Ecole Polytechnique, King Abdullah University, and Stanford University trained a deep learning system to generate subjective interpretations of art.
2 min read
Series of images showing improvements in a multilingual language translator
Language

Better Zero-Shot Translations

Train a multilingual language translator to translate between Spanish and English and between English and German, and it may be able to translate directly between Spanish and German as well. New work proposes a simple path to better machine translation between languages.
2 min read
Graphs and data related to visualized tokens (or vokens)
Language

Better Language Through Vision

For children, associating a word with a picture that illustrates it helps them learn the word’s meaning. Research aims to do something similar for machine learning models. Researchers improved a BERT model’s performance on some language tasks by training it on a large dataset of image-word pairs.
2 min read
GPT-Neo related animation
Language

Language Models Want to Be Free

A grassroots research collective aims to make a GPT-3 clone that’s available to everyone. EleutherAI, a loose-knit group of independent researchers, is developing GPT-Neo, an open source, free-to-use version of OpenAI’s gargantuan language model.
1 min read
Data related to a language model that predicts mutations that would enable infectious viruses
Language

The Language of Viruses

A neural network learned to read the genes of viruses as though they were text. That could enable researchers to page ahead for potentially dangerous mutations. Researchers at MIT trained a language model to predict mutations that would enable infectious viruses to become even more virulent.
2 min read
Data and graphs related to a new model capable of detecting tremors
Language

Quake Watch

Detecting earthquakes is an important step toward warning surrounding communities that damaging seismic waves may be headed their way. A new model detects tremors and provides clues to their epicenter.
2 min read
Facebook service describing a photo on Instagram
Language

Every Picture Tells a Story

Facebook expanded a system of vision, language, and speech models designed to open the social network to users who are visually impaired. A Facebook service that describes photos in a synthesized voice now recognizes 1,200 visual concepts — 10 times more than the previous version.
2 min read
Data related to adversarial learning
Language

Adversarial Helper

Models that learn relationships between images and words are gaining a higher profile. New research shows that adversarial learning, usually a way to make models robust to deliberately misleading inputs, can boost vision-and-language performance.
2 min read
AI-generated images with the model DALL-E
Language

Tell Me a Picture

Two new models show a surprisingly sharp sense of the relationship between words and images. OpenAI, the for-profit research lab, announced a pair of models that have produced impressive results in multimodal learning: DALL·E.
2 min read
Animation alternating sad and happy emojis
Language

Online Clues to Mental Illness

Can social media posts reveal early signs of mental illness? A new machine learning model shows promising results. Researchers developed a model that analyzes messages and images posted by Facebook users for indicators of psychological problems.
2 min read
Ilya Sutskever
Language

Ilya Sutskever: Fusion of Language and Vision

The past year was the first in which general-purpose models became economically useful. GPT-3, in particular, demonstrated that large language models have surprising linguistic competence and the ability to perform a wide variety of useful tasks.
2 min read
Harry Shum
Language

Harry Shum: Assisted Artistry

In 2021, I envision that the AI community will create more tools to unleash human creativity. AI will help people across the globe to communicate and express emotions and moods in their own unique ways.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox