Sequence related to image processing
Language

Vision Models Get Some Attention

Self-attention is a key element in state-of-the-art language models, but it struggles to process images because its memory requirement rises rapidly with the size of the input. New research addresses the issue with a simple twist on a convolutional neural network.
Star Trek actor William Shatner
Language

Star Trek: The Videobot Generation

A digital doppelgänger of Star Trek’s original star will let fans chat with him — possibly well beyond his lifetime. AI startup StoryFile built a lifelike videobot of actor William Shatner, best known for playing Captain James T. Kirk on Star Trek.
Tag-Retrieve-Compose-Synthesize (TReCS)
Language

Pictures From Words and Gestures

A new system combines verbal descriptions and crude lines to visualize complex scenes. Google researchers led by Jing Yu Koh proposed Tag-Retrieve-Compose-Synthesize (TReCS), a system that generates photorealistic images by describing what they want to see while mousing around on a blank screen.
Commercial about The Trevor Lifeline
Language

Chatbots Against Depression

A language model is helping crisis-intervention volunteers practice their suicide-prevention skills. The Trevor Project, a nonprofit organization that operates a 24-hour hotline for LGBTQ youth, uses a “crisis contact simulator” to train its staff in how to talk with troubled teenagers.
Margaret Mitchell, Marian Croak and Timnit Gebru pictured
Language

Google Overhauls Ethical AI Team

Having dismissed two key researchers, Google restructured its efforts in AI ethics. Marian Croak, an accomplished software engineer and vice president of engineering at Google, will lead a new center of expertise in responsible AI, the company announced.
Graph showing information about different transformer models
Language

Transformer Variants Head to Head

The transformer architecture has inspired a plethora of variations. Yet researchers have used a patchwork of metrics to evaluate their performance, making them hard to compare. New work aims to level the playing field.
Model predicting ingredients in a recipe and woman cooking
Language

Cake + Cookie = Cakie

AI may help revolutionize the human diet – or dessert, at least.What’s new: Google applied AI engineer Dale Markowitz and developer advocate Sara Robinson trained a model to predict whether a recipe is a
System Oscar+ working
Language

Sharper Eyes For Vision+Language

Models that interpret the interplay of words and images tend to be trained on richer bodies of text than images. Recent research worked toward giving such models a more balanced knowledge of the two domains.
Different graphs showing switch transformer data
Language

Bigger, Faster Transformers: Google's Switch Transformer uses MoE for Efficient NLP

Performance in language tasks rises with the size of the model — yet, as a model’s parameter count rises, so does the time it takes to render output. New work pumps up the number of parameters without slowing down the network.
Art pieces with subjective commentary regarding their emotional impact
Language

How Art Makes AI Feel

An automated art critic spells out the emotional impact of images. Led by Panos Achlioptas, researchers at Ecole Polytechnique, King Abdullah University, and Stanford University trained a deep learning system to generate subjective interpretations of art.
Series of images showing improvements in a multilingual language translator
Language

Better Zero-Shot Translations

Train a multilingual language translator to translate between Spanish and English and between English and German, and it may be able to translate directly between Spanish and German as well. New work proposes a simple path to better machine translation between languages.
Graphs and data related to visualized tokens (or vokens)
Language

Better Language Through Vision

For children, associating a word with a picture that illustrates it helps them learn the word’s meaning. Research aims to do something similar for machine learning models. Researchers improved a BERT model’s performance on some language tasks by training it on a large dataset of image-word pairs.
GPT-Neo related animation
Language

Language Models Want to Be Free: EleutherAI is Making an Open Source Version of GPT-3

A grassroots research collective aims to make a GPT-3 clone that’s available to everyone. EleutherAI, a loose-knit group of independent researchers, is developing GPT-Neo, an open source, free-to-use version of OpenAI’s gargantuan language model.
Data related to a language model that predicts mutations that would enable infectious viruses
Language

The Language of Viruses

A neural network learned to read the genes of viruses as though they were text. That could enable researchers to page ahead for potentially dangerous mutations. Researchers at MIT trained a language model to predict mutations that would enable infectious viruses to become even more virulent.
Data and graphs related to a new model capable of detecting tremors
Language

Quake Watch

Detecting earthquakes is an important step toward warning surrounding communities that damaging seismic waves may be headed their way. A new model detects tremors and provides clues to their epicenter.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox