Plot demonstrating the relative sizes of parallel and monolingual examples
Language

Massively Multilingual Translation: Machine Learning Model Trained to Translate 1,000 Languages

Recent work showed that models for multilingual machine translation can increase the number of languages they translate by scraping the web for pairs of equivalent sentences in different languages. A new study radically expanded the language repertoire through training on untranslated web text.
Different logos from companies like OpenAI, Stability.ai, Jasper and the dollar sign
Language

Generating Investment: Generative AI Startups Raise Hundreds of Millions in Funding

The generative gold rush is on. Venture capitalists are betting hundreds of millions of dollars on startups that use AI to generate images, text, and more, Wired reported.
Illustration of Frankenstein connected to many chemical elements inside of a lab
Language

The Black Box Awakens: Confronting the Fear of Self-Aware AI in 2022

AI researchers are starting to see ghosts in their machines. Are they hallucinations, or does a dawning consciousness haunt the trained weights?
Technical components of No Language Left Behind and how they fit together
Language

Massively Multilingual Translation: NLP Model Translates 200 Different Languages

Sentence pairs that have equivalent meanings in different languages — typically used to train machine translation systems — have been available in sufficient quantities for only around 100 languages. New work doubled that number and produced a more capable model.
AI-generated image of Joe Rogan interviewing Steve Jobs
Language

All Synthetic, All the Time: Joe Rogan Meets Steve Jobs in an AI-Generated Podcast

For the debut episode of a new podcast series, Play.ht synthesized a 19-minute interview between the rock-star podcaster and late Apple CEO.
Example of a video produced from a story-like description
Language

Long-Form Videos from Text Stories: Google's Phenaki Generates Long-Form Video from Text

Only a week ago, researchers unveiled a system that generates a few seconds of video based on a text prompt. New work enables a text-to-video system to produce an entire visual narrative from several sentences of text.
Robot with an arm, camera, and gripper handing over a plastic bottle to a person
Language

Parsing Commands Into Actions: NLP Helps Google Robot Understand Spoken Instructions

A new method enables robots to respond helpfully to verbal commands by pairing a natural language model with a repertoire of existing skills.
Different Nvidia cloud-computing services
Language

Chipmaker Boosts AI as a Service: Nvidia Launches Cloud Service for NLP Models

Nvidia, known for chips designed to process AI systems, is providing access to large language models. Nvidia announced early access to NeMo LLM and BioNeMo, cloud-computing services that enable developers to generate text and biological sequences respectively.
Illustration of a laughing robot
Language

Toward Machines That LOL: Scientists Teach a Speech Recognition Robot to Laugh

Even if we manage to stop robots from taking over the world, they may still have the last laugh. Researchers at Kyoto University developed a series of neural networks that enable a robot engaged in spoken conversation to chortle along with its human interlocutor.
Information related to Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model (SERAC)
Language

Update Any Language Model: New Method to Update Pretrained Language Models

The ability to update language models is essential to incorporate new information and correct undesirable behaviors. Previous methods are unwieldy and often fail as the amount of new data increases. New work offers a workaround.
Captures from PromptBase
Language

Prompting DALL·E for Fun and Profit: A marketplace for phrases that produce art in DALL·E, Midjourney, and Stable Diffusion

An online marketplace enables people to buy text prompts designed to produce consistent output from the new generation of text-to-image generators.
Animated chart shows how AI can avoid mistaking an image's subject for its context.
Language

Taming Spurious Correlations: New Technique Helps AI Avoid Classification Mistakes

When a neural network learns image labels, it may confuse a background item for the labeled object. New research avoids such mistakes.
Clip from the movie Fall showing one of the protagonists falling from a radio antenna.
Language

Deepfakes Against Profanity: Film Makers of Fall Used AI to Remove F-Words

The filmmakers used technology from Flawless AI to clean up the language, enabling the film to earn a rating that welcomes younger viewers.
Bloom logo
Language

Large Language Models Unbound: BLOOM is the Largest Open Source NLP Model to Date

A worldwide collaboration produced the biggest open source language model to date. BLOOM is a family of language models built by the BigScience Research Workshop, a collective of over 1,000 researchers from 250 institutions around the globe.
A flowchart shows how a jury learning method reduces annotator bias in machine learning models.
Language

Choose the Right Annotators: Jury-Learning Helps Remove Bias from NLP Models

A new machine learning method attempts to account for biases that may be held by certain subsets of labelers.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox