A chorus of reindeer singing carols from a Reindeer Holiday Songbook on a snowy night
Generative Modeling

Voices for the Voiceless: Generative AI Models Are Creating Voices for Hollywood and Video Games

Musicians and filmmakers adopted AI as a standard part of the audio-production toolbox. What happened: Professional media makers embraced neural networks that generate new sounds and modify old ones. Voice actors bristled.
2 min read
An illustration shows a cozy cabin where all the furniture is made out of coffee mugs.
Generative Modeling

Transformers Take Over: Transformers Applied to Vision, Language, Video, and More

In 2021, transformers were harnessed to discover drugs, recognize speech, and paint pictures — and much more.
2 min read
Generative Modeling

Multimodal AI Takes Off: Multimodal Models, such as CLIP and Dall-E, Are Taking Over AI

While models like GPT-3 and EfficientNet, which work on text and images respectively, are responsible for some of deep learning’s highest-profile successes, approaches that find relationships between text and images made impressive
1 min read
Illustration showing a witch cooking a copy of the Mona Lisa wearing a witch hat
Generative Modeling

Artistry Is Obsolete: Is AI Making Human Artists Obsolete?

Is human creativity being replaced by the synthetic equivalent? The fear: AI is cranking out increasingly sophisticated visual, musical, and literary works. AI-generated media will flood the market, squeezing out human artists and depriving the world of their creativity.
2 min read
GIF showing an orchestra playing music
Generative Modeling

Roll Over, Beethoven: AI Completes Beethoven's 10th Symphony

Ludwig van Beethoven died before he completed what would have been his tenth and final symphony. A team of computer scientists and music scholars approximated the music that might have been.
2 min read
Animation showing image-to-image style transfer — mapping process
Generative Modeling

AI With a Sense of Style: Style Transfer Method Produces Consistent Output in Successive Frames

The process known as image-to-image style transfer — mapping, say, the character of a painting’s brushstrokes onto a photo — can render inconsistent results. When they apply the styles of different artists to the same target
3 min read
Sequence of famous arcade games' scenes
Generative Modeling

Solve RL With This One Weird Trick

The previous state-of-the-art model for playing vintage Atari games took advantage of a number of advances in reinforcement learning (RL). The new champion is a basic RL architecture plus a trick borrowed from image generation.
2 min read
Graphs showing information about an AI system named as the inventor of a food container with unique properties
Generative Modeling

Invented By AI

An algorithm received a patent for its invention. What’s new: South Africa’s intellectual property office issued a patent that names an AI system as the inventor of a food container.
1 min read
Detection of a digitally altered image of a frog holding a violin
Generative Modeling

Fighting Fakes

A supergroup of machine learning models flags manipulated photos. Jigsaw, a tech incubator owned by Alphabet, released a system that detects digitally altered images.
1 min read
Frozen Pretrained Transformer (FPT) explained
Generative Modeling

Transformers: Smarter Than You Think

The transformer architecture has shown an uncanny ability to model not only language but also images and proteins. New research found that it can apply what it learns from the first domain to the others.
2 min read
Series of images showing how single trained network generates 3D reconstructions of multiple scenes
Generative Modeling

One Network, Many Scenes

To reconstruct the 3D world behind a set of 2D images, machine learning systems usually require a dedicated neural network for each scene. New research enables a single trained network to generate 3D reconstructions of multiple scenes.
2 min read
Series of AI generated imagery
Generative Modeling

CLIP Art

Creative engineers are combining deep learning systems to produce a groundswell of generated imagery. Researchers, hackers, and artists are producing new works by pairing CLIP, a pretrained image classifier, with a generative adversarial network (GAN).
1 min read
AI generated videos and VideoGPT training pipeline
Generative Modeling

Synthetic Videos on the Double

Using a neural network to generate realistic videos takes a lot of computation. New work performs the task efficiently enough to run on a beefy personal computer.
2 min read
Neural networks generating novel views of a 3D scene based on existing pictures
Generative Modeling

3D Scene Synthesis for the Real World

Researchers have used neural networks to generate novel views of a 3D scene based on existing pictures plus the positions and angles of the cameras that took them. In practice, though, you may not know the precise camera
2 min read
FastNeRF accelerates the photorealistic 3D rendering method
Generative Modeling

Virtual Reality in Real Time

Ideally, real-time 3D applications such as virtual and augmented reality transition smoothly between different viewpoints of a scene — but generating a fresh perspective can take time. New research speeds the process.
2 min read
