A generative adversarial network (GAN)
Generative Modeling

Image Generation Transformed

A recent generative adversarial network (GAN) produced more coherent images using modified transformers that replaced fully connected layers with convolutional layers. A new GAN achieved a similar end using transformers in their original form.
CogView home website

Large Language Models for Chinese

Researchers unveiled competition for the reigning large language model GPT-3. Four models collectively called Wu Dao were described by Beijing Academy of Artificial Intelligence, a research collective funded by the Chinese government, according to Synced Review.
System designed to isolate changes in the pose of a two-dimensional figure

Motion Mapper

In some animated games, different characters can perform the same actions — say, walking, jumping, or casting spells. A new system learned from unlabeled data to transfer such motions from one character to another.
Examples of image generators using GANsformer

Attention for Image Generation

Attention quantifies how each part of one input affects the various parts of another. Researchers added a step that reverses this comparison to produce more convincing images.
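As a rough illustration of the idea, the sketch below computes ordinary scaled dot-product attention in one direction and then again with the roles of the two inputs swapped. It is a minimal toy in plain Python, not the researchers' implementation: the `attend` helper, the `latents` and `regions` names, and the toy vectors are all assumptions made for this example, and learned projections are omitted.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of scores.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attend(queries, keys, values):
    """Scaled dot-product attention.

    Each query row receives a weighted average of the value rows,
    weighted by how strongly the query matches each key.
    """
    scale = math.sqrt(len(keys[0]))
    outputs, weights = [], []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        w = softmax(scores)
        weights.append(w)
        outputs.append(
            [sum(wi * v[d] for wi, v in zip(w, values))
             for d in range(len(values[0]))]
        )
    return outputs, weights

# Hypothetical toy inputs: 2 latent vectors and 3 image-region vectors.
latents = [[1.0, 0.0], [0.0, 1.0]]
regions = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]

# One direction: each image region gathers information from the latents.
regions_updated, region_weights = attend(regions, latents, latents)

# Reversed direction: each latent gathers information from the regions.
latents_updated, latent_weights = attend(latents, regions, regions)
```

Each row of `region_weights` and `latent_weights` sums to 1, so every output vector is a weighted average of the vectors it attends to.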
Recording unmixed

New Life for Old Songs

Neural networks can tease apart the different sounds in musical recordings. Companies and hobbyists are using deep learning to separate voices and instruments in commercial recordings, Wired reported.
Star Trek actor William Shatner

Star Trek: The Videobot Generation

A digital doppelgänger of Star Trek’s original star will let fans chat with him — possibly well beyond his lifetime. AI startup StoryFile built a lifelike videobot of actor William Shatner, best known for playing Captain James T. Kirk on Star Trek.
Tag-Retrieve-Compose-Synthesize (TReCS)

Pictures From Words and Gestures

A new system combines verbal descriptions and crude lines to visualize complex scenes. Google researchers led by Jing Yu Koh proposed Tag-Retrieve-Compose-Synthesize (TReCS), a system that generates photorealistic images as users describe what they want to see while mousing around on a blank screen.
Images generated by a network designed to visualize what goes on in people’s brains while they watch Doctor Who

What the Brain Sees

What’s creepier than images from the sci-fi TV series Doctor Who? Images generated by a network designed to visualize what goes on in people’s brains while they watch the show.
Commercial about The Trevor Lifeline

Chatbots Against Depression

A language model is helping crisis-intervention volunteers practice their suicide-prevention skills. The Trevor Project, a nonprofit organization that operates a 24-hour hotline for LGBTQ youth, uses a “crisis contact simulator” to train its staff in how to talk with troubled teenagers.
Homer Simpson talking to Anakin Skywalker in a clip from Star Wars: The Phantom Menace.

Your Words, Their Voices

Voice clones — the audio counterpart to deepfaked images — are poised to invade popular media and entertainment. Professionals and amateurs alike are using AI to emulate the voices of human actors, Wired reported.
Neural Body, a procedure that generates novel views of a single human character, working

Seeing People From a New Angle

Researchers at institutions including the University of Hong Kong and Cornell University created Neural Body, a procedure that generates novel views of a single human character based on shots from only a few angles.
People in old photos smiling, blinking and turning their heads

Make Your Ancestors Smile

Machine learning is bringing old photos to life. A new service from genealogy company MyHeritage lets users animate their ancestors’ portraits, making them smile, blink, and turn their heads.
Art pieces with subjective commentary regarding their emotional impact

How Art Makes AI Feel

An automated art critic spells out the emotional impact of images. Led by Panos Achlioptas, researchers at Ecole Polytechnique, King Abdullah University, and Stanford University trained a deep learning system to generate subjective interpretations of art.
AI-generated images with the model DALL-E

Tell Me a Picture

Two new models show a surprisingly sharp sense of the relationship between words and images. OpenAI, the for-profit research lab, announced a pair of models that have produced impressive results in multimodal learning: DALL·E and CLIP.
Harry Shum

Harry Shum: Assisted Artistry

In 2021, I envision that the AI community will create more tools to unleash human creativity. AI will help people across the globe to communicate and express emotions and moods in their own unique ways.
