AI-generated images with the model DALL-E
Multimodal

Tell Me a Picture: OpenAI's two new multimodal AI models, CLIP and DALL·E

Two new models show a surprisingly sharp sense of the relationship between words and images. OpenAI, the for-profit research lab, announced a pair of models that have produced impressive results in multimodal learning: DALL·E.
Examples of clothes image-text combo search
Multimodal

That Online Boutique, But Smarter: A summary of Amazon's Visiolinguistic Attention Learning

Why search for “a cotton dress shirt with button-down collar, breast pockets, barrel cuffs, scooped hem, and tortoise shell buttons in grey” when a photo and the words “that shirt, but grey” will do the trick? A new network understands the image-text combo.
Animated drawing of hardware related to AI
Multimodal

Horsepower for Next-Gen Networks: Microsoft built OpenAI a custom AI supercomputer.

The for-profit research organization OpenAI has a new supercomputer to help achieve its dream of building the world’s most sophisticated AI. Microsoft engineered the new hardware network to train immense models on thousands of images, texts, and videos simultaneously.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox