ConVIRT

3 Posts

Multimodal AI Takes Off: Multimodal Models, such as CLIP and Dall-E, Are Taking Over AI
ConVIRT

Multimodal AI Takes Off: Multimodal Models, such as CLIP and Dall-E, Are Taking Over AI

While models like GPT-3 and EfficientNet, which work on text and images respectively, are responsible for some of deep learning’s highest-profile successes, approaches that find relationships between text and images made impressive
1 min read
AI-generated images with the model DALL-E
ConVIRT

Tell Me a Picture

Two new models show a surprisingly sharp sense of the relationship between words and images. OpenAI, the for-profit research lab, announced a pair of models that have produced impressive results in multimodal learning: DALL·E.
2 min read
Examples of contrastive learning
ConVIRT

Learning From Words and Pictures

It’s expensive to pay doctors to label medical images, and the relative scarcity of high-quality training examples can make it hard for neural networks to learn features that make for accurate diagnoses.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox