Large Multimodal Models

4 Posts


2 Million Tokens of Context & More: Google’s I/O developers’ conference reveals new AI models, features, and upgrades.

Google’s annual I/O developers’ conference brought a plethora of updates and new models. 

Faster, Cheaper Multimodality: All about GPT-4o, OpenAI’s latest multimodal model

OpenAI’s latest model raises the bar for models that can work with common media types in any combination.

Anthropic Ups the Ante: Anthropic introduces Claude 3, a new trio of multimodal models.

Anthropic announced a suite of large multimodal models that set new states of the art in key benchmarks.

Text or Images, Input or Output: GILL, an innovative approach to multimodal model training

With GPT-4V, OpenAI introduced a large multimodal model that generates text from images and, with help from DALL-E 3, generates images from text. However, OpenAI hasn’t fully explained how it built the system. A separate group of researchers described their own method.
