2 Posts

Inference time procedure for GILL

Text or Images, Input or Output: GILL, an innovative approach to multimodal model training

GPT-4V introduced a large multimodal model that generates text from images and, with help from DALL-E 3, generates images from text. However, OpenAI hasn’t fully explained how it built the system. A separate group of researchers described their own method.
Examples of different uses for GPT-4V

GPT-4 Opens Its Eyes: Early insights into what OpenAI's GPT-4 with Vision can do

Few people have had a chance to try out OpenAI’s GPT-4 with Vision (GPT-4V), but many of those who have played with it expressed excitement. 

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox