Multimodal
Synthetic Data Helps Image Generators: OpenAI researchers improved text-to-image prompt following with generated captions.
Text-to-image generators often miss details in text prompts, and sometimes they misunderstand parts of a prompt entirely. Synthetic captions can help them follow prompts more closely.