ResNeXt

2 Posts

Graphs and data related to visualized tokens (or vokens)

Better Language Through Vision

For children, associating a word with a picture that illustrates it helps them learn the word’s meaning. Researchers applied a similar idea to machine learning models: they improved a BERT model’s performance on some language tasks by training it on a large dataset of image-word pairs.
2 min read
Facebook service describing a photo on Instagram

Every Picture Tells a Story

Facebook expanded a system of vision, language, and speech models designed to open the social network to users who are visually impaired. A Facebook service that describes photos in a synthesized voice now recognizes 1,200 visual concepts — 10 times more than the previous version.
2 min read
