Data related to SElf-supERvised (SEER), an image classifier pretrained on uncurated, unlabeled images
Classification

Pretraining on Uncurated Data

It’s well established that pretraining a model on a large dataset improves performance on fine-tuned tasks. In sufficient quantity and paired with a big model, even data scraped from the internet at random can contribute to the performance boost.
Data related to diagnostic advice received from a machine learning model vs a human expert
Classification

Would Your Doctor Take AI’s Advice?

Some doctors don’t trust a second opinion when it comes from an AI system. A team at MIT and Regensburg University investigated how physicians responded to diagnostic advice they received from a machine learning model versus a human expert.
Sequence showing a training step that uses different perspectives of the same patient to enhance unsupervised pretraining
Classification

Same Patient, Different Views

When you lack labeled training data, pretraining a model on unlabeled data can compensate. New research pretrained a model three times to boost performance on a medical imaging task.
Sequence related to image processing
Classification

Vision Models Get Some Attention

Self-attention is a key element in state-of-the-art language models, but it struggles to process images because its memory requirement rises rapidly with the size of the input. New research addresses the issue with a simple twist on a convolutional neural network.
Tag-Retrieve-Compose-Synthesize (TReCS)
Classification

Pictures From Words and Gestures

A new system combines verbal descriptions and crude lines to visualize complex scenes. Google researchers led by Jing Yu Koh proposed Tag-Retrieve-Compose-Synthesize (TReCS), a system that generates photorealistic images as users describe what they want to see while mousing around on a blank screen.
Neighborhood being monitored by AI-powered cameras
Classification

Partners in Surveillance

Police are increasingly able to track motor vehicles throughout the U.S. using a network of AI-powered cameras, many owned by civilians. Flock, which sells automatic license plate readers, is encouraging enforcers to use its network to monitor cars and trucks outside their jurisdiction.
Graphs and data related to ReLabel, a technique that labels any random crop of any image.
Classification

Good Labels for Cropped Images

In training an image recognition model, it’s not uncommon to augment the data by cropping original images randomly. But if an image contains several objects, a cropped version may no longer match its label. Researchers developed a way to make sure random crops are labeled properly.
Facebook like and dislike buttons
Classification

Social Engagement vs. Social Good

Facebook’s management prevented the architect of its recommendation algorithms from mitigating their negative social impact. The social network focused on reining in algorithmic bias against particular groups of users at the expense of efforts to reduce disinformation.
Person in wheelchair, person in side profile, person wearing a hoodie
Classification

Human Disabilities Baffle Algorithms

Facebook’s content moderation algorithms block many advertisements aimed at disabled people. The social media platform’s automated systems regularly reject ads for clothing designed for people with physical disabilities.
Model predicting ingredients in a recipe and woman cooking
Classification

Cake + Cookie = Cakie

AI may help revolutionize the human diet – or dessert, at least. What’s new: Google applied AI engineer Dale Markowitz and developer advocate Sara Robinson trained a model to predict whether a recipe is a
System Oscar+ working
Classification

Sharper Eyes For Vision+Language

Models that interpret the interplay of words and images tend to be trained on richer bodies of text than images. Recent research worked toward giving such models a more balanced knowledge of the two domains.
Series of images showing a variety of medical AI products
Classification

Medical AI’s Hidden Data

U.S. government approval of medical AI products is on the upswing, but information about how such systems were built is largely unavailable. The U.S. Food and Drug Administration (FDA) has approved a plethora of AI-driven medical systems.
Gun detecting system working and alerting the police
Classification

Draw a Gun, Trigger an Algorithm

Computer vision is alerting authorities the moment someone draws a gun. Several companies offer deep learning systems that enable surveillance cameras to spot firearms and quickly notify security guards or police.
AI-generated images with the model DALL-E
Classification

Tell Me a Picture

Two new models show a surprisingly sharp sense of the relationship between words and images. OpenAI, the for-profit research lab, announced a pair of models that have produced impressive results in multimodal learning: DALL·E.
Rebag app working on a cellphone
Classification

How Much For That Vintage Gucci?

Computer vision is helping people resell their used designer handbags. Rebag, a resale company for luxury handbags, watches, and jewelry, launched Clair AI, an app that automatically appraises second-hand bags from brands like Gucci, Hermes, and Prada.
