Tag-Retrieve-Compose-Synthesize (TReCS)
Classification

Pictures From Words and Gestures

A new system combines verbal descriptions and crude lines to visualize complex scenes. Google researchers led by Jing Yu Koh proposed Tag-Retrieve-Compose-Synthesize (TReCS), a system that generates photorealistic images as users describe what they want to see while mousing around on a blank screen.
2 min read
Sequence related to image processing
Classification

Vision Models Get Some Attention

Self-attention is a key element in state-of-the-art language models, but it struggles to process images because its memory requirement rises rapidly with the size of the input. New research addresses the issue with a simple twist on a convolutional neural network.
2 min read
Neighborhood being monitored by AI-powered cameras
Classification

Partners in Surveillance

Police are increasingly able to track motor vehicles throughout the U.S. using a network of AI-powered cameras, many owned by civilians. Flock, which sells automatic license plate readers, is encouraging enforcers to use its network to monitor cars and trucks outside their jurisdiction.
2 min read
Facebook like and dislike buttons
Classification

Social Engagement vs. Social Good

Facebook’s management obstructed the architect of its recommendation algorithms from mitigating their negative social impact. The social network focused on reining in algorithmic bias against particular groups of users at the expense of efforts to reduce disinformation.
2 min read
Graphs and data related to ReLabel, a technique that labels any random crop of any image.
Classification

Good Labels for Cropped Images

In training an image recognition model, it’s not uncommon to augment the data by cropping original images randomly. But if an image contains several objects, a cropped version may no longer match its label. Researchers developed a way to make sure random crops are labeled properly.
2 min read
Model predicting ingredients in a recipe and woman cooking
Classification

Cake + Cookie = Cakie

AI may help revolutionize the human diet – or dessert, at least. What’s new: Google applied AI engineer Dale Markowitz and developer advocate Sara Robinson trained a model to predict whether a recipe is a
1 min read
System Oscar+ working
Classification

Sharper Eyes For Vision+Language

Models that interpret the interplay of words and images tend to be trained on richer bodies of text than images. Recent research worked toward giving such models a more balanced knowledge of the two domains.
2 min read
Person in wheelchair, person in side profile, person wearing a hoodie
Classification

Human Disabilities Baffle Algorithms

Facebook’s content moderation algorithms block many advertisements aimed at disabled people. The social media platform’s automated systems regularly reject ads for clothing designed for people with physical disabilities.
2 min read
Series of images showing a variety of medical AI products
Classification

Medical AI’s Hidden Data

U.S. government approval of medical AI products is on the upswing, but information about how such systems were built is largely unavailable. The U.S. Food and Drug Administration (FDA) has approved a plethora of AI-driven medical systems.
2 min read
Gun detecting system working and alerting the police
Classification

Draw a Gun, Trigger an Algorithm

Computer vision is alerting authorities the moment someone draws a gun. Several companies offer deep learning systems that enable surveillance cameras to spot firearms and quickly notify security guards or police.
1 min read
AI-generated images with the model DALL-E
Classification

Tell Me a Picture

Two new models show a surprisingly sharp sense of the relationship between words and images. OpenAI, the for-profit research lab, announced a pair of models that have produced impressive results in multimodal learning: DALL·E.
2 min read
Rebag app working on a cellphone
Classification

How Much For That Vintage Gucci?

Computer vision is helping people resell their used designer handbags. Rebag, a resale company for luxury handbags, watches, and jewelry, launched Clair AI, an app that automatically appraises second-hand bags from brands like Gucci, Hermes, and Prada.
1 min read
Covid Fast Fax operating
Classification

The Fax About Tracking Covid

A pair of neural networks is helping to prioritize Covid-19 cases for contact tracing. The public health department of California’s Contra Costa County is using deep learning to sort Covid-19 cases reported via the pre-internet technology known as fax.
2 min read
Series of images describing how an AI-powered collar for elephants operates
Classification

Algorithms For Elephants

An AI-powered collar may help protect wild elephants from poachers, hunters, and other hostile humans. Ten ElephantEdge wireless tracking collars will be fitted onto African elephants next year, TechCrunch reported.
1 min read
Animation alternating sad and happy emojis
Classification

Online Clues to Mental Illness

Can social media posts reveal early signs of mental illness? A new machine learning model shows promising results. Researchers developed a model that analyzes messages and images posted by Facebook users for indicators of psychological problems.
2 min read
