Right-Sizing Confidence
Vision

Right-Sizing Confidence

An object detector trained exclusively on urban images might mistake a moose for a pedestrian and express high confidence in its poor judgment. New work enables object detectors, and potentially other neural networks, to lower their confidence
2 min read
Neural Networks: Find the Function
Vision

Neural Networks: Find the Function

Let’s get this out of the way: A brain is not a cluster of graphics processing units, and if it were, it would run software far more complex than the typical artificial neural network. Yet neural networks were inspired by the brain’s architecture:
3 min read
One Model, Hundreds of Tasks
Vision

One Model, Hundreds of Tasks

Researchers took a step toward achieving a longstanding goal: One model that performs a whole lot of very different tasks.What's new: Scott Reed, Konrad Żołna, Emilio Parisotto and a team at
3 min read
Upgrade for Vision Transformers
Vision

Upgrade for Vision Transformers

Vision Transformer and models like it use a lot of computation and memory when processing images. New work modifies these architectures to run more efficiently while adopting helpful properties from convolutions.
2 min read
Robot nurse helper
Vision

Nurse’s Mechanical Helper

Hospitals are using robots to lighten the load on clinical staff.What’s new: A number of U.S. hospitals are using Moxi, a robot from Diligent Robotics, to ferry supplies, lab specimens, soiled laundry, and other items, Wired
2 min read
Less Data for Vision Transformers
Vision

Less Data for Vision Transformers

Vision Transformer (ViT) outperformed convolutional neural networks in image classification, but it required more training data. New work enabled ViT and its variants to outperform other architectures with less training data.
2 min read
The View Through the Windshield
Vision

The View Through the Windshield

Overhead cameras equipped with computer vision are spotting distracted drivers on the road.What’s new: A system from Melbourne-based Acusensus alerts police when drivers are engaged in risky activities such as using a cell phone
2 min read
Who Needs Training?
Vision

Who Needs Training?

When you’re training a neural network, it takes a lot of computation to optimize its weights using an iterative algorithm like stochastic gradient descent. Wouldn’t it be great to compute the best parameter values in one pass? A new method takes a
3 min read
Remote Meter Reader
Vision

Remote Meter Reader

Industrial gauges are often located on rooftops, underground, or in tight spaces — but they’re not out of reach of computer vision.What’s new: The Okinawa startup LiLz Gauge provides a system that reads
1 min read
Graph
Vision

The Limits of Pretraining

The higher the accuracy of a pretrained model, the better its performance after fine-tuning, right? Not necessarily.What’s new: Samira Abnar and colleagues at Google Research conducted
2 min read
Fake face diagram
Vision

Fake Faces Are Good Training Data

Collecting and annotating a dataset of facial portraits is a big job. New research shows that synthetic data can work just as well.What's new: A team led by Erroll Wood and Tadas Baltrušaitis at Microsoft used
2 min read
Transformer Architecture
Vision

Transformers See in 3D

Visual robots typically perceive the three-dimensional world through sequences of two-dimensional images, but they don’t always know what they’re looking at. For instance, Tesla’s self-driving system has been known to mistake a full moon for a
3 min read
Woman walking on a store scanning codes
Vision

Let the Model Choose Your Outfit

Amazon’s first brick-and-mortar clothing store is getting ready to deliver automated outfit recommendations.What’s new: The ecommerce giant announced plans to open a flagship Amazon Style location at a Los Angeles-area mall this year.
1 min read
Multimodal deep learning model
Vision

AI Versus the Garbage Heap

Amazon reported long-term success using machine learning to shrink its environmental footprint. The online retailer developed a system that fuses product descriptions, images, and structured data to decide how an item should be packed for shipping.
2 min read
Man with gun walking by detector
Vision

Stopping Guns at the Gate

A Major League Baseball stadium will be using computer vision to detect weapons as fans enter.What’s new: A system called Hexwave will look for firearms, knives, and explosives carried by baseball fans who visit Camden Yards, home field of the Baltimore Orioles.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox