Smartphone app called Dieta
Vision

Deep Doo-Doo: AI App Diagnoses Poop Better Than People

People who suffer from gastrointestinal conditions such as irritable bowel syndrome are number two when it comes to describing the characteristics of their own poop.
Metaverse illustration with Meta AI product names

Meta Decentralizes AI Effort: Meta Restructures its AI Research Teams

The future of Big AI may lie with product-development teams. Meta reorganized its AI division. Henceforth, AI teams will report to departments that develop key products.
The framework of Virtual Outlier Synthesis (VOS)

Right-Sizing Confidence: Object Detector Lowers Confidence for Unfamiliar Inputs

An object detector trained exclusively on urban images might mistake a moose for a pedestrian and express high confidence in its poor judgment. New work enables object detectors, and potentially other neural networks, to lower their confidence when they encounter unfamiliar inputs.
Illustration of a robot with a captain costume

Neural Networks: Find the Function — A Basic Introduction to Neural Networks

Let’s get this out of the way: A brain is not a cluster of graphics processing units, and if it were, it would run software far more complex than the typical artificial neural network. Yet neural networks were inspired by the brain’s architecture.
Gato’s performance on simulated control tasks | Image captions generated by Gato

One Model, Hundreds of Tasks: Multimodal Transformer Performs Over 600 Different Tasks

Researchers took a step toward achieving a longstanding goal: One model that performs a whole lot of very different tasks. Scott Reed, Konrad Żołna, Emilio Parisotto and a team at DeepMind announced Gato.
Architecture of CXV

Upgrade for Vision Transformers: Improved Efficiency for Vision Transformers

Vision Transformer and models like it use a lot of computation and memory when processing images. New work modifies these architectures to run more efficiently while adopting helpful properties from convolutions.
Robot nurse helper

Nurse’s Mechanical Helper: Inside the Moxi AI-Powered Hospital Robot

Hospitals are using robots to lighten the load on clinical staff. A number of U.S. hospitals are using Moxi, a robot from Diligent Robotics, to ferry supplies, lab specimens, soiled laundry, and other items.
Shifted Patch Tokenization (SPT) | Locality Self-Attention (LSA)

Less Data for Vision Transformers: Boosting Vision Transformer Performance with Less Data

Vision Transformer (ViT) outperformed convolutional neural networks in image classification, but it required more training data. New work enabled ViT and its variants to outperform other architectures with less training data.
Series of pictures of drivers

The View Through the Windshield: New Zealand Uses Computer Vision to Spot Distracted Drivers

Overhead cameras equipped with computer vision are spotting distracted drivers on the road. A system from Melbourne-based Acusensus alerts police when drivers are engaged in risky activities such as using a cell phone, not wearing a seatbelt, or speeding.
Overview of Graph Hyper Network (GHN-2)

Who Needs Training? Graph neural network selects optimal weights for image tasks.

When you’re training a neural network, it takes a lot of computation to optimize its weights using an iterative algorithm like stochastic gradient descent. Wouldn’t it be great to compute the best parameter values in one pass? A new method takes a substantial step in that direction.
Industrial gauges being placed

Remote Meter Reader: Computer vision tool reads analog gauges at industrial sites.

Industrial gauges are often located on rooftops, underground, or in tight spaces — but they’re not out of reach of computer vision. The Okinawa startup LiLz Gauge provides a system that reads analog gauges and reports their output to a remote dashboard.
The performance of different downstream (DS)

The Limits of Pretraining: More pretraining doesn't guarantee a better fine-tuned AI.

The higher the accuracy of a pretrained model, the better its performance after fine-tuning, right? Not necessarily. Researchers conducted a meta-analysis of image-recognition experiments and performed some of their own.
Fake face diagram - FaceSynthetics

Fake Faces Are Good Training Data: Synthetic data improves face recognition performance.

Collecting and annotating a dataset of facial portraits is a big job. New research shows that synthetic data can work just as well.
Transformer Architecture

Transformers See in 3D: Using transformers to visualize depth in 2D images.

Visual robots typically perceive the three-dimensional world through sequences of two-dimensional images, but they don’t always know what they’re looking at. For instance, Tesla’s self-driving system has been known to mistake a full moon for a traffic light.
Woman walking in a store scanning codes

Let the Model Choose Your Outfit: Inside Amazon's AI-powered clothes stores.

Amazon’s first brick-and-mortar clothing store is getting ready to deliver automated outfit recommendations. The ecommerce giant announced plans to open a flagship Amazon Style location at a Los Angeles-area mall this year.