Smartphone app called Dieta

Deep Doo-Doo: AI App Diagnoses Poop Better Than People

People who suffer from gastrointestinal conditions such as irritable bowel syndrome are number two when it comes to describing the characteristics of their own poop.
Metaverse illustration with Meta AI product names

Meta Decentralizes AI Effort: Meta Restructures its AI Research Teams

The future of Big AI may lie with product-development teams. Meta reorganized its AI division. Henceforth, AI teams will report to departments that develop key products.
The framework of Virtual Outlier Synthesis (VOS)

Right-Sizing Confidence: Object Detector Lowers Confidence for Unfamiliar Inputs

An object detector trained exclusively on urban images might mistake a moose for a pedestrian and express high confidence in its poor judgment. New work enables object detectors, and potentially other neural networks, to lower their confidence when they encounter unfamiliar inputs.
Illustration of a robot with a captain costume

Neural Networks: Find the Function — A Basic Introduction to Neural Networks

Let’s get this out of the way: A brain is not a cluster of graphics processing units, and if it were, it would run software far more complex than the typical artificial neural network. Yet neural networks were inspired by the brain’s architecture.
Gato’s performance on simulated control tasks | Image captions generated by Gato

One Model, Hundreds of Tasks: Multimodal Transformer Performs Over 600 Different Tasks

Researchers took a step toward achieving a longstanding goal: One model that performs a whole lot of very different tasks. Scott Reed, Konrad Żołna, Emilio Parisotto and a team at DeepMind announced Gato.
Architecture of CXV

Upgrade for Vision Transformers: Improved Efficiency for Vision Transformers

Vision Transformer and models like it use a lot of computation and memory when processing images. New work modifies these architectures to run more efficiently while adopting helpful properties from convolutions.
Robot nurse helper

Nurse’s Mechanical Helper: Inside the Moxi AI-Powered Hospital Robot

Hospitals are using robots to lighten the load on clinical staff. A number of U.S. hospitals are using Moxi, a robot from Diligent Robotics, to ferry supplies, lab specimens, soiled laundry, and other items.
Shifted Patch Tokenization (SPT) | Locality Self-Attention (LSA)

Less Data for Vision Transformers: Boosting Vision Transformer Performance with Less Data

Vision Transformer (ViT) outperformed convolutional neural networks in image classification, but it required more training data. New work enabled ViT and its variants to outperform other architectures with less training data.
Series of pictures of drivers

The View Through the Windshield: New Zealand Uses Computer Vision to Spot Distracted Drivers

Overhead cameras equipped with computer vision are spotting distracted drivers on the road. A system from Melbourne-based Acusensus alerts police when drivers are engaged in risky activities such as using a cell phone, not wearing a seatbelt, or speeding.
Overview of Graph Hyper Network (GHN-2)

Who Needs Training? Graph neural network selects optimal weights for image tasks.

When you’re training a neural network, it takes a lot of computation to optimize its weights using an iterative algorithm like stochastic gradient descent. Wouldn’t it be great to compute the best parameter values in one pass? A new method takes a substantial step in that direction.
Industrial gauges being placed

Remote Meter Reader: Computer vision tool reads analog gauges at industrial sites.

Industrial gauges are often located on rooftops, underground, or in tight spaces — but they’re not out of reach of computer vision. The Okinawa startup LiLz Gauge provides a system that reads analog gauges and reports their output to a remote dashboard.
The performance of different downstream (DS)

The Limits of Pretraining: More pretraining doesn’t guarantee a better fine-tuned AI.

The higher the accuracy of a pretrained model, the better its performance after fine-tuning, right? Not necessarily. Researchers conducted a meta-analysis of image-recognition experiments and performed some of their own.
Fake face diagram - FaceSynthetics

Fake Faces Are Good Training Data: Synthetic data improves face recognition performance.

Collecting and annotating a dataset of facial portraits is a big job. New research shows that synthetic data can work just as well.
Transformer Architecture

Transformers See in 3D: Using transformers to visualize depth in 2D images.

Visual robots typically perceive the three-dimensional world through sequences of two-dimensional images, but they don’t always know what they’re looking at. For instance, Tesla’s self-driving system has been known to mistake a full moon for a traffic light.
Woman walking through a store scanning codes

Let the Model Choose Your Outfit: Inside Amazon’s AI-powered clothes stores.

Amazon’s first brick-and-mortar clothing store is getting ready to deliver automated outfit recommendations. The ecommerce giant announced plans to open a flagship Amazon Style location at a Los Angeles-area mall this year.