Architecture of vision-language tasks
Classification

One Model for Vision-Language

Researchers have proposed task-agnostic architectures for image classification tasks and language tasks. New work proposes a single architecture for vision-language tasks.
2 min read
X-rays and charts about AI use in radiology
Classification

Radiologists Eye AI

AI lately has achieved dazzling success interpreting X-rays and other medical imagery in the lab. Now it’s catching on in the clinic. Roughly one-third of U.S. radiologists use AI in some form in their work.
2 min read
A new metod for compressing images and yielding better classification
Classification

What Machines Want to See

Researchers typically downsize images for vision networks to accommodate limited memory and accelerate processing. A new method not only compresses images but yields better classification.
1 min read
Drones flying off the coast capturing video of orcas and models analyzing the imagery
Classification

Algorithms for Orcas

A combination of computer vision and drones could help restore dwindling killer whale populations. Researchers at Oregon State University and conservation groups SR3 and Vulcan developed a system that assesses the health of orcas.
1 min read
Original vs processed image checking for leaks on a compressor
Classification

Super-Human Quality Control

A computer vision model, continually trained and automatically updated, can boost quality control in factories. Landing AI, a machine learning platform company led by Andrew Ng, helped a maker of compressors for refrigeration check them for leaks.
2 min read
Tractable app determining the cost of a car's damage
Classification

Wreck Recognition

Automobile insurers are increasingly turning to machine learning models to calculate the cost of car repairs. The pandemic has made it difficult for human assessors to visit vehicles damaged in crashes, so the insurance industry is embracing automation.
1 min read
Diagram showing how Project Debater works
Classification

Up for Debate

IBM’s Watson question-answering system stunned the world in 2011 when it bested human champions of the TV trivia game show Jeopardy! Although the Watson brand has fallen on hard times, the company’s language-processing prowess continues to develop.
2 min read
Voice recognition tool "Bleep" working
Classification

Haters Gonna [Mute]

A new tool aims to let video gamers control how much vitriol they receive from fellow players. Intel announced a voice recognition tool called Bleep that the company claims can moderate voice chat automatically, allowing users to silence offensive language.
2 min read
Operators working with factory machinery
Classification

Who Watches the Welders?

A robot inspector is looking over the shoulders of robot welders. Farm equipment maker John Deere described a computer vision system that spots defective joints, helping to ensure that its heavy machinery leaves the production line ready to roll.
1 min read
CogView home website
Classification

Large Language Models for Chinese

Researchers unveiled competition for the reigning large language model GPT-3. Four models collectively called Wu Dao were described by Beijing Academy of Artificial Intelligence, a research collective funded by the Chinese government, according to Synced Review.
2 min read
Model identifying erroneous labels in popular datasets
Classification

Labeling Errors Everywhere

Key machine learning datasets are riddled with mistakes. Several benchmark datasets are shot through with incorrect labels. On average, 3.4 percent of examples in 10 commonly used datasets are mislabeled and the detrimental impact of such errors rises with model size.
2 min read
Blurred human faces in different pictures
Classification

De-Facing ImageNet

ImageNet now comes with privacy protection.What’s new: The team that manages the machine learning community’s go-to image dataset blurred all the human faces pictured in it and tested how models trained on the modified images on a variety of image recognition tasks.
2 min read
Data related to a diagnostic advice received from a machine learning model vs a human expert
Classification

Would Your Doctor Take AI’s Advice?

Some doctors don’t trust a second opinion when it comes from an AI system. A team at MIT and Regensburg University investigated how physicians responded to diagnostic advice they received from a machine learning model versus a human expert.
1 min read
Data related to SElf-supERvised (SEER), an image classifier pretrained on uncurated, unlabeled images
Classification

Pretraining on Uncurated Data

It’s well established that pretraining a model on a large dataset improves performance on fine-tuned tasks. In sufficient quantity and paired with a big model, even data scraped from the internet at random can contribute to the performance boost.
2 min read
Tag-Retrieve-Compose-Synthesize (TReCS)
Classification

Pictures From Words and Gestures

A new system combines verbal descriptions and crude lines to visualize complex scenes. Google researchers led by Jing Yu Koh proposed Tag-Retrieve-Compose-Synthesize (TReCS), a system that generates photorealistic images by describing what they want to see while mousing around on a blank screen.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox