CogView home website

Large Language Models for Chinese: A brief overview of the Wu Dao NLP models.

Researchers unveiled competition for the reigning large language model GPT-3. Four models collectively called Wu Dao were described by Beijing Academy of Artificial Intelligence, a research collective funded by the Chinese government, according to Synced Review.
Blurred human faces in different pictures

De-Facing ImageNet: Researchers blur all faces in ImageNet.

ImageNet now comes with privacy protection.What’s new: The team that manages the machine learning community’s go-to image dataset blurred all the human faces pictured in it and tested how models trained on the modified images on a variety of image recognition tasks.
Data related to SElf-supERvised (SEER), an image classifier pretrained on unlabeled images

Pretraining on Uncurated Data: How unlabeled data improved computer vision accuracy.

It’s well established that pretraining a model on a large dataset improves performance on fine-tuned tasks. In sufficient quantity and paired with a big model, even data scraped from the internet at random can contribute to the performance boost.
Data related to a diagnostic advice received from a machine learning model vs a human expert

Would Your Doctor Take AI’s Advice?: Some doctors are skeptical of AI diagnoses.

Some doctors don’t trust a second opinion when it comes from an AI system. A team at MIT and Regensburg University investigated how physicians responded to diagnostic advice they received from a machine learning model versus a human expert.
Sequence showing a training step that uses different perspectives of the same patient to enhance unsupervised pretraining

Same Patient, Different Views: Contrastive pretraining improves medical imaging AI.

When you lack labeled training data, pretraining a model on unlabeled data can compensate. New research pretrained a model three times to boost performance on a medical imaging task.
Tag-Retrieve-Compose-Synthesize (TReCS)

Pictures From Words and Gestures: AI model generates captions as users mouse over images.

A new system combines verbal descriptions and crude lines to visualize complex scenes. Google researchers led by Jing Yu Koh proposed Tag-Retrieve-Compose-Synthesize (TReCS), a system that generates photorealistic images by describing what they want to see while mousing around on a blank screen.
Taxonomy of deep learning architectures using self-attention for visual recognition and images from the COCO dataset

Vision Models Get Some Attention: Researchers add self-attention to convolutional neural nets.

Self-attention is a key element in state-of-the-art language models, but it struggles to process images because its memory requirement rises rapidly with the size of the input. New research addresses the issue with a simple twist on a convolutional neural network.
Neighborhood being monitored by AI-powered cameras

Partners in Surveillance: How police use public cameras to track license plates.

Police are increasingly able to track motor vehicles throughout the U.S. using a network of AI-powered cameras — many owned by civilians. Flock, which sells automatic license plate readers is encouraging enforcers to use its network to monitor cars and trucks outside their jurisdiction.
Facebook like and dislike buttons

Social Engagement vs. Social Good: The builder of Facebook's algorithm talks bias.

Facebook’s management obstructed the architect of its recommendation algorithms from mitigating their negative social impact. The social network focused on reining in algorithmic bias against particular groups of users at the expense of efforts to reduce disinformation.
Graphs and data related to ReLabel, a technique that labels any random crop of any image.

Good Labels for Cropped Images: AI technique adds text labels to random image crops.

In training an image recognition model, it’s not uncommon to augment the data by cropping original images randomly. But if an image contains several objects, a cropped version may no longer match its label. Researchers developed a way to make sure random crops are labeled properly.
Person in wheelchair, person in side profile, person wearing a hoodie

Human Disabilities Baffle Algorithms: Facebook blocked ads aimed at people with disabilities.

Facebook’s content moderation algorithms block many advertisements aimed at disabled people. The social media platform’s automated systems regularly reject ads for clothing designed for people with physical disabilities.
System Oscar+ working

Sharper Eyes For Vision+Language: AI research shows improved image and text matching.

Models that interpret the interplay of words and images tend to be trained on richer bodies of text than images. Recent research worked toward giving such models a more balanced knowledge of the two domains.
Model predicting ingredients in a recipe and woman cooking

Cake + Cookie = Cakie: Google AI creates new dessert recipes.

AI may help revolutionize the human diet – or dessert, at least. Google applied AI engineer Dale Markowitz and developer advocate Sara Robinson trained a model to predict whether a recipe is...
Series of images showing a variety of medical AI products

Medical AI’s Hidden Data: Why many medical AI devices are black boxes.

U.S. government approval of medical AI products is on the upswing — but information about how such systems were built is largely unavailable. The U.S. Food and Drug Administration (FDA) has approved a a plethora of AI-driven medical systems.
Gun detecting system working and alerting the police

Draw a Gun, Trigger an Algorithm: These AI-enabled security cameras automatically ID guns.

Computer vision is alerting authorities the moment someone draws a gun. Several companies offer deep learning systems that enable surveillance cameras to spot firearms and quickly notify security guards or police.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox