Tree farm dataset
Language

Representing the Underrepresented

Some of deep learning’s bedrock datasets came under scrutiny as researchers combed them for built-in biases. Researchers found that popular datasets impart biases against socially marginalized groups to trained models due to the ways the datasets were compiled, labeled, and used.
2 min read
Two reindeers with masks on a snowy night
Language

Coping With Covid

AI accelerated the search for a coronavirus vaccine, detected Covid-19 cases, and otherwise softened the pandemic’s blow. Machine learning researchers worldwide scrambled to harness the technology against the coronavirus.
2 min read
Transkribus transcribing centuries-old letters and manuscripts
Language

Written by Quill, Read by Computer

The secrets of history are locked in troves of handwritten documents. Now a machine learning platform is making them amenable to digital search. Transkribus is transcribing centuries-old records en masse and making them available to scholars worldwide.
1 min read
Data showing how new pretrained language models might learn facts like weight and cost
Language

The Measure of a Muppet

The latest pretrained language models have shown a remarkable ability to learn facts. A new study drills down on issues of scale, showing that such models might learn the approximate weight of a dog or cost of an apple, at least to the right order of magnitude.
2 min read
Examples of contrastive learning
Language

Learning From Words and Pictures

It’s expensive to pay doctors to label medical images, and the relative scarcity of high-quality training examples can make it hard for neural networks to learn features that make for accurate diagnoses.
2 min read
Screen capture of a Semantic Scholar search with TLDR summaries generated by AI
Language

Very Short, Did Read

A new summarization model boils down AI research papers to a single sentence. TLDR from Allen Institute for AI creates at-a-glance summaries of scientific research papers. It’s up and running at Semantic Scholar, a research database, where searches now return its pithy precis.
2 min read
Data related to Nvidia's Pay Attention When Required (Par) approach
Language

Selective Attention

Large transformer networks work wonders with natural language, but they require enormous amounts of computation. New research slashes processor cycles without compromising performance.
1 min read
Proof Search Tree
Language

The Proof Is in the Network

OpenAI’s Generative Pre-Trained Transformer (GPT) architecture has created coherent essays, images, and code. Now it generates mathematical proofs as well.
2 min read
AI medical chatbot having a conversation with a patient
Language

GPT-3 Is No MD

The world’s most sophisticated language model won’t replace your doctor anytime soon. Researchers at Nabla, an AI-enabled healthcare platform, found that GPT-3 lacks the logical reasoning skills to be a useful medical chatbot.
1 min read
Example of disinformation detection system working on a news article about Syria
Language

Propaganda Watch

The U.S. military enlisted natural language processing to combat disinformation. Primer, a San Francisco startup, is developing a system for the Department of Defense that sifts through news, social media, research, and reports to spot propaganda campaigns.
1 min read
Illustration of two witches with half a pumpkin each and the moon behind them
Language

The AI Community Splinters

Will international rivalries fragment international cooperation in machine learning? Countries competing for AI dominance will lash out at competitors.
2 min read
Screen captures of online platform Dynabench
Language

Dynamic Benchmarks

Benchmarks provide a scientific basis for evaluating model performance, but they don’t necessarily map well to human cognitive abilities. Facebook aims to close the gap through a dynamic benchmarking method that keeps humans in the loop.
2 min read
Alexa device and information about its new skill called natural turn-talking
Language

Alexa, Read My Lips

Amazon’s digital assistant is using its eyes as well as its ears to figure out who’s talking. At its annual hardware showcase, Amazon introduced an Alexa skill that melds acoustic, linguistic, and visual cues to help the system keep track of individual speakers and topics of conversation.
1 min read
Graphs with data related to Microsoft's library DeepSpeed
Language

Toward 1 Trillion Parameters

An open source library could spawn trillion-parameter neural networks and help small-time developers build big-league models. Microsoft upgraded DeepSpeed, a library that accelerates the PyTorch deep learning framework.
2 min read
Bert (muppet) and information related to BERT (transformer-based machine learning technique)
Language

Do Muppets Have Common Sense?

Two years after it pointed a new direction for language models, Bert still hovers near the top of several natural language processing leaderboards. A new study considers whether Bert simply excels at tracking word order or or learns something closer to common sense.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox