Summarization

13 Posts

Everlaw's clustering feature
Summarization

Order in the Court

Machine learning is helping lawyers sift through mountains of documents to find evidence. The legal technology company Everlaw launched a clustering feature that automatically organizes up to 25 million documents for lawyers gathering evidence to be used during a trial.
2 min read
Screen capture of a Semantic Scholar search with TLDR summaries generated by AI
Summarization

Very Short, Did Read

A new summarization model boils down AI research papers to a single sentence. TLDR from Allen Institute for AI creates at-a-glance summaries of scientific research papers. It’s up and running at Semantic Scholar, a research database, where searches now return its pithy precis.
2 min read
Examples and explanation of an automatic headline generation
Summarization

AI Makes Headlines

Which headline was written by a computer? A: FIFA to Decide on 2022 World Cup in March B: Decision in March on 48-team 2022 World Cup, Says Infantino
2 min read
Talking bubbles inside talking bubbles
Summarization

Bigger is Better

Natural language processing lately has come to resemble an arms race, as the big AI companies build models that encompass ever larger numbers of parameters. Microsoft recently held the record — but not for long.
2 min read
Richard Socher
Summarization

Richard Socher: Boiling the Information Ocean

Ignorance is a choice in the Internet age. Virtually all of human knowledge is available for the cost of typing a few words into a search box.
2 min read
Illustration of a fireplace with "Happy holidays" cards in English, Spanish and French
Summarization

Language Models Get Literate

Earlier language models powered by Word2Vec and GloVe embeddings yielded confused chatbots, grammar tools with middle-school reading comprehension, and not-half-bad translations. The latest generation is so good, some people consider it dangerous.
2 min read
Automatically generated text summary
Summarization

Keeping the Facts Straight

Automatically generated text summaries are becoming common in search engines and news websites. But existing summarizers often mix up facts. For instance, a victim’s name might get switched for the perpetrator’s.
2 min read
Information about a model for multi-document summarization and question answering
Summarization

Bigger Corpora, Better Answers

Models that summarize documents and answer questions work pretty well with limited source material, but they can slip into incoherence when they draw from a sizeable corpus. Recent work addresses this problem.
2 min read
Proposed model for abstractive summarization of a scientific article
Summarization

Two Steps to Better Summaries

Summarizing a document using original words is a longstanding problem for natural language processing. Researchers recently took a step toward human-level performance in this task, known as abstractive summarization, as opposed to extractive summarization.
1 min read
 Proportion of examples covered by number of annotators (sorted by number of annotations)
Summarization

AI Knows Who Labeled the Data

The latest language models are great at answering questions about a given text passage. However, these models are also powerful enough to recognize an individual writer’s style, which can clue them in to the right answers. New research measures such annotator bias in several data sets.
2 min read
Graph related to Language Model Analysis (LAMA)
Summarization

What Language Models Know

Watson set a high bar for language understanding in 2011, when it famously whipped human competitors in the televised trivia game show Jeopardy! IBM’s special-purpose AI required around $1 billion. Research suggests that today’s best language models can accomplish similar tasks right off the shelf.
2 min read
Question from an exam
Summarization

Smart Students, Dumb Algorithms

A growing number of companies that sell standardized tests are using natural language processing to assess writing skills. Critics contend that these language models don’t make the grade.
1 min read
Bert and Ernie from Sesame Street
Summarization

BERT Is Back

Less than a month after XLNet overtook BERT, the pole position in natural language understanding changed hands again. RoBERTa is an improved BERT pretraining recipe that beats its forbear, becoming the new state-of-the-art language model — for the moment.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox