Summarization

13 Posts

Illustration of a person shoveling snow with the help of a flamethrower
Summarization

Language Models; Extended: Language models grew more reliable and less biased in 2022.

Researchers pushed the boundaries of language models to address persistent problems of trustworthiness, bias, and updatability.
Everlaw's clustering feature organizing thousands of documents
Summarization

Order in the Court: Machine Learning Tool from Everlaw Finds Legal Evidence

Machine learning is helping lawyers sift through mountains of documents to find evidence. The legal technology company Everlaw launched a clustering feature that automatically organizes up to 25 million documents for lawyers gathering evidence to be used during a trial.
Screen capture of a Semantic Scholar search with TLDR summaries generated by AI
Summarization

Very Short, Did Read

A new summarization model boils down AI research papers to a single sentence. TLDR from Allen Institute for AI creates at-a-glance summaries of scientific research papers. It’s up and running at Semantic Scholar, a research database, where searches now return its pithy precis.
Talking bubbles inside talking bubbles
Summarization

Bigger is Better

Natural language processing lately has come to resemble an arms race, as the big AI companies build models that encompass ever larger numbers of parameters. Microsoft recently held the record — but not for long.
Richard Socher
Summarization

Richard Socher: Boiling the Information Ocean

Ignorance is a choice in the Internet age. Virtually all of human knowledge is available for the cost of typing a few words into a search box.
Illustration of a fireplace with "Happy holidays" cards in English, Spanish and French
Summarization

Natural Language Processing Models Get Literate: Top NLP Advances in 2019

Earlier language models powered by Word2Vec and GloVe embeddings yielded confused chatbots, grammar tools with middle-school reading comprehension, and not-half-bad translations. The latest generation is so good, some people consider it dangerous.
Automatically generated text summary from FactCC with misleading facts highlighted in different colors.
Summarization

Keeping the Facts Straight: NLP System FactCC Fact Checks Texts

Automatically generated text summaries are becoming common in search engines and news websites. But existing summarizers often mix up facts. For instance, a victim’s name might get switched for the perpetrator’s.
Information about a model for multi-document summarization and question answering
Summarization

Bigger Corpora, Better Answers

Models that summarize documents and answer questions work pretty well with limited source material, but they can slip into incoherence when they draw from a sizeable corpus. Recent work addresses this problem.
Proposed model for abstractive summarization of a scientific article
Summarization

Two Steps to Better Summaries

Summarizing a document using original words is a longstanding problem for natural language processing. Researchers recently took a step toward human-level performance in this task, known as abstractive summarization, as opposed to extractive summarization.
 Proportion of examples covered by number of annotators (sorted by number of annotations)
Summarization

AI Knows Who Labeled the Data

The latest language models are great at answering questions about a given text passage. However, these models are also powerful enough to recognize an individual writer’s style, which can clue them in to the right answers. New research measures such annotator bias in several data sets.
Graph related to Language Model Analysis (LAMA)
Summarization

What Language Models Know

Watson set a high bar for language understanding in 2011, when it famously whipped human competitors in the televised trivia game show Jeopardy! IBM’s special-purpose AI required around $1 billion. Research suggests that today’s best language models can accomplish similar tasks right off the shelf.
Question from an exam
Summarization

Smart Students, Dumb Algorithms: NLP Systems Struggle at Grading Essays

A growing number of companies that sell standardized tests are using natural language processing to assess writing skills. Critics contend that these language models don’t make the grade.
Bert and Ernie from Sesame Street
Summarization

BERT Is Back

Less than a month after XLNet overtook BERT, the pole position in natural language understanding changed hands again. RoBERTa is an improved BERT pretraining recipe that beats its forbear, becoming the new state-of-the-art language model — for the moment.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox