Different logos from companies like OpenAI, Stability.ai, Jasper and the dollar sign
NLP

Generating Investment: Generative AI Startups Raise Hundreds of Millions in Funding

The generative gold rush is on. Venture capitalists are betting hundreds of millions of dollars on startups that use AI to generate images, text, and more, Wired reported.
Technical components of No Language Left Behind and how they fit together
NLP

The Net Speaks in Many Tongues: NLP Model Translates 200 Different Languages

Sentence pairs that have equivalent meanings in different languages — typically used to train machine translation systems — have been available in sufficient quantities for only around 100 languages. New work doubled that number and produced a more capable model.
AI-generated image of Joe Rogan interviewing Steve Jobs
NLP

All Synthetic, All the Time: Joe Rogan Meets Steve Jobs in an AI-Generated Podcast

For the debut episode of a new podcast series, Play.ht synthesized a 19-minute interview between the rock-star podcaster and late Apple CEO.
Example of a video produced from a story-like description
NLP

Long-Form Videos from Text Stories: Google's Phenaki Generates Long-Form Video from Text

Only a week ago, researchers unveiled a system that generates a few seconds of video based on a text prompt. New work enables a text-to-video system to produce an entire visual narrative from several sentences of text.
Bloom logo
NLP

Large Language Models Unbound: BLOOM is the Largest Open Source NLP Model to Date

A worldwide collaboration produced the biggest open source language model to date. BLOOM is a family of language models built by the BigScience Research Workshop, a collective of over 1,000 researchers from 250 institutions around the globe.
A series of graphs show the carbon emissions associated with training AI models.
NLP

Cutting the Carbon Cost of Training: A New Tool Helps NLP Models Lower Their Gas Emissions

You can reduce your model’s carbon emissions by being choosy about when and where you train it.
Everlaw's clustering feature organizing thousands of documents
NLP

Order in the Court: Machine Learning Tool from Everlaw Finds Legal Evidence

Machine learning is helping lawyers sift through mountains of documents to find evidence. The legal technology company Everlaw launched a clustering feature that automatically organizes up to 25 million documents for lawyers gathering evidence to be used during a trial.
Example of text generated by LaMDA
NLP

LaMDA Comes Alive?: Google Engineer Says LaMDA AI is Sentient

A chatbot persuaded at least one person that it has feelings. A senior engineer at Google announced his belief that the company’s latest conversational language model is sentient.
Andrew Ng staring at neural networks
NLP

The Batch: Special Issue! Foundational Algorithms, Where They Came From, Where They're Going

Years ago, I had to choose between a neural network and a decision tree learning algorithm. It was necessary to pick an efficient one, because we planned to apply the algorithm to a very large set of users on a limited compute budget.
Man sitting on a tree with a monkey and a gorilla
NLP

Decision Trees: From Root to Leaves — Decision Trees for Machine Learning Explained

What kind of beast was Aristotle? The philosopher's follower Porphyry, who lived in Syria during the third century, came up with a logical way to answer the question...
Illustration of a robot with a captain costume
NLP

Neural Networks: Find the Function — A Basic Introduction to Neural Networks

Let’s get this out of the way: A brain is not a cluster of graphics processing units, and if it were, it would run software far more complex than the typical artificial neural network. Yet neural networks were inspired by the brain’s architecture.
One person pouring a drink of poison in the company of another person
NLP

Logistic Regression: Follow the Curve — A Basic Introduction to Logistic Regression for Machine Learning

There was a moment when logistic regression was used to classify just one thing: If you drink a vial of poison, are you likely to be labeled “living” or “deceased”? Times have changed.
Graph Average across 14 NLP Tasks parameters versus Average Accuracy
NLP

GPT-Free: Meta Releases Open Source Large Language Models OPT

Itching to get your hands on a fully trained large language model? The wait is over. Meta introduced the OPT family of transformer-based language models with nearly unfettered access to source code and trained weights.
Shifted Patch Tokenization (SPT) | Locality Self-Attention (LSA)
NLP

Less Data for Vision Transformers: Boosting Vision Transformer Performance with Less Data

Vision Transformer (ViT) outperformed convolutional neural networks in image classification, but it required more training data. New work enabled ViT and its variants to outperform other architectures with less training data.
GLaM model architecture
NLP

Efficiency Experts: Mixture of Experts Makes Language Models More Efficient

The emerging generation of trillion-parameter language models take significant computation to train. Activating only a portion of the network at a time can cut the requirement dramatically and still achieve exceptional results.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox