
Screen captures of the Sparrow Chatbot

Google’s Rule-Respecting Chatbot: Research helps AI chatbots be more truthful and less hateful.

Amid speculation about the threat posed by OpenAI’s ChatGPT chatbot to Google’s search business, a paper shows how the search giant might address the tendency of such models to produce offensive, incoherent, or untruthful dialog.
High-level overview of the STEGO architecture at train and prediction steps

Segmented Images, No Labeled Data: Improved unsupervised learning for semantic segmentation

Training a model to separate the objects in a picture typically requires labeled images for best results. Recent work upped the ante for training without labels.
Plot demonstrating the relative sizes of parallel and monolingual examples

Massively Multilingual Translation: Machine Learning Model Trained to Translate 1,000 Languages

Recent work showed that models for multilingual machine translation can increase the number of languages they translate by scraping the web for pairs of equivalent sentences in different languages. A new study radically expanded the language repertoire through training on untranslated web text.
Example of a video produced from a story-like description

Long-Form Videos from Text Stories: Google's Phenaki Generates Long-Form Video from Text

Only a week ago, researchers unveiled a system that generates a few seconds of video based on a text prompt. New work enables a text-to-video system to produce an entire visual narrative from several sentences of text.
Robot with an arm, camera, and gripper handing over a plastic bottle to a person

Parsing Commands Into Actions: NLP Helps Google Robot Understand Spoken Instructions

A new method enables robots to respond helpfully to verbal commands by pairing a natural language model with a repertoire of existing skills.
Satellite imagery of suburban homes with backyard pools.

Spotting Tax Cheats From Overhead: French Authorities Use AI to Tax Pool Owners

French tax authorities netted nearly €10 million using an automated system to identify unregistered pools.
Gif of Google search results shows how the company is minimizing disinformation.

Misinformation Recognition: Google Updated its Search Engine to Minimize Disinformation

Google updated the Multitask Unified Model behind its search algorithm to respond to the flood of misinformation on the web.
Animated graphs showing how an ensemble of fine-tuned models can provide better performance.

Ensemble Models Simplified: New Machine Learning Research Simplifies Ensembles

A CLIP model whose weights were the mean of an ensemble of fine-tuned models performed as well as the ensemble and better than its best-performing constituent.
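The weight-averaging idea can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it assumes every fine-tuned model shares one architecture, so parameters line up by name, and it uses tiny hypothetical weight dictionaries in place of real CLIP checkpoints.

```python
from typing import Dict, List

# A "model" here is just a mapping from parameter name to a flat list of floats.
Weights = Dict[str, List[float]]


def average_weights(models: List[Weights]) -> Weights:
    """Return the element-wise mean of several models' parameters."""
    if not models:
        raise ValueError("need at least one model")
    averaged: Weights = {}
    for name in models[0]:
        # Group the i-th value of this parameter across all models, then average.
        columns = zip(*(m[name] for m in models))
        averaged[name] = [sum(col) / len(models) for col in columns]
    return averaged


# Three hypothetical fine-tuned copies of the same tiny model.
fine_tuned = [
    {"w": [1.0, 2.0], "b": [0.0]},
    {"w": [3.0, 4.0], "b": [3.0]},
    {"w": [2.0, 0.0], "b": [0.0]},
]

soup = average_weights(fine_tuned)
print(soup)
```

The result is a single set of weights, so inference costs the same as running one model, whereas a conventional ensemble runs every constituent and combines their outputs.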
Animated flowcharts show how the ProtCNN AI model classifies proteins.

Protein Families Deciphered: Machine Learning Categorizes Proteins Based on Their Functions

Convolutional neural networks separate proteins into functional families without considering their shapes.
Different video clips showing windmills

Wind in the Forecast: AI Tool Predicts Wind Turbine Energy Output

Machine learning is making wind power more predictable. Engie SA, a multinational energy utility based in France, is the first customer for an AI-powered tool from Google that predicts the energy output of wind farms.
Metaverse illustration with Meta AI product names

Meta Decentralizes AI Effort: Meta Restructures its AI Research Teams

The future of Big AI may lie with product-development teams. Meta reorganized its AI division. Henceforth, AI teams will report to departments that develop key products.
Example of text generated by LaMDA

LaMDA Comes Alive?: Google Engineer Says LaMDA AI is Sentient

A chatbot persuaded at least one person that it has feelings. A senior engineer at Google announced his belief that the company’s latest conversational language model is sentient.
Contentedge screen video capture

Winning The Google Game: 14 Companies Using GPT-3 to Top SEO

AI startups are helping writers tailor articles that appear near the top of Google’s search results. At least 14 companies sell access to software that uses GPT-3, the language model from OpenAI, to generate headlines, product descriptions, blog posts, and video scripts.
Didactic diagram of a hypothetical embedded-model architecture

Image Generation + Probabilities: New Method Boosts Performance for Normalizing Flow

If you want to both synthesize data and find the probability of any given example — say, generate images of manufacturing defects to train a defect detector and identify the highest-probability defects — you may use the architecture known as a normalizing flow.
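To make the "synthesize and score" pairing concrete, here is a minimal sketch of the simplest possible normalizing flow: a one-dimensional affine transform of a standard normal variable. It is an illustrative toy, not the method from the article. The class name `AffineFlow` and its parameters are hypothetical, and real flows stack many learned invertible layers.

```python
import math
import random


class AffineFlow:
    """One-dimensional flow x = scale * z + shift, with z drawn from N(0, 1)."""

    def __init__(self, scale: float, shift: float) -> None:
        if scale == 0:
            raise ValueError("scale must be nonzero so the transform is invertible")
        self.scale = scale
        self.shift = shift

    def sample(self, rng: random.Random) -> float:
        """Generate data: draw from the base distribution, then transform."""
        z = rng.gauss(0.0, 1.0)
        return self.scale * z + self.shift

    def log_prob(self, x: float) -> float:
        """Score data: invert the transform and apply the change-of-variables rule."""
        z = (x - self.shift) / self.scale
        base_log_prob = -0.5 * z * z - 0.5 * math.log(2.0 * math.pi)
        # Subtract log|dx/dz| to account for how the transform stretches space.
        return base_log_prob - math.log(abs(self.scale))


flow = AffineFlow(scale=2.0, shift=1.0)
rng = random.Random(0)
samples = [flow.sample(rng) for _ in range(5)]
print(samples)
print(flow.log_prob(1.0), flow.log_prob(5.0))
```

Because the transform is invertible, the same object both generates examples and assigns each one an exact log-probability, which is the property the teaser describes.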
GLaM model architecture

Efficiency Experts: Mixture of Experts Makes Language Models More Efficient

The emerging generation of trillion-parameter language models takes significant computation to train. Activating only a portion of the network at a time can cut the requirement dramatically and still achieve exceptional results.
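A minimal sketch of how activating only part of a network can work, assuming a top-k gating scheme: a gate scores every expert, and only the k best-scoring experts run. The toy experts, the gate scores, and the choice of k = 2 are all hypothetical and are not taken from the article; in a real model the gate scores come from a learned layer.

```python
import math
from typing import Callable, List, Tuple


def softmax(scores: List[float]) -> List[float]:
    """Convert raw gate scores to probabilities (shifted by the max for stability)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_scores: List[float], k: int) -> List[Tuple[int, float]]:
    """Pick the k highest-scoring experts and renormalize their gate weights."""
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(
    x: float,
    experts: List[Callable[[float], float]],
    gate_scores: List[float],
    k: int = 2,
) -> float:
    """Run only the selected experts and mix their outputs by gate weight."""
    chosen = route_top_k(gate_scores, k)
    return sum(weight * experts[i](x) for i, weight in chosen)


# Four toy experts; only two of them execute for any given input.
experts = [lambda x: x + 1.0, lambda x: 2.0 * x, lambda x: -x, lambda x: x * x]
gate_scores = [0.1, 2.0, -1.0, 2.0]  # hypothetical gate output for one input

chosen = route_top_k(gate_scores, k=2)
output = moe_forward(3.0, experts, gate_scores, k=2)
print(chosen, output)
```

With four experts and k = 2, half of the expert computation is skipped for this input. The same idea at scale lets total parameter count grow much faster than per-input compute.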
