Facebook service describing a photo on Instagram
Speech Recognition

Every Picture Tells a Story

Facebook expanded a system of vision, language, and speech models designed to open the social network to users who are visually impaired. A Facebook service that describes photos in a synthesized voice now recognizes 1,200 visual concepts — 10 times more than the previous version.
2 min read
Alexa device and information about its new skill called natural turn-talking
Speech Recognition

Alexa, Read My Lips

Amazon’s digital assistant is using its eyes as well as its ears to figure out who’s talking. At its annual hardware showcase, Amazon introduced an Alexa skill that melds acoustic, linguistic, and visual cues to help the system keep track of individual speakers and topics of conversation.
1 min read
Graphs and data related to transformer networks
Speech Recognition

The Transformation Continues

Transformer networks are gaining popularity as a high-accuracy alternative to recurrent neural networks. But they can run slowly when they’re applied to long sequences.
2 min read
Series of pictures of people smiling
Speech Recognition

Deepfakes for Good

A strategy manifesto from one of China’s biggest tech companies declares, amid familiar visions of ubiquitous AI, that deepfakes are more boon than bane.
2 min read
Apple watch with countdown
Speech Recognition

The AI of Small Things

Some tech companies boast that their AI will change the world. Apple’s latest just aims to make your life a little easier. Apple unveiled a flock of modest conveniences powered by machine learning at its annual developer conference.
1 min read
Illustration of Amazon Alexa with a question mark inside of a thought bubble
Speech Recognition

What Were We Talking About?

Conversational agents have a tough job following the zigs and zags of human conversation. They’re getting better at it — thanks to yesterday’s technology. Amazon recently improved the Alexa chatbot’s ability to identify the current topic of conversation.
1 min read
Illustration of doctor sheets and a pencil
Speech Recognition

Data: From Patient to Health Record

Doctors are overwhelmed by clerical work. Healthcare-savvy voice assistants are picking up the slack.
2 min read
Conference on Microsoft Teams with a person eating a chip bag
Speech Recognition

Silent Snacking

As working from home becomes the new normal, AI may protect you from the sound of coworkers munching while they chat. No more smacking lips and rustling chip bags! Microsoft’s online collaboration platform Teams announced a feature that removes extraneous sounds from videoconferences.
1 min read
Two pandas eating
Speech Recognition

What Love Sounds Like

Female giant pandas are fertile for only 24 to 36 hours a year: Valentine’s Day on steroids. A new neural network alerts human keepers when a panda couple mates.
1 min read
Chart with number of AI startup acquisitions from 2010 to 2019
Speech Recognition

AI Startups in Demand

AI startups are being scooped up at an accelerating pace, many by companies outside the tech sphere. A report by CB Insights shows that, as of August, 2019 was on track to surpass last year’s record number of AI startup acquisitions.
1 min read
Average Relative WER improvement as a function of the amount of training data
Speech Recognition

Speech Recognition With an Accent

Models that achieve state-of-the-art performance in automatic speech recognition (ASR) often perform poorly on nonstandard speech. New research offers methods to make ASR more useful to users with heavy accents or speech impairment.
2 min read

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox