Speech Recognition

30 Posts

DoNotPay's system that autonomously navigates phone menus and converses with customer service representatives working
Speech Recognition

Your Personal Deepfaked Agent: This GPT-powered voice tool will talk to customer service for you.

Hate talking to customer service? An AI-powered tool may soon do it for you. Joshua Browder, chief executive of the consumer advocacy organization DoNotPay, demonstrated a system that autonomously navigates phone menus and converses...
Moving slide with information about AWS AI Service Cards.
Speech Recognition

Transparency for AI as a Service: Amazon introduces service cards to enhance responsible AI.

Amazon published a series of web pages designed to help people use AI responsibly. Amazon Web Services introduced so-called AI service cards that describe the uses and limitations of some models it serves.
Image of body parts in Hokkien, map showing Hokkien speaking regions across the world and Model architecture of S2ST
Speech Recognition

Translating a Mostly Oral Language: How Meta Trained an NLP Model to Translate Hokkein

Most speech-to-speech translation systems use text as an intermediate mode. So how do you build an automated translator for a language that has no standard written form? A new approach trained neural networks to translate a primarily oral language.
Illustration of the Dialogue Transformer Language Model (DLM)
Speech Recognition

The Sound of Conversation: AI Learns to Mimic Conversational Pauses and Interruptions

In spoken conversation, people naturally take turns amid interjections and other patterns that aren’t strictly verbal. A new approach generated natural-sounding audio dialogs without training on text transcriptions that mark when one party should stop speaking and the other should chime in.
Illustration of a laughing robot
Speech Recognition

Toward Machines That LOL: Scientists Teach a Speech Recognition Robot to Laugh

Even if we manage to stop robots from taking over the world, they may still have the last laugh. Researchers at Kyoto University developed a series of neural networks that enable a robot engaged in spoken conversation to chortle along with its human interlocutor.
parsing network diagram
Speech Recognition

Speaking Your Language: Startup Papercup Offers AI-Powered Voice Translation

A startup that automatically translates video voice overs into different languages is ready for its big break. London-based Papercup offers a voice translation service that combines algorithmic translation and voice synthesis with human-in-the-loop quality control.
Illustration of a robot with a captain costume
Speech Recognition

Neural Networks: Find the Function — A Basic Introduction to Neural Networks

Let’s get this out of the way: A brain is not a cluster of graphics processing units, and if it were, it would run software far more complex than the typical artificial neural network. Yet neural networks were inspired by the brain’s architecture.
AI-generated personas
Speech Recognition

Colleague in the Machine: Your future co-worker may be powered by AI.

Your next coworker may be an algorithmic teammate with a virtual face. WorkFusion unveiled a line of AI tools that automate daily business tasks. One thing that sets them apart is the marketing pitch: Each has a fictitious persona including a name, face, and professional résumé.
AI Research SuperCluster (RSC)
Speech Recognition

New Supercomputer on the Block: All about Meta's AI Research Supercluster.

Facebook’s parent company is staking its future on a new compute cluster. Meta unveiled AI Research SuperCluster (RSC), which is designed to accelerate training of large models for applications like computer vision, natural language processing, and speech recognition.
Questionnaire for evaluating AI system vendors
Speech Recognition

Standards for Hiring Algorithms: Met, Walmart, and more agree to hiring algorithm standards.

Some of the world’s largest corporations will use standardized criteria to evaluate AI systems that influence hiring and other personnel decisions.
Books on the floor
Speech Recognition

How to Learn Machine Learning

Want to become an AI practitioner? Here’s a program that will take you from beginner to job-ready. You may already have a head start, depending on your background. For a motivated person who starts with a solid high-school education it may take around two years.
Animated graphics from Google demonstrate Project Relate, a tool for recognizing impaired speech. .
Speech Recognition

Everyone Has a Voice: Project Relate Offers Synthesized Speech that Works in Real Time

An Android app offers speech recognition model for speech impaired by cerebral palsy, Down syndrome, Parkinson’s disease, stroke, or traumatic brain injury.
First image showing the Google Tensor chip. Second image showing the Google Pixel 6 phone
Speech Recognition

Competition Heats Up in Mobile AI: Google Designed Its Own Tensor AI Chip for Smartphones

Google designed its own AI chip for its new smartphone — a snub to Qualcomm, the dominant chip vendor in Android phones. What’s new: Google debuted the Tensor chip last week
GIF showing a zoomed in human mouth speaking through a headset microphone
Speech Recognition

Your Voice, Your Choice: AI-Powered Tool Modifies Voices in Real Time

A startup enables people who participate in voice chat to use realistic artificial voices in real time. What’s new: Massachusetts-based Modulate offers a voice-masking tool to forestall harassment of people, particularly women and trans individuals,
GIF showing Amazon Housold Robot working
Speech Recognition

Guard Bot: Amazon Household Robot Patrols Home for Intruders

Amazon unveiled a robot that patrols users’ homes, scopes out strangers, and warns of perceived dangers.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox