issue-291

1 Post

Diagram of an RQ-Transformer speech system with Helium and Depth Transformers for audio processing.
issue-291

GPT-4.5 Goes Big, Claude 3.7 Reasons, Alexa+ Goes Agentic, Generating Text Like an Image

The Batch AI News and Insights: Continuing our discussion on the Voice Stack, I’d like to explore an area that today’s voice-based systems mostly struggle with: Voice Activity Detection (VAD) and the turn-taking paradigm of communication.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox