École des Hautes Études en Sciences Sociales

Illustration of the Dialogue Transformer Language Model (DLM)

The Sound of Conversation: AI Learns to Mimic Conversational Pauses and Interruptions

In spoken conversation, people naturally take turns amid interjections and other patterns that aren’t strictly verbal. A new approach generated natural-sounding audio dialogs without training on text transcriptions that mark when one party should stop speaking and the other should chime in.
