
Changelog Master Feed
Full-duplex, real-time dialogue with Kyutai (Practical AI #298)
Dec 4, 2024
Alexandre Défossez, co-founder of Kyutai and a scientist focused on real-time speech-to-speech AI, discusses Moshi, Kyutai's model for full-duplex communication. He explains how Kyutai promotes open-source research within the French AI landscape. The discussion also covers the audio datasets needed to improve text-to-speech systems and the differences between nonprofit and for-profit AI initiatives. Défossez also offers a glimpse into the future of AI technologies, emphasizing the growing importance of collaboration in advancing the field.
AI Snips
French AI Ecosystem
- France's strong engineering and math focus created fertile ground for AI, attracting companies like Facebook.
- This has fostered a growing independent AI ecosystem with startups and access to resources.
Open Science at Kyutai
- Open science involves explaining the research process, including mistakes and what was tried.
- It goes beyond releasing weights, aiming for transparency in training pipelines.
Kyutai's Advantages
- Kyutai prioritizes agility and open-source, commercially friendly licenses, unlike larger companies.
- They focus on on-device models, avoiding the benchmark race of for-profit labs.

