
Lex Fridman Podcast #151 – Dan Kokotov: Speech Recognition with AI and Humans
7 snips
Jan 4, 2021 Dan Kokotov, VP of Engineering at Rev.ai, shares his expertise in automatic speech recognition technology. He discusses the challenges of real-time transcription, including accuracy issues with accents and pacing. Kokotov emphasizes the role of user feedback and data quality in improving ASR systems. He also explores the future of transcription services in the gig economy and highlights the importance of bridging human and machine efforts. Their conversation touches on the evolution of podcasting and the need for standardized transcripts to enhance accessibility.
AI Snips
Chapters
Books
Transcript
Episode notes
Rev's Simplicity
- Lex Fridman praises Rev for simplifying the transcription process, unlike his past experiences with Upwork.
- He compares it to Isotope RX, another product that streamlined his audio editing workflow.
Rev's Origin
- Rev aimed to improve the Upwork model by standardizing service categories and simplifying the user experience.
- They started with translation services and later added audio transcription.
Rev's Work Philosophy
- Rev positions itself not as part of the "gig economy," but as a way to improve work-from-home opportunities.
- They emphasize flexibility and removing geographical and time limitations for freelancers.









