How does it work?
Our platform integrates with the Google Speech-to-Text API to automatically create transcripts from podcast audio.
Machine learning technology applies neural network models to audio content to detect speech and convert it into text.
Each model is trained to understand a specific language and dialect. With additional training and optimisation, these models become more accurate over time.
Transcription can only be performed for languages that have a model available.
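As a rough illustration of that restriction, the check below refuses transcription when no model exists for the requested dialect. It is only a sketch: the language list and the function name `can_transcribe` are hypothetical, and the real set of supported languages depends on the speech recognition provider.

```python
# Hypothetical set of language codes that have a model available.
# The real list comes from the speech recognition provider.
SUPPORTED_LANGUAGES = {"en-US", "en-GB", "fr-FR", "de-DE"}


def can_transcribe(language_code: str) -> bool:
    """Return True if a model exists for this exact language dialect."""
    # Accept "en_us", "EN-US", " en-US " and similar spellings.
    normalized = language_code.strip().replace("_", "-").lower()
    return normalized in {code.lower() for code in SUPPORTED_LANGUAGES}
```

With this example list, a podcast tagged `en-GB` would be accepted, while one tagged `en-AU` would be rejected because no model is listed for that dialect.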
Why use it?
Including a text version of audio content on web pages helps SEO, because it gives search engines more relevant keywords to index.
In addition, automated transcripts are a good starting point for a human-curated version, which frequently only needs punctuation and correction of person and brand names.
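Part of that clean-up can be scripted before a human editor takes over. The sketch below replaces known mis-transcriptions of person and brand names using a glossary. The glossary entries and the function name `correct_names` are made up for illustration, and this is not a feature of the platform itself.

```python
import re
from typing import Mapping

# Hypothetical glossary: how a name tends to be mis-transcribed -> correct form.
NAME_CORRECTIONS = {
    "jon smith": "John Smyth",
    "acme cast": "AcmeCast",
}


def correct_names(transcript: str, corrections: Mapping[str, str]) -> str:
    """Replace each known mis-transcription with its correct spelling."""
    for wrong, right in corrections.items():
        # Match whole words only, ignoring case.
        pattern = re.compile(r"\b" + re.escape(wrong) + r"\b", re.IGNORECASE)
        # A function replacement keeps backslashes in `right` literal.
        transcript = pattern.sub(lambda _match, r=right: r, transcript)
    return transcript
```

A glossary like this handles recurring names, such as hosts and sponsors. Punctuation and one-off names still need a human pass.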