Automatic Transcription of the audio (AI transcription)
Podigee offers the option of running an audio transcription algorithm (AI) when encoding. We save the resulting transcript on our servers in two formats, once as a JSON file and once as a VTT file. The URLs of both files are included in your RSS feed via the <podcast:transcript> tag.
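To illustrate how the transcript links can be read from a feed, here is a minimal Python sketch. It assumes the tag follows the Podcasting 2.0 "podcast" namespace with url and type attributes. The feed excerpt, the example.com URLs, the MIME types shown, and the function name transcript_links are illustrative assumptions, not real Podigee output.

```python
import xml.etree.ElementTree as ET

# Namespace URI of the Podcasting 2.0 "podcast" namespace.
PODCAST_NS = "https://podcastindex.org/namespace/1.0"

# Hypothetical feed excerpt; the URLs and types are placeholders (assumption).
SAMPLE_FEED = """<rss version="2.0" xmlns:podcast="https://podcastindex.org/namespace/1.0">
  <channel>
    <title>Example Show</title>
    <item>
      <title>Episode 1</title>
      <podcast:transcript url="https://example.com/ep1/transcript.json" type="application/json"/>
      <podcast:transcript url="https://example.com/ep1/transcript.vtt" type="text/vtt"/>
    </item>
  </channel>
</rss>
"""


def transcript_links(feed_xml):
    """Return (episode title, url, type) for every transcript tag in the feed."""
    root = ET.fromstring(feed_xml)
    links = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        # Namespaced tags are addressed as "{namespace-uri}localname".
        for tag in item.findall(f"{{{PODCAST_NS}}}transcript"):
            links.append((title, tag.get("url"), tag.get("type")))
    return links


for title, url, mime in transcript_links(SAMPLE_FEED):
    print(f"{title}: {mime} -> {url}")
```

To check your own feed, you could pass its downloaded XML text to transcript_links instead of SAMPLE_FEED.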
What can the transcription algorithm do?
The algorithm attempts to convert spoken words into text and to group passages of speech into timestamped segments.
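As an illustration of what such timestamped segments look like, the following Python sketch reads the cues of a WebVTT file. The sample text is made up and the helper names (to_seconds, parse_cues) are hypothetical. It handles only basic cues with a timing line followed by text, not the full WebVTT format.

```python
import re

# Hypothetical WebVTT excerpt; the text is illustrative, not real Podigee output.
SAMPLE_VTT = """WEBVTT

00:00:00.000 --> 00:00:04.200
Welcome to the show.

00:00:04.200 --> 00:00:09.750
Today we talk about transcripts.
"""

# Matches a cue timing line such as "00:00:04.200 --> 00:00:09.750".
# The hours part is optional in WebVTT.
TIMING = re.compile(
    r"^((?:\d+:)?\d{2}:\d{2}\.\d{3})\s+-->\s+((?:\d+:)?\d{2}:\d{2}\.\d{3})"
)


def to_seconds(stamp):
    """Convert "hh:mm:ss.ttt" or "mm:ss.ttt" to seconds."""
    seconds = 0.0
    for part in stamp.split(":"):
        seconds = seconds * 60 + float(part)
    return seconds


def parse_cues(vtt_text):
    """Return a list of (start_seconds, end_seconds, text) tuples."""
    cues = []
    # Cues are separated by blank lines; the "WEBVTT" header block has no
    # timing line and is therefore skipped.
    for block in re.split(r"\n\s*\n", vtt_text.strip()):
        lines = block.splitlines()
        for i, line in enumerate(lines):
            match = TIMING.match(line)
            if match:
                text = " ".join(lines[i + 1:]).strip()
                cues.append(
                    (to_seconds(match.group(1)), to_seconds(match.group(2)), text)
                )
                break
    return cues


for start, end, text in parse_cues(SAMPLE_VTT):
    print(f"{start:7.2f}s - {end:7.2f}s  {text}")
```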
Once the transcript has been created, it can be called up in the "Transcript" tab in the episode settings and edited in the transcript editor. This is what the created text document looks like in the transcript editor, without manual editing:
In the editor, you can listen to short passages again and correct the text right at that point. Without this review, confusing passages can sometimes remain in the text.
You also have the option of downloading the transcript directly as a text file. A downloaded transcript can look like the following image, for example.
What can the transcription algorithm not do?
This is why post-editing of the text is always necessary. Fully automated transcription is not possible at the moment. However, editing the automatically generated text still takes less time than transcribing completely by hand.
Does transcription cost extra and how do I activate it?
Please note: If the transcription option is activated, it is applied to every newly encoded episode. The free re-encoding within 7 days does not apply to transcription. If you want to re-encode an episode that has already been transcribed, you should disable this feature first.
Before publication and as the default setting:
After publication:
What does the transcript look like in the web player and the Podigee blog?
Podigee Web-Player:
Podigee Blog of the episode, with speaker:
Podigee Blog of the episode, without speaker:
Why do I need transcripts?
There are certainly many more use cases that are not mentioned here. For those, we rely entirely on the creativity of our users.
Best practice
There are a few things you can keep in mind when recording and editing: Audio quality should be as good as possible, and speakers should avoid talking over one another and should speak as clearly as possible. These are all aspects that make a good podcast in general, but here in particular they make it easier for the algorithm to recognize speech.