Manual Transcription / Format of Text File of a Transcript
Which file formats are common, and what should the transcript look like?
Only the following file types should be used for a transcript: txt, srt, docx, vtt.
A manually written transcript should look like this:
- a timestamp in square brackets in the format HH:MM:SS (hours:minutes:seconds)
- the name of the speaker, ending with a colon
- the speaker's text
Here are some more examples of formats that we can also use to create a valid VTT file:
[00:00:01] Speaker 1: Text spoken by the speaker.
[00:00:10] Speaker 2: Text spoken by the other speaker.
---------
Speaker 1: [00:00:01] Text spoken by the speaker.
Speaker 2: [00:00:10] Text of the other speaker.
---------
XY Podcast with John Doe
00:00:00 Speaker 1: Text of the speaker.
00:00:10 Speaker 2: Text of the other speaker.
00:00:50 *Music*
---------
00:00:00.100 - 00:00:09.000
Speaker 1:
Speaker's text.
00:00:10.000 - 00:00:27.160
Speaker 2:
The other speaker's text.
⚠️ Files without timestamps cannot be converted into a VTT file. Such a transcript is only displayed in the Podigee Player.
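To illustrate why timestamps matter, here is a minimal Python sketch of how lines in the first format ("[HH:MM:SS] Speaker: text") could be turned into WebVTT cues. It is not Podigee's actual converter. The names parse_transcript, to_vtt and last_cue_length are hypothetical, and ending each cue at the start of the next one is an assumption made for this example.

```python
import re

# Assumed pattern for the first format shown above: "[HH:MM:SS] Speaker: text".
LINE_RE = re.compile(r"^\[(\d{2}):(\d{2}):(\d{2})\]\s*([^:]+):\s*(.*)$")


def parse_transcript(text):
    """Return a list of (start_seconds, speaker, spoken_text) tuples."""
    cues = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # a line without a timestamp cannot become a cue
        h, m, s, speaker, spoken = match.groups()
        start = int(h) * 3600 + int(m) * 60 + int(s)
        cues.append((start, speaker.strip(), spoken.strip()))
    return cues


def format_time(seconds):
    """Format whole seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    h, rest = divmod(seconds, 3600)
    m, s = divmod(rest, 60)
    return f"{h:02d}:{m:02d}:{s:02d}.000"


def to_vtt(cues, last_cue_length=5):
    """Build WebVTT text; each cue ends where the next one starts (assumption)."""
    lines = ["WEBVTT", ""]
    for i, (start, speaker, spoken) in enumerate(cues):
        end = cues[i + 1][0] if i + 1 < len(cues) else start + last_cue_length
        if end <= start:
            # WebVTT needs end > start, so fall back to the assumed length.
            end = start + last_cue_length
        lines.append(f"{format_time(start)} --> {format_time(end)}")
        lines.append(f"<v {speaker}>{spoken}")  # WebVTT voice tag names the speaker
        lines.append("")
    return "\n".join(lines)


sample = (
    "[00:00:01] Speaker 1: Text spoken by the speaker.\n"
    "[00:00:10] Speaker 2: Text spoken by the other speaker.\n"
)
print(to_vtt(parse_transcript(sample)))
```

Because the start time of every cue comes from the timestamp, a transcript without timestamps leaves nothing to build cues from.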
Is the manual transcript included in the feed?
We save this transcript on our servers twice: once as a JSON file and once as a VTT file. The URLs of both files are included in your RSS feed via <podcast:transcript> tags.
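As a rough illustration of what such feed entries can look like, the following Python sketch builds two <podcast:transcript> tags inside an <item> element. The url and type attributes and the namespace URI follow the Podcast Namespace specification. The example.com URLs and the function name add_transcript_tags are hypothetical, and the exact attributes that Podigee writes into your feed are an assumption here.

```python
import xml.etree.ElementTree as ET

# Namespace URI of the Podcast Namespace, which defines <podcast:transcript>.
PODCAST_NS = "https://podcastindex.org/namespace/1.0"
ET.register_namespace("podcast", PODCAST_NS)


def add_transcript_tags(item, vtt_url, json_url):
    """Append one <podcast:transcript> tag per file to a feed <item>."""
    for url, mime_type in ((vtt_url, "text/vtt"), (json_url, "application/json")):
        ET.SubElement(
            item,
            f"{{{PODCAST_NS}}}transcript",
            {"url": url, "type": mime_type},
        )
    return item


# Hypothetical URLs; the real ones point to the files stored on the servers.
item = ET.Element("item")
add_transcript_tags(
    item,
    "https://example.com/episode-1.vtt",
    "https://example.com/episode-1.json",
)
print(ET.tostring(item, encoding="unicode"))
```

Podcast apps that support the tag can pick the file type they understand from the type attribute.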