PodSearch →

Underscore David Smith:

Back in 2017 I had created a site which took the the audio of some of my favorite podcasts and tried to make them searchable by passing them through an automated speech-to-text engine.

In the original version of this the transcription was done with very rough, slow, low accuracy system which resulted in transcripts that were fit for keyword searching but not much else.

Thankfully since then OpenAI has released Whisper a powerful speech-to-text engine that I can run right on my Mac and results in transcripts that are shockingly good. They aren’t quite at the level of a human transcriber but they get darn close in many instances. Getting close to the level where you could use them to grab a pull quote with only a little bit of tidying up to do.

I’m thrilled that Connected is archived in the new version of PodSearch. This tool is amazing.