We are addressing the problem of enhancing the precision of transcriptions, particularly when it comes to company names and product references. Our solution involves a dual strategy that utilizes both the Whisper prompt parameter and GPT-4's post-processing capabilities.
Two approaches to correct inaccuracies are:
We input a list of correct spellings directly into Whisper's prompt parameter to guide the initial transcription.
We utilized GPT-4 to fix misspellings post transcription, again using the same list of correct spellings in the prompt.
These strategies aimed at ensuring precise transcription of unfamilar proper nouns.
To get started, let's:
- Import the OpenAI Python library (if you don't have it, you'll need to install it with
pip install openai)
- Download the audio file example