OpenAI audio transcription not working

Question

Typebot's OpenAI transcription does not seem to be working. Whenever I try it, the model hallucinates. I downloaded the audio file from S3 and sent it to OpenAI from Postman, and it returns the same result: "MBC 뉴스 이덕영입니다." (Korean for "This is Lee Deok-young, MBC News.")

vibing · Answer

@Baptiste I know you're super busy with queries, but I'd really appreciate any help you can give with this.

Baptiste · Answer

Well it's not from Typebot's side then

Baptiste · Answer

but maybe OpenAI is not performant enough?

Baptiste · Answer

Since you tried without Typebot

vibing · Answer

I tried with Typebot and it showed me the same output as well. So I downloaded the audio file from S3 and then used Postman to call the OpenAI transcription endpoint directly as a test.

vibing · Answer

It works when I convert the .mp4 to .mp3 and then send it to OpenAI.

Baptiste · Answer

I don't understand what you are showing in the screenshot then?

Baptiste · Answer

I can't read the response body.

vibing · Answer

I downloaded the mp4 audio file from S3 and then used Postman to call the Whisper API. This is the request and response.

vibing · Answer

Then I converted the same file from mp4 to mp3 locally and tried it again

vibing · Answer

and it worked
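For anyone who wants to reproduce the local conversion step, here is a minimal sketch. It assumes `ffmpeg` is installed and on the PATH; the function names and the example file name are hypothetical, and this is not necessarily the exact command used in this thread.

```python
import subprocess
from pathlib import Path
from typing import List


def build_ffmpeg_command(src: str) -> List[str]:
    """Build the ffmpeg arguments that re-encode `src` as an MP3 next to it."""
    dst = Path(src).with_suffix(".mp3")
    return [
        "ffmpeg",
        "-y",                      # overwrite the output file if it already exists
        "-i", src,                 # input recording, e.g. the .mp4 downloaded from S3
        "-vn",                     # ignore any video stream, keep audio only
        "-codec:a", "libmp3lame",  # encode the audio as MP3
        str(dst),
    ]


def convert_to_mp3(src: str) -> str:
    """Run ffmpeg and return the path of the converted file."""
    command = build_ffmpeg_command(src)
    subprocess.run(command, check=True)  # raises CalledProcessError if ffmpeg fails
    return command[-1]


# Example (hypothetical file name): convert_to_mp3("recording.mp4")
```

The resulting .mp3 can then be sent to the transcription endpoint in the same way as the original file.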

vibing · Answer

I used the "Audio GPT" template from Typebot, and it seems to have the same issue when transcribing audio.

Baptiste · Answer

So you are saying that the transcription is working but is not accurate and if you convert the audio file to mp3 it is far more accurate? How can I try this? Do you have an example in English?

vibing · Answer

Yes, you can try this mp4, which I recorded and downloaded from the Typebot S3 bucket.

Baptiste · Answer

Then you are talking about an issue on OpenAI's end... The audio recording is clear to me

vibing · Answer

Yes, the audio is clear, but when you run it through OpenAI transcription, it is not transcribed correctly and the model hallucinates. Could it be the mp4 format? I have seen that suggested in a couple of OpenAI threads.

Baptiste · Answer

Like I said, then it is an issue from OpenAI’s side 🙏

Ademar · Answer

Hey guys, is there a way for Typebot to understand audio from the sender and answer accordingly?

Ademar · Answer

I know the bot can send audio, but when it receives audio, it says it doesn't understand.

Ademar · Answer

I'm using an OpenAI assistant with GPT-4o mini

Ademar · Answer

Do I need the regular model, or is it a configuration setting in Typebot?

Denny · Answer

Hey guys, I faced the same issue where I received strange responses. The problem lies in the file format. OpenAI seems to have issues processing .mp4 files. If you use iOS devices, you’re out of luck because all messages are provided as .mp4. However, Android works fine because files are provided as .webm. I had to create an automation to convert the audio and feed it back into Typebot to solve this.
Of course, the file format itself is not a Typebot issue, but it’s a headache if you can’t just use audio recording and transcription within Typebot itself.
Unfortunately, I also noticed that there are some issues with audio recording in Typebot depending on the browser. Here’s what I have experienced so far (I could not test on Windows).

OSX:

❌ Firefox: Recording does not start at button click
✅ Chrome: Can send audio messages and audio file format "webm" is being accepted by OpenAI
❌ Safari: Recording starts, but can’t send audio recording (keeps recording when clicking send button)

iOS:

❌ Firefox: Can send audio messages but audio format "mp4" is not being accepted by OpenAI
❌ Chrome: Can send audio messages but audio format "mp4" is not being accepted by OpenAI
❌ Safari: Can send audio messages but audio format "mp4" is not being accepted by OpenAI

Android:

❌ Firefox: Recording starts, but can’t send audio recording (keeps recording when clicking send button)
✅ Chrome: Can send audio messages and audio file format "webm" is being accepted by OpenAI
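
As a rough illustration of the routing step in such an automation, the sketch below decides from the file URL whether a recording should be converted before transcription. The rule (convert .mp4, pass .webm through) is only an assumption drawn from the results listed above, and the names `EXTENSIONS_TO_CONVERT` and `needs_conversion` are hypothetical.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Assumption based on the results above: .mp4 recordings were transcribed
# badly, while .webm recordings were fine.
EXTENSIONS_TO_CONVERT = {".mp4"}


def needs_conversion(file_url: str) -> bool:
    """Return True if the recording should be re-encoded before transcription."""
    # Use only the URL path so signed-URL query strings do not affect the check.
    path = urlparse(file_url).path
    suffix = PurePosixPath(path).suffix.lower()
    return suffix in EXTENSIONS_TO_CONVERT
```

A check like this lets .webm recordings from Chrome go straight to transcription, while .mp4 recordings are converted first.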

Baptiste · Answer

Appreciate the testing! Let me create a GitHub issue and I will see what I can do

Baptiste · Answer

https://github.com/baptisteArno/typebot.io/issues/1917

Baptiste · Answer

Hi @Denny, which version of Typebot are you running? I could not reproduce the issues you mention

Denny · Answer

Hi @Baptiste, thanks for looking into it. The issues regarding the audio message sending errors appeared on v3.1.2. I've now double-checked before updating, and they were still there. After testing, I have updated to v3.2, and it seems that audio message sending works fine now. The only thing I have noticed is that macOS is sending .mp4 files in Safari as well. Therefore, the transcribing problem will appear on all iOS browsers and Safari on macOS.

Baptiste · Answer

OK, then that is a separate issue, related to OpenAI

Baptiste · Answer

Thanks for confirming
