Question 1

Is my audio uploaded anywhere?

Accepted Answer

Yes. To transcribe with this high-accuracy model, your audio is sent to our server (running on Cloudflare), where it is converted to text. The audio is processed in memory and is not kept afterwards: we don't save your audio or your transcript. For highly sensitive recordings that you'd prefer never leave your device, use a fully offline tool instead.

Question 2

Does it work for Tagalog and Filipino?

Accepted Answer

Yes. Whisper large-v3-turbo is multilingual and handles Tagalog well, including a fair amount of Taglish code-switching. For the best Tagalog results, set the language to Tagalog rather than relying on Auto-detect. Accuracy is highest with clear speech and limited background noise; very fast slang, several people talking at once, or strong noise can still reduce accuracy.

Question 3

How long can the audio be?

Accepted Answer

Up to about 90 minutes per file. Recordings longer than about 5 minutes are automatically split into ~5-minute parts that are transcribed in sequence, so you'll see progress as it works. If a recording is longer than the 90-minute limit, split it into shorter files yourself and transcribe each one separately.
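To make the splitting concrete, here is a minimal Python sketch of how a recording's duration could be divided into ~5-minute parts. It is an illustration only, not the tool's actual code: the function name `plan_chunks` and the exact constants (300 seconds per part, 5400 seconds per file) are assumptions based on the approximate figures above.

```python
from typing import List, Tuple

# Assumed values, taken from the approximate limits described above.
CHUNK_SECONDS = 5 * 60    # ~5-minute parts
MAX_SECONDS = 90 * 60     # ~90-minute per-file limit


def plan_chunks(
    duration_s: float,
    chunk_s: float = CHUNK_SECONDS,
    max_s: float = MAX_SECONDS,
) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds for each part of a recording."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    if duration_s > max_s:
        # Over the limit: the file has to be split by hand first.
        raise ValueError("recording exceeds the per-file limit; split it first")

    chunks: List[Tuple[float, float]] = []
    start = 0.0
    while start < duration_s:
        # The last part is shorter when the duration is not a multiple of chunk_s.
        end = min(start + chunk_s, duration_s)
        chunks.append((start, end))
        start = end
    return chunks


if __name__ == "__main__":
    parts = plan_chunks(12 * 60)  # a 12-minute recording
    for i, (start, end) in enumerate(parts, start=1):
        print(f"part {i} of {len(parts)}: {start:.0f}s to {end:.0f}s")
```

Under these assumptions, a 12-minute recording becomes two full 5-minute parts plus a final 2-minute part, which is why progress appears step by step rather than all at once.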

Question 4

How accurate is it?

Accepted Answer

Whisper large-v3-turbo is one of the most accurate openly available speech models, and it's far better than lightweight in-browser models for Tagalog and noisy audio. Even so, expect to proofread names, numbers, and punctuation. Setting the language explicitly also helps. Noisy, multi-speaker event recordings are the hardest case for any speech model.

Question 5

Why did it produce odd or made-up words?

Accepted Answer

Whisper is a speech recognizer, so music and singing are its weakest point: it can invent lyrics that were never sung. Heavy background noise, overlapping speakers, and mumbling can also cause errors. For reliable results, use recordings of people talking (interviews, voice notes, lectures, meetings) that are as clear as possible.

Question 6

How much does it cost?

Accepted Answer

It's free to use, with no sign-up. The transcription runs on Cloudflare Workers AI behind the scenes; there's nothing to install and no model to download.

Audio to Text Transcriber

Turn audio into text — English or Tagalog
