About the Audio Transcriber

Audio Transcriber is a free online tool that converts spoken audio into text, powered by OpenAI's Whisper model running locally on our server — no API key, no external services, no charge.

How It Works

Your audio file is uploaded to our server where the OpenAI Whisper "base" model transcribes it. Whisper is a general-purpose automatic speech recognition (ASR) model trained on 680,000 hours of multilingual speech data. The base model balances accuracy and speed: it supports 99 languages and is reasonably accurate for clear recordings with minimal background noise.

You can let the tool detect the language automatically, or specify one of the supported languages to improve accuracy when the language is known. The result includes the transcribed text, the detected language, and the audio duration.

What Is sortout.app?

sortout.app is a growing collection of focused web tools, each designed to do exactly one thing well. Every tool is free, requires no account, and is built to load fast anywhere.

Privacy

Your files are sent to our server over HTTPS, processed immediately, and deleted — never stored or shared. See our full Privacy Policy.

Frequently Asked Questions

Is this tool free?

Yes, completely free with no usage limits.

Do I need to create an account?

No. There is no sign-up, no login, and no registration of any kind.

What audio formats are supported?

MP3, WAV, OGG, M4A, and FLAC. Maximum file size is 50 MB.

How accurate is the transcription?

Whisper base is very accurate for clear recordings in English. Accuracy decreases with heavy accents, background noise, overlapping speakers, or low-quality recordings. For best results, use a quiet environment and a clear microphone.

Is my audio stored on your servers?

No. Audio files are deleted immediately after transcription is complete. See our Privacy Policy for full details.

Contact

Questions or feedback? Open an issue at github.com/sortout-app/feedback or email hello@sortout.app.