How it works
VidPickr pulls the audio-only stream of the video you paste, hands it to our speech-recognition pipeline, and produces a clean transcript with timestamps. First run takes a moment to warm up; the same machine then starts transcribing instantly on subsequent videos.
Why use this over other transcription tools?
Most free transcription sites paywall their best model, cap free use to a few minutes per day, or hand back lossy auto-captions disguised as new transcripts. VidPickr keeps the whole pipeline free — every video, every length, every supported language, with real word-level timestamps in the SRT and VTT output.
Speed expectations
- Apple Silicon (M1/M2/M3/M4): ~3-5× realtime on Whisper Base. A 30-minute video transcribes in 6-10 minutes.
- Recent Intel + dedicated GPU: similar, with WebGPU. Without WebGPU, ~1-2× realtime.
- Older laptops / mobile: works but slower; consider Whisper Tiny (39 MB) for short videos under 5 min.
What about subtitles that already exist?
If the video creator uploaded subtitles or YouTube generated auto-captions for it, you don’t need this tool — just use the subtitle downloader which exports those directly. Use Transcribe when there are no subtitles, when the auto-captions are wrong, or when you want a different language via the “Translate to English” mode.