Whisper tiny.en is only 75 MB and accurate for transcription itself. We default to small.en and also offer Whisper Small (multilingual) and others so people can pick what's best for their situation.
I tagged Qwen 2.5 here but we actually used Qwen 3.5 family series. Found them to be the best mix of small, fast, and accurate for cleaning up "ums" etc.