OpenAI
Whisper Tiny English (Quantized)
Smallest possible speech recognition model. Only 32MB. English only. Default speech model - guaranteed to run on any iPhone.
About This Model
The Whisper Tiny English (Quantized) model by OpenAI is a lightweight automatic speech recognition (ASR) model designed for efficient local deployment. With only 0.039 billion parameters, this quantized version of the Whisper architecture is optimized for minimal resource usage while maintaining reasonable accuracy for English speech recognition tasks. It is particularly well-suited for applications where computational resources are limited, such as on edge devices or low-end computers. The model's small size and low VRAM requirement (0.1–0.1 GB) make it highly efficient, allowing it to run smoothly even on hardware with very limited memory.
In its size class, the Whisper Tiny English (Quantized) model punches above its weight. While it may not match the accuracy of larger, more resource-intensive ASR models, it offers a compelling balance between performance and efficiency. This makes it an excellent choice for real-time speech-to-text applications, such as voice commands, transcription of short audio clips, or basic dictation tasks. Users who prioritize low latency and minimal power consumption will find this model particularly useful. Ideal hardware for running this model includes Raspberry Pi, low-end laptops, or any device with limited processing power and memory.
Check Your Hardware
See which quantizations of Whisper Tiny English (Quantized) your hardware can run.
Quantization Options
| Quantization | Bits | File Size | VRAM Needed | RAM Needed | Quality |
|---|---|---|---|---|---|
| Q5_1 | 5 | 0.032 GB | 0.1 GB | 0.2 GB | 65% |
Frequently Asked Questions
How much VRAM do I need to run Whisper Tiny English (Quantized)?
Whisper Tiny English (Quantized) requires 0.1GB VRAM minimum with Q5_1 quantization. For full precision, you need 0.1GB VRAM.
What is the best quantization for Whisper Tiny English (Quantized)?
Q4_K_M offers the best balance of quality and VRAM usage. Q8_0 is near-lossless if you have enough VRAM.