Rhasspy
Piper TTS - Amy (English)
Lightweight TTS voice. High quality English speech synthesis. Default TTS model - runs on any iPhone. Only 63MB.
About This Model
Piper TTS - Amy is a lightweight text-to-speech model developed by Rhasspy, designed to generate natural-sounding English speech from written text. With only 0.02 billion (about 20 million) parameters, the model is compact enough to run on devices with limited computational resources. Despite its small size, Piper TTS - Amy produces clear, smooth audio. It is distributed in ONNX format, which keeps memory usage low: roughly 0.15 GB of VRAM.
While it may not match the nuanced intonations and complex speech patterns of larger, more resource-intensive models, Piper TTS - Amy punches well above its weight in terms of efficiency and usability. This makes it an excellent choice for developers and hobbyists working on embedded systems, Raspberry Pis, or any other low-power devices where local deployment is crucial. Its MIT license also ensures that it can be used freely in both personal and commercial projects, adding to its appeal for a wide range of applications.
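For local deployment, a minimal sketch of driving the Piper command-line tool from Python might look like the following. It assumes a `piper` executable is on the PATH and that a voice file has already been downloaded; the file names `amy.onnx` and `hello.wav` are hypothetical placeholders, not names taken from this page. Piper also expects the voice's JSON config next to the model (by default the model path plus `.json`).

```python
import subprocess


def build_piper_command(model_path: str, output_wav: str) -> list[str]:
    """Build the argument list for one Piper CLI invocation."""
    # --model selects the .onnx voice; --output_file is the WAV to write.
    return ["piper", "--model", model_path, "--output_file", output_wav]


def speak(text: str, model_path: str, output_wav: str) -> None:
    """Synthesize `text` to `output_wav` by piping it to Piper on stdin."""
    cmd = build_piper_command(model_path, output_wav)
    # Piper reads the text to speak from standard input.
    subprocess.run(cmd, input=text.encode("utf-8"), check=True)


if __name__ == "__main__":
    # Hypothetical paths: replace with the voice file you downloaded.
    speak("Hello from a low-power device.", "amy.onnx", "hello.wav")
```

Keeping command construction separate from execution makes the invocation easy to inspect or log before anything is run.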
Quantization Options
| Quantization | Bits | File Size | VRAM Needed | RAM Needed | Quality |
|---|---|---|---|---|---|
| ONNX | 16 | 0.063 GB | 0.15 GB | 0.3 GB | 85% |
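As a rough illustration of how the table can be read against a given machine, the sketch below checks the listed figures against available memory. It rests on an assumption not stated on this page: that the VRAM figure applies when running on a GPU and the RAM figure applies when running on CPU only. The `Variant` type and `placement` function are hypothetical names introduced for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Variant:
    name: str
    bits: int
    file_gb: float
    vram_gb: float
    ram_gb: float


# Figures copied from the table above.
VARIANTS = [Variant("ONNX", 16, 0.063, 0.15, 0.3)]


def placement(v: Variant, free_vram_gb: float, free_ram_gb: float) -> Optional[str]:
    """Return "gpu", "cpu", or None depending on where the variant fits.

    Assumption: VRAM is the GPU requirement, RAM the CPU-only requirement.
    """
    if free_vram_gb >= v.vram_gb:
        return "gpu"
    if free_ram_gb >= v.ram_gb:
        return "cpu"
    return None


if __name__ == "__main__":
    for v in VARIANTS:
        print(v.name, placement(v, free_vram_gb=0.0, free_ram_gb=1.0))
```

With a single row in the table the check is trivial, but the same function applies unchanged if more variants are listed.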
Frequently Asked Questions
How much VRAM do I need to run Piper TTS - Amy (English)?
Piper TTS - Amy (English) requires at least 0.15 GB of VRAM in its ONNX build, the only variant listed. The table also lists 0.3 GB of system RAM.
What is the best quantization for Piper TTS - Amy (English)?
ONNX (16-bit, 0.063 GB file) is the only option listed for this model, so there is no quality-versus-VRAM trade-off to make. GGUF-style quantizations such as Q4_K_M and Q8_0 are not listed for it.