AI Models
Browse 45+ AI models across chat, coding, image generation, video, audio, and more.
Coqui
XTTS v2
Multilingual TTS with voice cloning from 6-second samples. 17 languages. Very lightweight.
OpenAI
Whisper Turbo
8x faster than Large with minimal quality loss. Best speed/accuracy trade-off.
Stability AI
Stable Diffusion 1.5
The original Stable Diffusion. Huge ecosystem of LoRAs, ControlNets, and extensions. Best compatibility.
Suno
Bark
Versatile text-to-speech. Generates speech, music, and sound effects. Supports voice cloning.
Meta
Llama 3.2 1B Instruct
Ultra-compact 1B model. Runs on virtually any device including smartphones.
Alibaba
Wan 2.1 1.3B
Lightweight video generation. Consumer GPU friendly with CPU offloading.
Meta
MusicGen Medium
AI music generation from text prompts. Creates 30-second clips of genre-specific music.
OpenAI
Whisper Large v3
Best accuracy speech-to-text. Supports 99 languages. The gold standard for transcription.
Tsinghua/ZhipuAI
CogVideoX 2B
Accessible video generation model. Creates short video clips from text descriptions.
Meta
Llama 3.2 3B Instruct
Meta's compact 3B model designed for edge and mobile deployment.
Stability AI
Stable Diffusion XL
Higher resolution (1024x1024) image generation. Better detail and prompt following than SD 1.5.
Microsoft
Phi-3.5 Mini 3.8B
Tiny but capable 3.8B model. Runs on almost any hardware including phones.
Tsinghua/ZhipuAI
CogVideoX 5B
Higher quality video generation. Better motion and detail than the 2B variant.
LLaVA Team
LLaVA 1.6 7B
Lightweight vision-language model. Analyze images on consumer hardware.
Alibaba
Qwen2-VL 7B
Multimodal model supporting image, video, and text input. Dynamic resolution handling.
Mistral AI
Mistral 7B Instruct v0.3
Efficient 7B model from Mistral AI with strong performance for its size.
Alibaba
Qwen 2.5 7B Instruct
Efficient 7B model with strong coding and reasoning abilities. Great for local deployment.
Alibaba
Qwen 2.5 Coder 7B
Efficient coding model with 131K context. Great local Copilot replacement.
Meta
Llama 3.1 8B Instruct
Meta's 8B parameter instruction-tuned model. Great balance of performance and efficiency for local deployment.
DeepSeek
DeepSeek R1 Distill 8B
Compact reasoning model. Good reasoning capabilities in a small package.
Stability AI
Stable Diffusion 3.5 Large
Latest SD with MMDiT architecture. Best prompt adherence with T5-XXL encoder.
Gemma 2 9B Instruct
Google's efficient 9B model. Great performance-to-size ratio.
Genmo
Mochi 1
High-quality text-to-video. Produces realistic motion but requires significant VRAM.
Black Forest Labs
Flux.1 Dev
State-of-the-art image generation. 12B parameters. Best open-source image quality available.
Black Forest Labs
Flux.1 Schnell
Fast version of Flux. Only 4 inference steps needed. Near-instant generation on good GPUs.
Microsoft
Phi-4
Microsoft's 14B parameter model. Punches well above its weight class on reasoning benchmarks.
DeepSeek
DeepSeek Coder V2 Lite 16B
Lightweight MoE code model. Strong coding with only 2.4B active parameters.
Mistral AI
Codestral 22B
Mistral's dedicated code generation model. Supports 80+ programming languages.
Gemma 2 27B Instruct
Google's 27B instruction-tuned model. Strong general performance with efficient architecture.
Alibaba
Qwen 2.5 Coder 32B
Specialized coding model. One of the best open-source coding assistants available.
DeepSeek
DeepSeek R1 Distill 32B
Reasoning-focused model distilled from DeepSeek R1. Excellent at complex problem-solving.
LLaVA Team
LLaVA 1.6 34B
Vision-language model. Analyze images and answer questions about them. Best quality variant.
Mistral AI
Mixtral 8x7B Instruct
Mixture-of-experts model with 8 experts of 7B each. Excellent quality at moderate resource requirements.
Meta
Llama 3.1 70B Instruct
Meta's flagship 70B parameter model. Excellent performance rivaling GPT-4 on many benchmarks.
Meta
Llama 3.3 70B Instruct
Latest Meta 70B model with improved reasoning and multilingual capabilities.
Alibaba
Qwen 2.5 72B Instruct
Alibaba's top-tier 72B model. Excellent at coding, math, and multilingual tasks.
Cohere
Command R+ 104B
Cohere's 104B parameter model. Excellent for RAG, tool use, and enterprise applications.
DeepSeek
DeepSeek V3
DeepSeek's latest flagship model with 685B total parameters (37B active via MoE). State-of-the-art performance.
OpenAI
Whisper Tiny
Fastest Whisper model. Runs on anything including phones. 10x faster than Large.
OpenAI
Whisper Small
Fast transcription for resource-limited devices. 4x faster than Large.
Meta
MusicGen Small
Lightweight music generation. Quick and accessible on most hardware.
guoyww
AnimateDiff v1.5
Animate existing Stable Diffusion images. Lightweight motion module on top of SD 1.5.
OpenAI
Whisper Medium
Good balance of speed and accuracy. 2x faster than Large, slight quality trade-off.
BigCode
StarCoder2 15B
Trained on The Stack v2 with 600+ languages. Strong code completion.
01.AI
Yi 1.5 34B Chat
Strong 34B model from 01.AI with good multilingual and reasoning performance.