AI Models

Browse 45+ AI models across chat, coding, image generation, video, audio, and more.

Featured

Coqui

XTTS v2

Multilingual TTS with voice cloning from 6-second samples. 17 languages. Very lightweight.

🔊 TTS · 0.46B · 2GB+ VRAM

Featured

OpenAI

Whisper Turbo

8x faster than Large with minimal quality loss. Best speed/accuracy trade-off.

🎤 Speech · 0.81B · 6GB+ VRAM

Featured

Stability AI

Stable Diffusion 1.5

The classic early Stable Diffusion release. Huge ecosystem of LoRAs, ControlNets, and extensions. Best compatibility.

🎨 Image Gen · 0.86B · 4GB+ VRAM

Featured

Suno

Bark

Versatile text-to-speech. Generates speech, music, and sound effects. Ships with preset speaker voices rather than custom voice cloning.

🔊 TTS · 1B · 4GB+ VRAM

Featured

Meta

Llama 3.2 1B Instruct

Ultra-compact 1B model. Runs on virtually any device including smartphones.

💬 Chat · 1.24B · 1.3GB+ VRAM

Featured

Alibaba

Wan 2.1 1.3B

Lightweight video generation. Consumer GPU friendly with CPU offloading.

🎬 Video Gen · 1.3B · 8GB+ VRAM

Featured

Meta

MusicGen Medium

AI music generation from text prompts. Creates 30-second clips of genre-specific music.

🎵 Music · 1.5B · 6GB+ VRAM

Featured

OpenAI

Whisper Large v3

The most accurate Whisper model for speech-to-text. Supports 99 languages. The gold standard for transcription.

🎤 Speech · 1.55B · 10GB+ VRAM

Featured

Tsinghua/ZhipuAI

CogVideoX 2B

Accessible video generation model. Creates short video clips from text descriptions.

🎬 Video Gen · 2B · 6GB+ VRAM

Featured

Meta

Llama 3.2 3B Instruct

Meta's compact 3B model designed for edge and mobile deployment.

💬 Chat · 3.2B · 2.6GB+ VRAM

Featured

Stability AI

Stable Diffusion XL

Higher resolution (1024x1024) image generation. Better detail and prompt following than SD 1.5.

🎨 Image Gen · 3.5B · 8GB+ VRAM

Featured

Microsoft

Phi-3.5 Mini 3.8B

Tiny but capable 3.8B model. Runs on almost any hardware including phones.

💬 Chat · 3.8B · 3GB+ VRAM

Featured

Tsinghua/ZhipuAI

CogVideoX 5B

Higher quality video generation. Better motion and detail than the 2B variant.

🎬 Video Gen · 5B · 12GB+ VRAM

Featured

LLaVA Team

LLaVA 1.6 7B

Lightweight vision-language model. Analyze images on consumer hardware.

👁️ Vision · 7B · 5.5GB+ VRAM

Featured

Alibaba

Qwen2-VL 7B

Multimodal model supporting image, video, and text input. Dynamic resolution handling.

👁️ Vision · 7B · 12GB+ VRAM

Featured

Mistral AI

Mistral 7B Instruct v0.3

Efficient 7B model from Mistral AI with strong performance for its size.

💬 Chat · 7.3B · 5GB+ VRAM

Featured

Alibaba

Qwen 2.5 7B Instruct

Efficient 7B model with strong coding and reasoning abilities. Great for local deployment.

💬 Chat · 7.6B · 5.3GB+ VRAM

Featured

Alibaba

Qwen 2.5 Coder 7B

Efficient coding model with 131K context. Great local Copilot replacement.

💻 Code · 7.6B · 5.3GB+ VRAM

Featured

Meta

Llama 3.1 8B Instruct

Meta's 8B parameter instruction-tuned model. Great balance of performance and efficiency for local deployment.

💬 Chat · 8B · 5.5GB+ VRAM

Featured

DeepSeek

DeepSeek R1 Distill 8B

Compact reasoning model. Good reasoning capabilities in a small package.

💬 Chat · 8B · 5.5GB+ VRAM

Featured

Stability AI

Stable Diffusion 3.5 Large

Latest SD with MMDiT architecture. Best prompt adherence with T5-XXL encoder.

🎨 Image Gen · 8B · 6GB+ VRAM

Featured

Google

Gemma 2 9B Instruct

Google's efficient 9B model. Great performance-to-size ratio.

💬 Chat · 9.2B · 6.2GB+ VRAM

Featured

Genmo

Mochi 1

High-quality text-to-video. Produces realistic motion but requires significant VRAM.

🎬 Video Gen · 10B · 24GB+ VRAM

Featured

Black Forest Labs

Flux.1 Dev

State-of-the-art image generation. 12B parameters. Best open-source image quality available.

🎨 Image Gen · 12B · 8GB+ VRAM

Featured

Black Forest Labs

Flux.1 Schnell

Fast version of Flux. Needs only 1 to 4 inference steps. Near-instant generation on good GPUs.

🎨 Image Gen · 12B · 8GB+ VRAM

Featured

Microsoft

Phi-4

Microsoft's 14B parameter model. Punches well above its weight class on reasoning benchmarks.

💬 Chat · 14B · 9.5GB+ VRAM

Featured

DeepSeek

DeepSeek Coder V2 Lite 16B

Lightweight MoE code model. Strong coding with only 2.4B active parameters.

💻 Code · 16B · 10.5GB+ VRAM

Featured

Mistral AI

Codestral 22B

Mistral's dedicated code generation model. Supports 80+ programming languages.

💻 Code · 22B · 14GB+ VRAM

Featured

Google

Gemma 2 27B Instruct

Google's 27B instruction-tuned model. Strong general performance with efficient architecture.

💬 Chat · 27B · 17GB+ VRAM

Featured

Alibaba

Qwen 2.5 Coder 32B

Specialized coding model. One of the best open-source coding assistants available.

💻 Code · 32B · 20GB+ VRAM

Featured

DeepSeek

DeepSeek R1 Distill 32B

Reasoning-focused model distilled from DeepSeek R1. Excellent at complex problem-solving.

💬 Chat · 32B · 20GB+ VRAM

Featured

LLaVA Team

LLaVA 1.6 34B

Vision-language model. Analyze images and answer questions about them. Best quality variant.

👁️ Vision · 34B · 22GB+ VRAM

Featured

Mistral AI

Mixtral 8x7B Instruct

Mixture-of-experts model with 8 experts per layer, 2 of them active per token (46.7B total parameters, roughly 13B active). Excellent quality at moderate resource requirements.

💬 Chat · 46.7B · 28GB+ VRAM

Featured

Meta

Llama 3.1 70B Instruct

Meta's flagship 70B parameter model. Excellent performance rivaling GPT-4 on many benchmarks.

💬 Chat · 70B · 42GB+ VRAM

Featured

Meta

Llama 3.3 70B Instruct

Latest Meta 70B model with improved reasoning and multilingual capabilities.

💬 Chat · 70B · 42GB+ VRAM

Featured

Alibaba

Qwen 2.5 72B Instruct

Alibaba's top-tier 72B model. Excellent at coding, math, and multilingual tasks.

💬 Chat · 72B · 44GB+ VRAM

Featured

Cohere

Command R+ 104B

Cohere's 104B parameter model. Excellent for RAG, tool use, and enterprise applications.

💬 Chat · 104B · 62GB+ VRAM

Featured

DeepSeek

DeepSeek V3

DeepSeek's latest flagship model with 685B total parameters (37B active via MoE). State-of-the-art performance.

💬 Chat · 685B · 390GB+ VRAM

OpenAI

Whisper Tiny

Fastest Whisper model. Runs on anything including phones. 10x faster than Large.

🎤 Speech · 0.039B · 1GB+ VRAM

OpenAI

Whisper Small

Fast transcription for resource-limited devices. 4x faster than Large.

🎤 Speech · 0.24B · 2GB+ VRAM

Meta

MusicGen Small

Lightweight music generation. Quick and accessible on most hardware.

🎵 Music · 0.3B · 2GB+ VRAM

guoyww

AnimateDiff v1.5

Generates short animations with existing Stable Diffusion 1.5 checkpoints. Lightweight motion module on top of SD 1.5.

🎬 Video Gen · 0.4B · 8GB+ VRAM

OpenAI

Whisper Medium

Good balance of speed and accuracy. 2x faster than Large, slight quality trade-off.

🎤 Speech · 0.77B · 5GB+ VRAM

BigCode

StarCoder2 15B

Trained on The Stack v2, which covers 600+ programming languages. Strong code completion.

💻 Code · 15B · 10GB+ VRAM

01.AI

Yi 1.5 34B Chat

Strong 34B model from 01.AI with good multilingual and reasoning performance.

💬 Chat · 34B · 21GB+ VRAM
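
Each card in this listing gives a category, a parameter count, and a minimum VRAM figure, so one way to use it is to filter by the memory you actually have. The snippet below is a minimal sketch of that idea, not part of this site: the `Model` type, the `CATALOG` list, and the `models_that_fit` helper are hypothetical names, and the few entries shown are copied by hand from the cards above.

```python
from typing import NamedTuple, Optional


class Model(NamedTuple):
    name: str
    category: str       # category tag from the card, e.g. "Chat" or "Code"
    params_b: float     # parameter count in billions, as listed
    min_vram_gb: float  # minimum VRAM in GB, as listed


# A handful of entries copied from the listing (not the full catalog).
CATALOG = [
    Model("Whisper Tiny", "Speech", 0.039, 1),
    Model("Llama 3.2 1B Instruct", "Chat", 1.24, 1.3),
    Model("Llama 3.1 8B Instruct", "Chat", 8, 5.5),
    Model("Qwen 2.5 Coder 7B", "Code", 7.6, 5.3),
    Model("Phi-4", "Chat", 14, 9.5),
    Model("Qwen 2.5 Coder 32B", "Code", 32, 20),
    Model("Llama 3.3 70B Instruct", "Chat", 70, 42),
]


def models_that_fit(catalog, vram_gb: float, category: Optional[str] = None):
    """Return models whose listed minimum VRAM fits the budget, largest first."""
    fits = [
        m for m in catalog
        if m.min_vram_gb <= vram_gb
        and (category is None or m.category == category)
    ]
    return sorted(fits, key=lambda m: m.params_b, reverse=True)


if __name__ == "__main__":
    # Example: chat models that fit on a 12 GB GPU.
    for m in models_that_fit(CATALOG, vram_gb=12, category="Chat"):
        print(f"{m.name}: {m.params_b}B, {m.min_vram_gb}GB+ VRAM")
```

Sorting by parameter count is only a rough proxy for quality; the listed VRAM figures are minimums, so leave headroom for longer contexts or larger batches.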