Large
AceReason-Nemotron-14B specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba-Mini-1.5 specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba-Mini-1.6 specs, VRAM requirements, and which GPUs can run it.
CodeLlama-13b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.
DeepSeek R1 Distill 14B specs, VRAM requirements, and which GPUs can run it. Reasoning-focused model that punches above its weight.
deepseek-coder-33b-base specs, VRAM requirements, and which GPUs can run it.
deepseek-coder-33b-instruct specs, VRAM requirements, and which GPUs can run it.
DeepSeek-Coder-V2-Lite-Base specs, VRAM requirements, and which GPUs can run it.
deepseek-moe-16b-base specs, VRAM requirements, and which GPUs can run it.
deepseek-moe-16b-chat specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-Distill-Qwen-14B specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-Distill-Qwen-32B specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2-Lite specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2-Lite-Chat specs, VRAM requirements, and which GPUs can run it.
dolphin-2.9.1-yi-1.5-34b specs, VRAM requirements, and which GPUs can run it.
Dolphin-Mistral-24B-Venice-Edition specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-34B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-34B-Instruct specs, VRAM requirements, and which GPUs can run it.
gemma-2-27b-it specs, VRAM requirements, and which GPUs can run it.
gpt-oss-20b specs, VRAM requirements, and which GPUs can run it.
Hermes-4-14B specs, VRAM requirements, and which GPUs can run it.
internlm2-chat-20b specs, VRAM requirements, and which GPUs can run it.
LFM2-24B-A2B specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.
Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.
Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.
Nous-Hermes-2-Mixtral-8x7B-DPO specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0325-32B specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0325-32B-Instruct specs, VRAM requirements, and which GPUs can run it.
OLMo-2-1124-13B-Instruct specs, VRAM requirements, and which GPUs can run it.
Olmo-3-1125-32B specs, VRAM requirements, and which GPUs can run it.
Olmo-3-32B-Think specs, VRAM requirements, and which GPUs can run it.
Olmo-3.1-32B-Think specs, VRAM requirements, and which GPUs can run it.
Phi-3-medium-4k-instruct specs, VRAM requirements, and which GPUs can run it.
polyglot-ko-12.8b specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-32B specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-14B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-32B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-14B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-8bit specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.
Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.
QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.
StableBeluga-13B specs, VRAM requirements, and which GPUs can run it.
starchat-alpha specs, VRAM requirements, and which GPUs can run it.
Strand-Rust-Coder-14B-v1 specs, VRAM requirements, and which GPUs can run it.
tulu-2-dpo-70b specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-32K specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-Chat specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-Chat-16K specs, VRAM requirements, and which GPUs can run it.