Large
AceReason-Nemotron-14B specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba-Mini-1.5 specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba-Mini-1.6 specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba2-Mini specs, VRAM requirements, and which GPUs can run it.
AI21-Jamba2-Mini-FP8 specs, VRAM requirements, and which GPUs can run it.
CodeLlama-13b-hf specs, VRAM requirements, and which GPUs can run it.
CodeLlama-13b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.
CodeLlama-34b-hf specs, VRAM requirements, and which GPUs can run it.
CodeLlama-34b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.
DeepSeek R1 Distill 14B specs, VRAM requirements, and which GPUs can run it. Reasoning-focused model that punches above its weight.
deepseek-coder-33b-base specs, VRAM requirements, and which GPUs can run it.
deepseek-coder-33b-instruct specs, VRAM requirements, and which GPUs can run it.
DeepSeek-Coder-V2-Lite-Base specs, VRAM requirements, and which GPUs can run it.
DeepSeek-Coder-V2-Lite-Instruct specs, VRAM requirements, and which GPUs can run it.
deepseek-moe-16b-base specs, VRAM requirements, and which GPUs can run it.
deepseek-moe-16b-chat specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-Distill-Qwen-14B specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-Distill-Qwen-32B specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2-Lite specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V2-Lite-Chat specs, VRAM requirements, and which GPUs can run it.
DiarizationLM-13b-Fisher-v1 specs, VRAM requirements, and which GPUs can run it.
dolphin-2.9.1-yi-1.5-34b specs, VRAM requirements, and which GPUs can run it.
Dolphin-Mistral-24B-Venice-Edition specs, VRAM requirements, and which GPUs can run it.
ESFT-vanilla-lite specs, VRAM requirements, and which GPUs can run it.
falcon-40b specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-34B-Base specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-34B-Instruct specs, VRAM requirements, and which GPUs can run it.
Falcon-H1-34B-Instruct-GPTQ-Int8 specs, VRAM requirements, and which GPUs can run it.
FlexOlmo-7x7B-1T-RT specs, VRAM requirements, and which GPUs can run it.
gemma-2-27b-it specs, VRAM requirements, and which GPUs can run it.
Gemma-4-31B-IT-NVFP4 specs, VRAM requirements, and which GPUs can run it.
GLM-4.7-Flash specs, VRAM requirements, and which GPUs can run it.
GLM-4.7-Flash-FP8-Dynamic specs, VRAM requirements, and which GPUs can run it.
gpt-oss-20b specs, VRAM requirements, and which GPUs can run it.
Hermes-4-14B specs, VRAM requirements, and which GPUs can run it.
Hermes-4-14B-FP8 specs, VRAM requirements, and which GPUs can run it.
internlm2_5-20b-chat specs, VRAM requirements, and which GPUs can run it.
internlm2-chat-20b specs, VRAM requirements, and which GPUs can run it.
japanese-stablelm-base-beta-70b specs, VRAM requirements, and which GPUs can run it.
japanese-stablelm-instruct-beta-70b specs, VRAM requirements, and which GPUs can run it.
Karnak specs, VRAM requirements, and which GPUs can run it.
LFM2-24B-A2B specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.
Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.
Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.
Nemotron-3-Nano-30B-A3B specs, VRAM requirements, and which GPUs can run it.
Nemotron-Cascade-2-30B-A3B specs, VRAM requirements, and which GPUs can run it.
NextCoder-14B specs, VRAM requirements, and which GPUs can run it.
NextCoder-32B specs, VRAM requirements, and which GPUs can run it.
Nous-Hermes-2-Mixtral-8x7B-DPO specs, VRAM requirements, and which GPUs can run it.
Nous-Hermes-Llama2-13b specs, VRAM requirements, and which GPUs can run it.
NousCoder-14B specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0325-32B specs, VRAM requirements, and which GPUs can run it.
OLMo-2-0325-32B-Instruct specs, VRAM requirements, and which GPUs can run it.
OLMo-2-1124-13B-DPO specs, VRAM requirements, and which GPUs can run it.
OLMo-2-1124-13B-Instruct specs, VRAM requirements, and which GPUs can run it.
Olmo-3-1125-32B specs, VRAM requirements, and which GPUs can run it.
Olmo-3-32B-Think specs, VRAM requirements, and which GPUs can run it.
Olmo-3-32B-Think-DPO specs, VRAM requirements, and which GPUs can run it.
Olmo-3.1-32B-Think specs, VRAM requirements, and which GPUs can run it.
OpenReasoning-Nemotron-32B specs, VRAM requirements, and which GPUs can run it.
OptiMind-SFT specs, VRAM requirements, and which GPUs can run it.
Phi-3-medium-4k-instruct specs, VRAM requirements, and which GPUs can run it.
polyglot-ko-12.8b specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-32B specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-32B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-14B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-32B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-14B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-14B-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-14B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-Thinking-2507 specs, VRAM requirements, and which GPUs can run it.
Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-30B-A3B-Instruct specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-8bit specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.
Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2 specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.
QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.
sarvam-105b-uncensored specs, VRAM requirements, and which GPUs can run it.
shieldgemma-27b specs, VRAM requirements, and which GPUs can run it.
StableBeluga-13B specs, VRAM requirements, and which GPUs can run it.
StableBeluga1-Delta specs, VRAM requirements, and which GPUs can run it.
starchat-alpha specs, VRAM requirements, and which GPUs can run it.
starchat-beta specs, VRAM requirements, and which GPUs can run it.
Strand-Rust-Coder-14B-v1 specs, VRAM requirements, and which GPUs can run it.
tulu-2-dpo-70b specs, VRAM requirements, and which GPUs can run it.
txgemma-27b-chat specs, VRAM requirements, and which GPUs can run it.
txgemma-27b-predict specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-32K specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-Chat specs, VRAM requirements, and which GPUs can run it.
Yi-1.5-34B-Chat-16K specs, VRAM requirements, and which GPUs can run it.
Yi-34B specs, VRAM requirements, and which GPUs can run it.
Yi-34B-Chat specs, VRAM requirements, and which GPUs can run it.
Yi-34B-Chat-8bits specs, VRAM requirements, and which GPUs can run it.