Large

AceReason-Nemotron-14B

AceReason-Nemotron-14B specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.5

AI21-Jamba-Mini-1.5 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.6

AI21-Jamba-Mini-1.6 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba2-Mini

AI21-Jamba2-Mini specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba2-Mini-FP8

AI21-Jamba2-Mini-FP8 specs, VRAM requirements, and which GPUs can run it.

CodeLlama-13b-hf

CodeLlama-13b-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-13b-Instruct-hf

CodeLlama-13b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-34b-hf

CodeLlama-34b-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-34b-Instruct-hf

CodeLlama-34b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

DeepSeek R1 Distill 14B

DeepSeek R1 Distill 14B specs, VRAM requirements, and which GPUs can run it. Reasoning-focused model that punches above its weight.

deepseek-coder-33b-base

deepseek-coder-33b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-33b-instruct

deepseek-coder-33b-instruct specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Lite-Base

DeepSeek-Coder-V2-Lite-Base specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Lite-Instruct

DeepSeek-Coder-V2-Lite-Instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-base

deepseek-moe-16b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-chat

deepseek-moe-16b-chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-14B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill-Qwen-32B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite

DeepSeek-V2-Lite specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite-Chat

DeepSeek-V2-Lite-Chat specs, VRAM requirements, and which GPUs can run it.

DiarizationLM-13b-Fisher-v1

DiarizationLM-13b-Fisher-v1 specs, VRAM requirements, and which GPUs can run it.

dolphin-2.9.1-yi-1.5-34b

dolphin-2.9.1-yi-1.5-34b specs, VRAM requirements, and which GPUs can run it.

Dolphin-Mistral-24B-Venice-Edition

Dolphin-Mistral-24B-Venice-Edition specs, VRAM requirements, and which GPUs can run it.

ESFT-vanilla-lite

ESFT-vanilla-lite specs, VRAM requirements, and which GPUs can run it.

falcon-40b

falcon-40b specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Base

Falcon-H1-34B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Instruct

Falcon-H1-34B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Instruct-GPTQ-Int8

Falcon-H1-34B-Instruct-GPTQ-Int8 specs, VRAM requirements, and which GPUs can run it.

FlexOlmo-7x7B-1T-RT

FlexOlmo-7x7B-1T-RT specs, VRAM requirements, and which GPUs can run it.

gemma-2-27b-it

gemma-2-27b-it specs, VRAM requirements, and which GPUs can run it.

Gemma-4-31B-IT-NVFP4

Gemma-4-31B-IT-NVFP4 specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash

GLM-4.7-Flash specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-FP8-Dynamic

GLM-4.7-Flash-FP8-Dynamic specs, VRAM requirements, and which GPUs can run it.

gpt-oss-20b

gpt-oss-20b specs, VRAM requirements, and which GPUs can run it.

Hermes-4-14B

Hermes-4-14B specs, VRAM requirements, and which GPUs can run it.

Hermes-4-14B-FP8

Hermes-4-14B-FP8 specs, VRAM requirements, and which GPUs can run it.

internlm2_5-20b-chat

internlm2_5-20b-chat specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-20b

internlm2-chat-20b specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-base-beta-70b

japanese-stablelm-base-beta-70b specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-instruct-beta-70b

japanese-stablelm-instruct-beta-70b specs, VRAM requirements, and which GPUs can run it.

Karnak

Karnak specs, VRAM requirements, and which GPUs can run it.

LFM2-24B-A2B

LFM2-24B-A2B specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1

Llama-3_3-Nemotron-Super-49B-v1 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1-FP8

Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.

Mistral-Small-24B-Instruct-2501-AWQ

Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.

Mixtral-8x7B-Instruct-v0.1-GPTQ

Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.

Nemotron-3-Nano-30B-A3B

Nemotron-3-Nano-30B-A3B specs, VRAM requirements, and which GPUs can run it.

Nemotron-Cascade-2-30B-A3B

Nemotron-Cascade-2-30B-A3B specs, VRAM requirements, and which GPUs can run it.

NextCoder-14B

NextCoder-14B specs, VRAM requirements, and which GPUs can run it.

NextCoder-32B

NextCoder-32B specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mixtral-8x7B-DPO

Nous-Hermes-2-Mixtral-8x7B-DPO specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-Llama2-13b

Nous-Hermes-Llama2-13b specs, VRAM requirements, and which GPUs can run it.

NousCoder-14B

NousCoder-14B specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B

OLMo-2-0325-32B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B-Instruct

OLMo-2-0325-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-13B-DPO

OLMo-2-1124-13B-DPO specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-13B-Instruct

OLMo-2-1124-13B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1125-32B

Olmo-3-1125-32B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-32B-Think

Olmo-3-32B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3-32B-Think-DPO

Olmo-3-32B-Think-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-32B-Think

Olmo-3.1-32B-Think specs, VRAM requirements, and which GPUs can run it.

OpenReasoning-Nemotron-32B

OpenReasoning-Nemotron-32B specs, VRAM requirements, and which GPUs can run it.

OptiMind-SFT

OptiMind-SFT specs, VRAM requirements, and which GPUs can run it.

Phi-3-medium-4k-instruct

Phi-3-medium-4k-instruct specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-12.8b

polyglot-ko-12.8b specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-14B-Instruct-AWQ

Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B

Qwen2.5-32B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B-Instruct-AWQ

Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B-Instruct-GPTQ-Int4

Qwen2.5-32B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-14B-Instruct

Qwen2.5-Coder-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct

Qwen2.5-Coder-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct-AWQ

Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-AWQ

Qwen3-14B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-FP8

Qwen3-14B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-Instruct

Qwen3-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-Instruct-2507-FP8

Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-NVFP4

Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-Thinking-2507

Qwen3-30B-A3B-Thinking-2507 specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-AWQ

Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-NVFP4

Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-30B-A3B-Instruct

Qwen3-Coder-30B-A3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-30B-A3B-Instruct-FP8

Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-8bit

Qwen3-Coder-Next-8bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-AWQ-4bit

Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-VL-30B-A3B-Instruct-AWQ

Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2 specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Text-NVFP4-MTP

Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.

QwQ-32B-AWQ

QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

sarvam-105b-uncensored

sarvam-105b-uncensored specs, VRAM requirements, and which GPUs can run it.

shieldgemma-27b

shieldgemma-27b specs, VRAM requirements, and which GPUs can run it.

StableBeluga-13B

StableBeluga-13B specs, VRAM requirements, and which GPUs can run it.

StableBeluga1-Delta

StableBeluga1-Delta specs, VRAM requirements, and which GPUs can run it.

starchat-alpha

starchat-alpha specs, VRAM requirements, and which GPUs can run it.

starchat-beta

starchat-beta specs, VRAM requirements, and which GPUs can run it.

Strand-Rust-Coder-14B-v1

Strand-Rust-Coder-14B-v1 specs, VRAM requirements, and which GPUs can run it.

tulu-2-dpo-70b

tulu-2-dpo-70b specs, VRAM requirements, and which GPUs can run it.

txgemma-27b-chat

txgemma-27b-chat specs, VRAM requirements, and which GPUs can run it.

txgemma-27b-predict

txgemma-27b-predict specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B

Yi-1.5-34B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-32K

Yi-1.5-34B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat

Yi-1.5-34B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat-16K

Yi-1.5-34B-Chat-16K specs, VRAM requirements, and which GPUs can run it.

Yi-34B

Yi-34B specs, VRAM requirements, and which GPUs can run it.

Yi-34B-Chat

Yi-34B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-34B-Chat-8bits

Yi-34B-Chat-8bits specs, VRAM requirements, and which GPUs can run it.