Llm

AceReason-Nemotron-14B

AceReason-Nemotron-14B specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Large-1.5

AI21-Jamba-Large-1.5 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.5

AI21-Jamba-Mini-1.5 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.6

AI21-Jamba-Mini-1.6 specs, VRAM requirements, and which GPUs can run it.

Athene-70B-Preview

Athene-70B-Preview specs, VRAM requirements, and which GPUs can run it.

Athene-V2-Agent

Athene-V2-Agent specs, VRAM requirements, and which GPUs can run it.

bigscience-small-testing

bigscience-small-testing specs, VRAM requirements, and which GPUs can run it.

bitnet-b1.58-2B-4T-bf16

bitnet-b1.58-2B-4T-bf16 specs, VRAM requirements, and which GPUs can run it.

bloom-1b1

bloom-1b1 specs, VRAM requirements, and which GPUs can run it.

bloom-1b7

bloom-1b7 specs, VRAM requirements, and which GPUs can run it.

bloom-3b

bloom-3b specs, VRAM requirements, and which GPUs can run it.

bloom-560m

bloom-560m specs, VRAM requirements, and which GPUs can run it.

bloom-7b1

bloom-7b1 specs, VRAM requirements, and which GPUs can run it.

bloomz

bloomz specs, VRAM requirements, and which GPUs can run it.

bloomz-1b7

bloomz-1b7 specs, VRAM requirements, and which GPUs can run it.

bloomz-3b

bloomz-3b specs, VRAM requirements, and which GPUs can run it.

bloomz-560m

bloomz-560m specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1

bloomz-7b1 specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1-mt

bloomz-7b1-mt specs, VRAM requirements, and which GPUs can run it.

Bolmo-1B

Bolmo-1B specs, VRAM requirements, and which GPUs can run it.

codegemma-2b

codegemma-2b specs, VRAM requirements, and which GPUs can run it.

CodeLlama-13b-Instruct-hf

CodeLlama-13b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-7b-Instruct-hf

CodeLlama-7b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

deep-ignorance-unfiltered

deep-ignorance-unfiltered specs, VRAM requirements, and which GPUs can run it.

DeepSeek R1 Distill 14B

DeepSeek R1 Distill 14B specs, VRAM requirements, and which GPUs can run it. Reasoning-focused model that punches above its weight.

deepseek-coder-33b-base

deepseek-coder-33b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-33b-instruct

deepseek-coder-33b-instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-6.7b-base

deepseek-coder-6.7b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-6.7b-instruct

deepseek-coder-6.7b-instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-7b-base-v1.5

deepseek-coder-7b-base-v1.5 specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-7b-instruct-v1.5

deepseek-coder-7b-instruct-v1.5 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Instruct

DeepSeek-Coder-V2-Instruct specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Instruct-0724

DeepSeek-Coder-V2-Instruct-0724 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Lite-Base

DeepSeek-Coder-V2-Lite-Base specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-base

deepseek-moe-16b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-chat

deepseek-moe-16b-chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528

DeepSeek-R1-0528 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-NVFP4

DeepSeek-R1-0528-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-NVFP4-v2

DeepSeek-R1-0528-NVFP4-v2 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B

DeepSeek-R1-0528-Qwen3-8B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B-MLX-4bit

DeepSeek-R1-0528-Qwen3-8B-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B-MLX-8bit

DeepSeek-R1-0528-Qwen3-8B-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-14B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill-Qwen-32B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-7B

DeepSeek-R1-Distill-Qwen-7B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-NVFP4

DeepSeek-R1-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2

DeepSeek-V2 specs, VRAM requirements, and which GPUs can run it.

Deepseek-V2 Pro

Deepseek-V2 Pro specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Chat

DeepSeek-V2-Chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Chat-0628

DeepSeek-V2-Chat-0628 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite

DeepSeek-V2-Lite specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite-Chat

DeepSeek-V2-Lite-Chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2.5

DeepSeek-V2.5 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3-0324

DeepSeek-V3-0324 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3-0324-NVFP4

DeepSeek-V3-0324-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.1-NVFP4

DeepSeek-V3.1-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.2

DeepSeek-V3.2 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.2-NVFP4

DeepSeek-V3.2-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DialoGPT-small

DialoGPT-small specs, VRAM requirements, and which GPUs can run it.

distilgpt2

distilgpt2 specs, VRAM requirements, and which GPUs can run it.

dolphin-2.9.1-yi-1.5-34b

dolphin-2.9.1-yi-1.5-34b specs, VRAM requirements, and which GPUs can run it.

Dolphin-Mistral-24B-Venice-Edition

Dolphin-Mistral-24B-Venice-Edition specs, VRAM requirements, and which GPUs can run it.

ELM

ELM specs, VRAM requirements, and which GPUs can run it.

falcon-11B

falcon-11B specs, VRAM requirements, and which GPUs can run it.

falcon-7b-instruct

falcon-7b-instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-0.5B-Base

Falcon-H1-0.5B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-0.5B-Instruct

Falcon-H1-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Base

Falcon-H1-1.5B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Instruct

Falcon-H1-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Base

Falcon-H1-34B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Instruct

Falcon-H1-34B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-3B-Base

Falcon-H1-3B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-3B-Instruct

Falcon-H1-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-7B-Base

Falcon-H1-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-7B-Instruct

Falcon-H1-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-90M-Instruct

Falcon-H1-Tiny-90M-Instruct specs, VRAM requirements, and which GPUs can run it.

falcon-mamba-7b-instruct

falcon-mamba-7b-instruct specs, VRAM requirements, and which GPUs can run it.

falcon-mamba-tiny-dev

falcon-mamba-tiny-dev specs, VRAM requirements, and which GPUs can run it.

Falcon3-10B-Base

Falcon3-10B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-1B-Instruct

Falcon3-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-3B-Base

Falcon3-3B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-3B-Instruct

Falcon3-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Base

Falcon3-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Instruct

Falcon3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Flex-reddit-2x7B-1T

Flex-reddit-2x7B-1T specs, VRAM requirements, and which GPUs can run it.

gemma-1.1-2b-it

gemma-1.1-2b-it specs, VRAM requirements, and which GPUs can run it.

gemma-1.1-7b-it

gemma-1.1-7b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-27b-it

gemma-2-27b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-9b-it

gemma-2-9b-it specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-MLX-6bit

GLM-4.7-Flash-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-MLX-8bit

GLM-4.7-Flash-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

gpt-neo-1.3B

gpt-neo-1.3B specs, VRAM requirements, and which GPUs can run it.

gpt-neo-125m

gpt-neo-125m specs, VRAM requirements, and which GPUs can run it.

gpt-neo-2.7B

gpt-neo-2.7B specs, VRAM requirements, and which GPUs can run it.

gpt-oss-120b

gpt-oss-120b specs, VRAM requirements, and which GPUs can run it.

gpt-oss-120b-Eagle3-long-context

gpt-oss-120b-Eagle3-long-context specs, VRAM requirements, and which GPUs can run it.

gpt-oss-20b

gpt-oss-20b specs, VRAM requirements, and which GPUs can run it.

gpt2

gpt2 specs, VRAM requirements, and which GPUs can run it.

gpt2-large

gpt2-large specs, VRAM requirements, and which GPUs can run it.

gpt2-medium

gpt2-medium specs, VRAM requirements, and which GPUs can run it.

gpt2-mini

gpt2-mini specs, VRAM requirements, and which GPUs can run it.

h2ovl-mississippi-2b

h2ovl-mississippi-2b specs, VRAM requirements, and which GPUs can run it.

h2ovl-mississippi-800m

h2ovl-mississippi-800m specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Pro-Llama-3-8B

Hermes-2-Pro-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Pro-Mistral-7B

Hermes-2-Pro-Mistral-7B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Theta-Llama-3-8B

Hermes-2-Theta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-3-Llama-3.1-8B

Hermes-3-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-4-14B

Hermes-4-14B specs, VRAM requirements, and which GPUs can run it.

internlm2_5-7b

internlm2_5-7b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-1_8b

internlm2-chat-1_8b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-20b

internlm2-chat-20b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-7b-sft

internlm2-chat-7b-sft specs, VRAM requirements, and which GPUs can run it.

Jan-v3-4B-base-instruct

Jan-v3-4B-base-instruct specs, VRAM requirements, and which GPUs can run it.

japanese-gpt-neox-small

japanese-gpt-neox-small specs, VRAM requirements, and which GPUs can run it.

LFM2-24B-A2B

LFM2-24B-A2B specs, VRAM requirements, and which GPUs can run it.

LFM2-8B-A1B

LFM2-8B-A1B specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct

LFM2.5-1.2B-Instruct specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-4bit

LFM2.5-1.2B-Instruct-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-6bit

LFM2.5-1.2B-Instruct-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-8bit

LFM2.5-1.2B-Instruct-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

Llama 3.1 70B

Llama 3.1 70B specs, VRAM requirements, and which GPUs can run it. The sweet spot for local reasoning.

Llama 3.1 8B

Llama 3.1 8B specs, VRAM requirements, and which GPUs can run it. The go-to small model for local inference.

Llama-2-7b-hf

Llama-2-7b-hf specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1

Llama-3_3-Nemotron-Super-49B-v1 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1-FP8

Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-405B-Instruct

Llama-3.1-405B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-405B-Instruct-FP8

Llama-3.1-405B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-70B-Instruct

Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-8B-Instruct-FP8

Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Tulu-3-8B-SFT

Llama-3.1-Tulu-3-8B-SFT specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B

Llama-3.2-1B specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct-FP8

Llama-3.2-1B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct-FP8-dynamic

Llama-3.2-1B-Instruct-FP8-dynamic specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-3B

Llama-3.2-3B specs, VRAM requirements, and which GPUs can run it.

llama-3.3-70b-instruct-awq

llama-3.3-70b-instruct-awq specs, VRAM requirements, and which GPUs can run it.

llama-300M-v3-original

llama-300M-v3-original specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-8B

Llama-Guard-3-8B specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-8B-INT8

Llama-Guard-3-8B-INT8 specs, VRAM requirements, and which GPUs can run it.

LlamaGuard-7b

LlamaGuard-7b specs, VRAM requirements, and which GPUs can run it.

llm-jp-3-3.7b-instruct

llm-jp-3-3.7b-instruct specs, VRAM requirements, and which GPUs can run it.

LocoOperator-4B

LocoOperator-4B specs, VRAM requirements, and which GPUs can run it.

maira-2

maira-2 specs, VRAM requirements, and which GPUs can run it.

MediPhi-Clinical

MediPhi-Clinical specs, VRAM requirements, and which GPUs can run it.

MediPhi-Instruct

MediPhi-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-70B-Instruct

Meta-Llama-3-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-8B

Meta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-8B-Instruct

Meta-Llama-3-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-70B-Instruct

Meta-Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B

Meta-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct

Meta-Llama-3.1-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct-bnb-4bit

Meta-Llama-3.1-8B-Instruct-bnb-4bit specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct-FP8

Meta-Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-Guard-2-8B

Meta-Llama-Guard-2-8B specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2-AWQ

MiniMax-M2-AWQ specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2.5

MiniMax-M2.5 specs, VRAM requirements, and which GPUs can run it.

Mistral 7B

Mistral 7B specs, VRAM requirements, and which GPUs can run it. Efficient and fast for everyday tasks.

Mistral-7B-Instruct-v0.2

Mistral-7B-Instruct-v0.2 specs, VRAM requirements, and which GPUs can run it.

mistral-7b-v0.3-bnb-4bit

mistral-7b-v0.3-bnb-4bit specs, VRAM requirements, and which GPUs can run it.

Mistral-NeMo-Minitron-8B-Instruct

Mistral-NeMo-Minitron-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Mistral-Small-24B-Instruct-2501-AWQ

Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.

Mixtral-8x7B-Instruct-v0.1-GPTQ

Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.

Nanbeige4.1-3B

Nanbeige4.1-3B specs, VRAM requirements, and which GPUs can run it.

Nanbeige4.1-3B-heretic

Nanbeige4.1-3B-heretic specs, VRAM requirements, and which GPUs can run it.

Nemotron-Flash-3B

Nemotron-Flash-3B specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-4B-Base-8K

Nemotron-H-4B-Base-8K specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-4B-Instruct-128K

Nemotron-H-4B-Instruct-128K specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mistral-7B-DPO

Nous-Hermes-2-Mistral-7B-DPO specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mixtral-8x7B-DPO

Nous-Hermes-2-Mixtral-8x7B-DPO specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-SOLAR-10.7B

Nous-Hermes-2-SOLAR-10.7B specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-llama-2-7b

Nous-Hermes-llama-2-7b specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2

NVIDIA-Nemotron-Nano-9B-v2 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-Base

NVIDIA-Nemotron-Nano-9B-v2-Base specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-FP8

NVIDIA-Nemotron-Nano-9B-v2-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-Japanese

NVIDIA-Nemotron-Nano-9B-v2-Japanese specs, VRAM requirements, and which GPUs can run it.

OLMo-1B

OLMo-1B specs, VRAM requirements, and which GPUs can run it.

OLMo-1B-0724-hf

OLMo-1B-0724-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-1B-hf

OLMo-1B-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B

OLMo-2-0325-32B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B-Instruct

OLMo-2-0325-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B

OLMo-2-0425-1B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B-Instruct

OLMo-2-0425-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B-RLVR1

OLMo-2-0425-1B-RLVR1 specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-13B-Instruct

OLMo-2-1124-13B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-7B-Instruct

OLMo-2-1124-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1025-7B

Olmo-3-1025-7B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1125-32B

Olmo-3-1125-32B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-32B-Think

Olmo-3-32B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct

Olmo-3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct-DPO

Olmo-3-7B-Instruct-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct-SFT

Olmo-3-7B-Instruct-SFT specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think

Olmo-3-7B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think-DPO

Olmo-3-7B-Think-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think-SFT

Olmo-3-7B-Think-SFT specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-32B-Think

Olmo-3.1-32B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-7B-RL-Zero-Math

Olmo-3.1-7B-RL-Zero-Math specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-0724-hf

OLMo-7B-0724-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-hf

OLMo-7B-hf specs, VRAM requirements, and which GPUs can run it.

Olmo-Hybrid-Instruct-DPO-7B

Olmo-Hybrid-Instruct-DPO-7B specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0125

OLMoE-1B-7B-0125 specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0125-Instruct

OLMoE-1B-7B-0125-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0924-Instruct

OLMoE-1B-7B-0924-Instruct specs, VRAM requirements, and which GPUs can run it.

phi-1

phi-1 specs, VRAM requirements, and which GPUs can run it.

phi-1_5

phi-1_5 specs, VRAM requirements, and which GPUs can run it.

phi-2

phi-2 specs, VRAM requirements, and which GPUs can run it.

Phi-3-medium-4k-instruct

Phi-3-medium-4k-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-3-mini-4k-instruct-gptq-4bit

Phi-3-mini-4k-instruct-gptq-4bit specs, VRAM requirements, and which GPUs can run it.

Phi-3-small-8k-instruct

Phi-3-small-8k-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-mini-MoE-instruct

Phi-mini-MoE-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-tiny-MoE-instruct

Phi-tiny-MoE-instruct specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-1.3b

polyglot-ko-1.3b specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-12.8b

polyglot-ko-12.8b specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-5.8b

polyglot-ko-5.8b specs, VRAM requirements, and which GPUs can run it.

pythia-1.4b

pythia-1.4b specs, VRAM requirements, and which GPUs can run it.

pythia-1.4b-deduped

pythia-1.4b-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-12b

pythia-12b specs, VRAM requirements, and which GPUs can run it.

pythia-14m

pythia-14m specs, VRAM requirements, and which GPUs can run it.

pythia-14m-deduped

pythia-14m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-160m-deduped

pythia-160m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-160m-seed1

pythia-160m-seed1 specs, VRAM requirements, and which GPUs can run it.

pythia-1b

pythia-1b specs, VRAM requirements, and which GPUs can run it.

pythia-2.8b-deduped

pythia-2.8b-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-31m

pythia-31m specs, VRAM requirements, and which GPUs can run it.

pythia-31m-deduped

pythia-31m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-410m

pythia-410m specs, VRAM requirements, and which GPUs can run it.

pythia-410m-deduped

pythia-410m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-410m-v0

pythia-410m-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-6.9b

pythia-6.9b specs, VRAM requirements, and which GPUs can run it.

pythia-70m-deduped

pythia-70m-deduped specs, VRAM requirements, and which GPUs can run it.

Qwen 2.5 72B

Qwen 2.5 72B specs, VRAM requirements, and which GPUs can run it. Strong on benchmarks, competitive with Llama 70B.

Qwen 2.5 72B Instruct

Qwen 2.5 72B Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen1.5-110B-Chat-AWQ

Qwen1.5-110B-Chat-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2 72B

Qwen2 72B specs, VRAM requirements, and which GPUs can run it.

Qwen2-0.5B-Instruct

Qwen2-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2-1.5B-Instruct

Qwen2-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2-7B-Instruct

Qwen2-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-0.5B

Qwen2.5-0.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-0.5B-Instruct

Qwen2.5-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B

Qwen2.5-1.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-Instruct

Qwen2.5-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-Instruct-AWQ

Qwen2.5-1.5B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-quantized.w8a8

Qwen2.5-1.5B-quantized.w8a8 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-14B-Instruct-AWQ

Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B

Qwen2.5-32B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B-Instruct-AWQ

Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-3B

Qwen2.5-3B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-3B-Instruct

Qwen2.5-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct

Qwen2.5-72B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct-AWQ

Qwen2.5-72B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-7B

Qwen2.5-7B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-7B-Instruct

Qwen2.5-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-0.5B-Instruct

Qwen2.5-Coder-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-1.5B-Instruct

Qwen2.5-Coder-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-14B-Instruct

Qwen2.5-Coder-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct

Qwen2.5-Coder-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct-AWQ

Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct

Qwen2.5-Coder-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct-AWQ

Qwen2.5-Coder-7B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct-GPTQ-Int4

Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Math-1.5B

Qwen2.5-Math-1.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-VL-7B-Instruct-NVFP4

Qwen2.5-VL-7B-Instruct-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-0.6B

Qwen3-0.6B specs, VRAM requirements, and which GPUs can run it.

Qwen3-0.6B-FP8

Qwen3-0.6B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-1.7B-Base

Qwen3-1.7B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-Instruct

Qwen3-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-NVFP4

Qwen3-14B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B

Qwen3-235B-A22B specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B-Instruct-2507-FP8

Qwen3-235B-A22B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B-NVFP4

Qwen3-235B-A22B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-Instruct-2507-FP8

Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-NVFP4

Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-AWQ

Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-NVFP4

Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-AWQ

Qwen3-4B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-Instruct-2507-FP8

Qwen3-4B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-SafeRL

Qwen3-4B-SafeRL specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-AWQ

Qwen3-8B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-Base

Qwen3-8B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-FP8

Qwen3-8B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-NVFP4

Qwen3-8B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-30B-A3B-Instruct-FP8

Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next

Qwen3-Coder-Next specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-8bit

Qwen3-Coder-Next-8bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-AWQ-4bit

Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-Base

Qwen3-Coder-Next-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-FP8

Qwen3-Coder-Next-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Next-80B-A3B-Instruct

Qwen3-Next-80B-A3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-Next-80B-A3B-Instruct-FP8

Qwen3-Next-80B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-VL-30B-A3B-Instruct-AWQ

Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Text-NVFP4-MTP

Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-4B-Safety-Thinking

Qwen3.5-4B-Safety-Thinking specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-9B-abliterated

Qwen3.5-9B-abliterated specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-0.6B

Qwen3Guard-Gen-0.6B specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-4B

Qwen3Guard-Gen-4B specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-8B

Qwen3Guard-Gen-8B specs, VRAM requirements, and which GPUs can run it.

QwQ-32B-AWQ

QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

recurrentgemma-2b

recurrentgemma-2b specs, VRAM requirements, and which GPUs can run it.

saiga_llama3_8b

saiga_llama3_8b specs, VRAM requirements, and which GPUs can run it.

SmolLM-135M-Instruct

SmolLM-135M-Instruct specs, VRAM requirements, and which GPUs can run it.

SmolLM2-135M

SmolLM2-135M specs, VRAM requirements, and which GPUs can run it.

SmolLM2-135M-Instruct

SmolLM2-135M-Instruct specs, VRAM requirements, and which GPUs can run it.

SOLAR-10.7B-v1.0

SOLAR-10.7B-v1.0 specs, VRAM requirements, and which GPUs can run it.

StableBeluga-13B

StableBeluga-13B specs, VRAM requirements, and which GPUs can run it.

stablelm-2-1_6b

stablelm-2-1_6b specs, VRAM requirements, and which GPUs can run it.

stablelm-2-zephyr-1_6b

stablelm-2-zephyr-1_6b specs, VRAM requirements, and which GPUs can run it.

stablelm-3b-4e1t

stablelm-3b-4e1t specs, VRAM requirements, and which GPUs can run it.

stablelm-base-alpha-7b-v2

stablelm-base-alpha-7b-v2 specs, VRAM requirements, and which GPUs can run it.

stablelm-zephyr-3b

stablelm-zephyr-3b specs, VRAM requirements, and which GPUs can run it.

starchat-alpha

starchat-alpha specs, VRAM requirements, and which GPUs can run it.

Starling-LM-7B-beta

Starling-LM-7B-beta specs, VRAM requirements, and which GPUs can run it.

steerling-8b

steerling-8b specs, VRAM requirements, and which GPUs can run it.

Step-3.5-Flash

Step-3.5-Flash specs, VRAM requirements, and which GPUs can run it.

stories15M_MOE

stories15M_MOE specs, VRAM requirements, and which GPUs can run it.

Strand-Rust-Coder-14B-v1

Strand-Rust-Coder-14B-v1 specs, VRAM requirements, and which GPUs can run it.

tiny-aya-global

tiny-aya-global specs, VRAM requirements, and which GPUs can run it.

tiny-random-Gemma2ForCausalLM

tiny-random-Gemma2ForCausalLM specs, VRAM requirements, and which GPUs can run it.

TinyLlama-1.1B-Chat-v0.3-GPTQ

TinyLlama-1.1B-Chat-v0.3-GPTQ specs, VRAM requirements, and which GPUs can run it.

TinyLlama-1.1B-Chat-v1.0

TinyLlama-1.1B-Chat-v1.0 specs, VRAM requirements, and which GPUs can run it.

tulu-2-dpo-70b

tulu-2-dpo-70b specs, VRAM requirements, and which GPUs can run it.

txgemma-2b-predict

txgemma-2b-predict specs, VRAM requirements, and which GPUs can run it.

vaultgemma-1b

vaultgemma-1b specs, VRAM requirements, and which GPUs can run it.

wildguard

wildguard specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B

Yi-1.5-34B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-32K

Yi-1.5-34B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat

Yi-1.5-34B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat-16K

Yi-1.5-34B-Chat-16K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-6B

Yi-1.5-6B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-6B-Chat

Yi-1.5-6B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B

Yi-1.5-9B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-32K

Yi-1.5-9B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-Chat

Yi-1.5-9B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-Chat-16K

Yi-1.5-9B-Chat-16K specs, VRAM requirements, and which GPUs can run it.

Yi-6B

Yi-6B specs, VRAM requirements, and which GPUs can run it.

Yi-6B-200K

Yi-6B-200K specs, VRAM requirements, and which GPUs can run it.

Yi-6B-Chat

Yi-6B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-9B

Yi-9B specs, VRAM requirements, and which GPUs can run it.

Yi-9B-200K

Yi-9B-200K specs, VRAM requirements, and which GPUs can run it.

Yi-Coder-9B

Yi-Coder-9B specs, VRAM requirements, and which GPUs can run it.

Yi-Coder-9B-Chat

Yi-Coder-9B-Chat specs, VRAM requirements, and which GPUs can run it.

zephyr-7b-beta

zephyr-7b-beta specs, VRAM requirements, and which GPUs can run it.