Xlarge

AI21-Jamba-Large-1.5

AI21-Jamba-Large-1.5 specs, VRAM requirements, and which GPUs can run it.

Athene-70B-Preview

Athene-70B-Preview specs, VRAM requirements, and which GPUs can run it.

Athene-V2-Agent

Athene-V2-Agent specs, VRAM requirements, and which GPUs can run it.

bloomz

bloomz specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Instruct

DeepSeek-Coder-V2-Instruct specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Instruct-0724

DeepSeek-Coder-V2-Instruct-0724 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528

DeepSeek-R1-0528 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-NVFP4

DeepSeek-R1-0528-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-NVFP4-v2

DeepSeek-R1-0528-NVFP4-v2 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-NVFP4

DeepSeek-R1-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2

DeepSeek-V2 specs, VRAM requirements, and which GPUs can run it.

Deepseek-V2 Pro

Deepseek-V2 Pro specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Chat

DeepSeek-V2-Chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Chat-0628

DeepSeek-V2-Chat-0628 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2.5

DeepSeek-V2.5 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3-0324

DeepSeek-V3-0324 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3-0324-NVFP4

DeepSeek-V3-0324-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.1-NVFP4

DeepSeek-V3.1-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.2

DeepSeek-V3.2 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.2-NVFP4

DeepSeek-V3.2-NVFP4 specs, VRAM requirements, and which GPUs can run it.

gpt-oss-120b

gpt-oss-120b specs, VRAM requirements, and which GPUs can run it.

Llama 3.1 70B

Llama 3.1 70B specs, VRAM requirements, and which GPUs can run it. Often considered a sweet spot for local reasoning.

Llama-3.1-405B-Instruct

Llama-3.1-405B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-405B-Instruct-FP8

Llama-3.1-405B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-70B-Instruct

Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

llama-3.3-70b-instruct-awq

llama-3.3-70b-instruct-awq specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-70B-Instruct

Meta-Llama-3-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-70B-Instruct

Meta-Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2-AWQ

MiniMax-M2-AWQ specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2.5

MiniMax-M2.5 specs, VRAM requirements, and which GPUs can run it.

Qwen 2.5 72B

Qwen 2.5 72B specs, VRAM requirements, and which GPUs can run it. Scores well on benchmarks and is competitive with Llama 70B.

Qwen 2.5 72B Instruct

Qwen 2.5 72B Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen1.5-110B-Chat-AWQ

Qwen1.5-110B-Chat-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2 72B

Qwen2 72B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct

Qwen2.5-72B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct-AWQ

Qwen2.5-72B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B

Qwen3-235B-A22B specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B-Instruct-2507-FP8

Qwen3-235B-A22B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B-NVFP4

Qwen3-235B-A22B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next

Qwen3-Coder-Next specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-Base

Qwen3-Coder-Next-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-FP8

Qwen3-Coder-Next-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Next-80B-A3B-Instruct

Qwen3-Next-80B-A3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-Next-80B-A3B-Instruct-FP8

Qwen3-Next-80B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Step-3.5-Flash

Step-3.5-Flash specs, VRAM requirements, and which GPUs can run it.