Quantized
DeepSeek-R1-0528-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-0528-NVFP4-v2 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-R1-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V3-0324-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V3.1-NVFP4 specs, VRAM requirements, and which GPUs can run it.
DeepSeek-V3.2-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3.1-405B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3.2-1B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Llama-3.2-1B-Instruct-FP8-dynamic specs, VRAM requirements, and which GPUs can run it.
llama-3.3-70b-instruct-awq specs, VRAM requirements, and which GPUs can run it.
Llama-Guard-3-8B-INT8 specs, VRAM requirements, and which GPUs can run it.
Meta-Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
MiniMax-M2-AWQ specs, VRAM requirements, and which GPUs can run it.
Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.
Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
NVIDIA-Nemotron-Nano-9B-v2-FP8 specs, VRAM requirements, and which GPUs can run it.
Phi-3-mini-4k-instruct-gptq-4bit specs, VRAM requirements, and which GPUs can run it.
Qwen1.5-110B-Chat-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-1.5B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-72B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-7B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.
Qwen2.5-VL-7B-Instruct-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-0.6B-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-14B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-235B-A22B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-235B-A22B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-4B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-4B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-8B-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3-8B-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-8B-NVFP4 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.
Qwen3-Coder-Next-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-Next-80B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.
Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.
Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.
QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.
TinyLlama-1.1B-Chat-v0.3-GPTQ specs, VRAM requirements, and which GPUs can run it.