Text-Generation

AceReason-Nemotron-14B

AceReason-Nemotron-14B specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Large-1.5

AI21-Jamba-Large-1.5 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.5

AI21-Jamba-Mini-1.5 specs, VRAM requirements, and which GPUs can run it.

AI21-Jamba-Mini-1.6

AI21-Jamba-Mini-1.6 specs, VRAM requirements, and which GPUs can run it.

Athene-70B-Preview

Athene-70B-Preview specs, VRAM requirements, and which GPUs can run it.

Athene-V2-Agent

Athene-V2-Agent specs, VRAM requirements, and which GPUs can run it.

bigscience-small-testing

bigscience-small-testing specs, VRAM requirements, and which GPUs can run it.

bitnet-b1.58-2B-4T

bitnet-b1.58-2B-4T specs, VRAM requirements, and which GPUs can run it.

bitnet-b1.58-2B-4T-bf16

bitnet-b1.58-2B-4T-bf16 specs, VRAM requirements, and which GPUs can run it.

bloom-1b1

bloom-1b1 specs, VRAM requirements, and which GPUs can run it.

bloom-1b7

bloom-1b7 specs, VRAM requirements, and which GPUs can run it.

bloom-3b

bloom-3b specs, VRAM requirements, and which GPUs can run it.

bloom-560m

bloom-560m specs, VRAM requirements, and which GPUs can run it.

bloom-7b1

bloom-7b1 specs, VRAM requirements, and which GPUs can run it.

bloom-7b1-petals

bloom-7b1-petals specs, VRAM requirements, and which GPUs can run it.

bloomz

bloomz specs, VRAM requirements, and which GPUs can run it.

bloomz-1b7

bloomz-1b7 specs, VRAM requirements, and which GPUs can run it.

bloomz-3b

bloomz-3b specs, VRAM requirements, and which GPUs can run it.

bloomz-560m

bloomz-560m specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1

bloomz-7b1 specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1-mt

bloomz-7b1-mt specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1-p3

bloomz-7b1-p3 specs, VRAM requirements, and which GPUs can run it.

bloomz-mt

bloomz-mt specs, VRAM requirements, and which GPUs can run it.

Bolmo-1B

Bolmo-1B specs, VRAM requirements, and which GPUs can run it.

Bonsai-8B-mlx-1bit

Bonsai-8B-mlx-1bit specs, VRAM requirements, and which GPUs can run it.

codegemma-2b

codegemma-2b specs, VRAM requirements, and which GPUs can run it.

CodeLlama-13b-hf

CodeLlama-13b-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-13b-Instruct-hf

CodeLlama-13b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-34b-hf

CodeLlama-34b-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-7b-Instruct-hf

CodeLlama-7b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

CodeLlama-7b-Python-hf

CodeLlama-7b-Python-hf specs, VRAM requirements, and which GPUs can run it.

convergent-llama-300M-muon-isolate-1

convergent-llama-300M-muon-isolate-1 specs, VRAM requirements, and which GPUs can run it.

convergent-llama-300M-muon-window-2

convergent-llama-300M-muon-window-2 specs, VRAM requirements, and which GPUs can run it.

convergent-llama-300M-muon-window-4

convergent-llama-300M-muon-window-4 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170m-GR

Dayhoff-170m-GR specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GR-1000

Dayhoff-170M-GR-1000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GR-16000

Dayhoff-170M-GR-16000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GR-31000

Dayhoff-170M-GR-31000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GR-46000

Dayhoff-170M-GR-46000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GR-61000

Dayhoff-170M-GR-61000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GRS-112000

Dayhoff-170M-GRS-112000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GRS-2000

Dayhoff-170M-GRS-2000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GRS-26000

Dayhoff-170M-GRS-26000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GRS-50000

Dayhoff-170M-GRS-50000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-GRS-76000

Dayhoff-170M-GRS-76000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170m-UR90

Dayhoff-170m-UR90 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-UR90-1000

Dayhoff-170M-UR90-1000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-UR90-46000

Dayhoff-170M-UR90-46000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-170M-UR90-61000

Dayhoff-170M-UR90-61000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-3b-GR-HM-1000

Dayhoff-3b-GR-HM-1000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-3b-GR-HM-11000

Dayhoff-3b-GR-HM-11000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-3b-GR-HM-21000

Dayhoff-3b-GR-HM-21000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-3b-GR-HM-31000

Dayhoff-3b-GR-HM-31000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-3b-GR-HM-41000

Dayhoff-3b-GR-HM-41000 specs, VRAM requirements, and which GPUs can run it.

Dayhoff-3b-GR-HM-c

Dayhoff-3b-GR-HM-c specs, VRAM requirements, and which GPUs can run it.

deep-ignorance-pretraining-stage-unfiltered

deep-ignorance-pretraining-stage-unfiltered specs, VRAM requirements, and which GPUs can run it.

deep-ignorance-unfiltered

deep-ignorance-unfiltered specs, VRAM requirements, and which GPUs can run it.

DeepHermes-3-Llama-3-8B-Preview

DeepHermes-3-Llama-3-8B-Preview specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-33b-base

deepseek-coder-33b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-33b-instruct

deepseek-coder-33b-instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-6.7b-base

deepseek-coder-6.7b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-6.7b-instruct

deepseek-coder-6.7b-instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-7b-base-v1.5

deepseek-coder-7b-base-v1.5 specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-7b-instruct-v1.5

deepseek-coder-7b-instruct-v1.5 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Instruct

DeepSeek-Coder-V2-Instruct specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Instruct-0724

DeepSeek-Coder-V2-Instruct-0724 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Lite-Base

DeepSeek-Coder-V2-Lite-Base specs, VRAM requirements, and which GPUs can run it.

DeepSeek-Coder-V2-Lite-Instruct

DeepSeek-Coder-V2-Lite-Instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-math-7b-rl

deepseek-math-7b-rl specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-base

deepseek-moe-16b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-moe-16b-chat

deepseek-moe-16b-chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528

DeepSeek-R1-0528 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-NVFP4

DeepSeek-R1-0528-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-NVFP4-v2

DeepSeek-R1-0528-NVFP4-v2 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B

DeepSeek-R1-0528-Qwen3-8B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B-MLX-4bit

DeepSeek-R1-0528-Qwen3-8B-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B-MLX-8bit

DeepSeek-R1-0528-Qwen3-8B-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-1.5B

DeepSeek-R1-Distill-Qwen-1.5B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-14B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill-Qwen-32B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-7B

DeepSeek-R1-Distill-Qwen-7B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-NVFP4

DeepSeek-R1-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2

DeepSeek-V2 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Chat

DeepSeek-V2-Chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Chat-0628

DeepSeek-V2-Chat-0628 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite

DeepSeek-V2-Lite specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2-Lite-Chat

DeepSeek-V2-Lite-Chat specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V2.5

DeepSeek-V2.5 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3-0324

DeepSeek-V3-0324 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3-0324-NVFP4

DeepSeek-V3-0324-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.1-NVFP4

DeepSeek-V3.1-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.2-Exp-Base

DeepSeek-V3.2-Exp-Base specs, VRAM requirements, and which GPUs can run it.

DeepSeek-V3.2-NVFP4

DeepSeek-V3.2-NVFP4 specs, VRAM requirements, and which GPUs can run it.

DialoGPT-small

DialoGPT-small specs, VRAM requirements, and which GPUs can run it.

DiarizationLM-13b-Fisher-v1

DiarizationLM-13b-Fisher-v1 specs, VRAM requirements, and which GPUs can run it.

distilgpt2

distilgpt2 specs, VRAM requirements, and which GPUs can run it.

dolphin-2.9.1-yi-1.5-34b

dolphin-2.9.1-yi-1.5-34b specs, VRAM requirements, and which GPUs can run it.

Dolphin-Mistral-24B-Venice-Edition

Dolphin-Mistral-24B-Venice-Edition specs, VRAM requirements, and which GPUs can run it.

ELM

ELM specs, VRAM requirements, and which GPUs can run it.

ESFT-vanilla-lite

ESFT-vanilla-lite specs, VRAM requirements, and which GPUs can run it.

falcon-11B

falcon-11B specs, VRAM requirements, and which GPUs can run it.

falcon-40b

falcon-40b specs, VRAM requirements, and which GPUs can run it.

falcon-7b

falcon-7b specs, VRAM requirements, and which GPUs can run it.

falcon-7b-instruct

falcon-7b-instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-E-1B-Base

Falcon-E-1B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-E-1B-Instruct

Falcon-E-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-E-3B-Instruct

Falcon-E-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-0.5B-Base

Falcon-H1-0.5B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-0.5B-Instruct

Falcon-H1-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Base

Falcon-H1-1.5B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Deep-Base

Falcon-H1-1.5B-Deep-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-1.5B-Instruct

Falcon-H1-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Base

Falcon-H1-34B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Instruct

Falcon-H1-34B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-34B-Instruct-GPTQ-Int8

Falcon-H1-34B-Instruct-GPTQ-Int8 specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-3B-Base

Falcon-H1-3B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-3B-Instruct

Falcon-H1-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-7B-Base

Falcon-H1-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-7B-Instruct

Falcon-H1-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-90M-Base

Falcon-H1-Tiny-90M-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-90M-Instruct

Falcon-H1-Tiny-90M-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-90M-Instruct-pre-DPO

Falcon-H1-Tiny-90M-Instruct-pre-DPO specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-Multilingual-100M-Base

Falcon-H1-Tiny-Multilingual-100M-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-Multilingual-100M-Instruct

Falcon-H1-Tiny-Multilingual-100M-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-R-0.6B

Falcon-H1-Tiny-R-0.6B specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-Tiny-R-90M

Falcon-H1-Tiny-R-90M specs, VRAM requirements, and which GPUs can run it.

Falcon-H1R-7B-FP8

Falcon-H1R-7B-FP8 specs, VRAM requirements, and which GPUs can run it.

falcon-mamba-7b-instruct

falcon-mamba-7b-instruct specs, VRAM requirements, and which GPUs can run it.

falcon-mamba-tiny-dev

falcon-mamba-tiny-dev specs, VRAM requirements, and which GPUs can run it.

falcon-rw-7b

falcon-rw-7b specs, VRAM requirements, and which GPUs can run it.

Falcon3-10B-Base

Falcon3-10B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-10B-Instruct-1.58bit

Falcon3-10B-Instruct-1.58bit specs, VRAM requirements, and which GPUs can run it.

Falcon3-1B-Instruct

Falcon3-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-1B-Instruct-1.58bit

Falcon3-1B-Instruct-1.58bit specs, VRAM requirements, and which GPUs can run it.

Falcon3-3B-Base

Falcon3-3B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-3B-Instruct

Falcon3-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Base

Falcon3-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Instruct

Falcon3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Instruct-1.58bit

Falcon3-7B-Instruct-1.58bit specs, VRAM requirements, and which GPUs can run it.

Falcon3-Mamba-7B-Base

Falcon3-Mamba-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-Mamba-7B-Instruct

Falcon3-Mamba-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Faust-1

Faust-1 specs, VRAM requirements, and which GPUs can run it.

Flex-reddit-2x7B-1T

Flex-reddit-2x7B-1T specs, VRAM requirements, and which GPUs can run it.

FlexOlmo-7x7B-1T-RT

FlexOlmo-7x7B-1T-RT specs, VRAM requirements, and which GPUs can run it.

gemma-1.1-2b-it

gemma-1.1-2b-it specs, VRAM requirements, and which GPUs can run it.

gemma-1.1-7b-it

gemma-1.1-7b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-27b-it

gemma-2-27b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-2b-it

gemma-2-2b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-2b-jpn-it

gemma-2-2b-jpn-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-9b-it

gemma-2-9b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2b

gemma-2b specs, VRAM requirements, and which GPUs can run it.

gemma-2b-AWQ

gemma-2b-AWQ specs, VRAM requirements, and which GPUs can run it.

gemma-3-1b-it

gemma-3-1b-it specs, VRAM requirements, and which GPUs can run it.

gemma-3-1b-it-qat-int4-unquantized

gemma-3-1b-it-qat-int4-unquantized specs, VRAM requirements, and which GPUs can run it.

gemma-3-1b-it-qat-q4_0-unquantized

gemma-3-1b-it-qat-q4_0-unquantized specs, VRAM requirements, and which GPUs can run it.

gemma-3-270m

gemma-3-270m specs, VRAM requirements, and which GPUs can run it.

gemma-3-270m-it-qat-q4_0-unquantized

gemma-3-270m-it-qat-q4_0-unquantized specs, VRAM requirements, and which GPUs can run it.

gemma-3-270m-qat-q4_0-unquantized

gemma-3-270m-qat-q4_0-unquantized specs, VRAM requirements, and which GPUs can run it.

Gemma-4-31B-IT-NVFP4

Gemma-4-31B-IT-NVFP4 specs, VRAM requirements, and which GPUs can run it.

gemma-4-E4B-it-OBLITERATED

gemma-4-E4B-it-OBLITERATED specs, VRAM requirements, and which GPUs can run it.

GLM-4.5-Air

GLM-4.5-Air specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash

GLM-4.7-Flash specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-FP8-Dynamic

GLM-4.7-Flash-FP8-Dynamic specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-MLX-6bit

GLM-4.7-Flash-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-MLX-8bit

GLM-4.7-Flash-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

GLM-5-FP8

GLM-5-FP8 specs, VRAM requirements, and which GPUs can run it.

GLM-5-NVFP4

GLM-5-NVFP4 specs, VRAM requirements, and which GPUs can run it.

GLM-5.1-FP8

GLM-5.1-FP8 specs, VRAM requirements, and which GPUs can run it.

GLM-5.1-MLX-4.8bit

GLM-5.1-MLX-4.8bit specs, VRAM requirements, and which GPUs can run it.

gpt-neo-1.3B

gpt-neo-1.3B specs, VRAM requirements, and which GPUs can run it.

gpt-neo-125m

gpt-neo-125m specs, VRAM requirements, and which GPUs can run it.

gpt-neo-2.7B

gpt-neo-2.7B specs, VRAM requirements, and which GPUs can run it.

gpt-oss-120b

gpt-oss-120b specs, VRAM requirements, and which GPUs can run it.

gpt-oss-120b-Eagle3-long-context

gpt-oss-120b-Eagle3-long-context specs, VRAM requirements, and which GPUs can run it.

gpt-oss-20b

gpt-oss-20b specs, VRAM requirements, and which GPUs can run it.

gpt-oss-20b-MXFP4-Q8

gpt-oss-20b-MXFP4-Q8 specs, VRAM requirements, and which GPUs can run it.

gpt-oss-puzzle-88B

gpt-oss-puzzle-88B specs, VRAM requirements, and which GPUs can run it.

gpt2

gpt2 specs, VRAM requirements, and which GPUs can run it.

gpt2-large

gpt2-large specs, VRAM requirements, and which GPUs can run it.

gpt2-medium

gpt2-medium specs, VRAM requirements, and which GPUs can run it.

gpt2-mini

gpt2-mini specs, VRAM requirements, and which GPUs can run it.

granite-3.3-8b-instruct

granite-3.3-8b-instruct specs, VRAM requirements, and which GPUs can run it.

h2ovl-mississippi-2b

h2ovl-mississippi-2b specs, VRAM requirements, and which GPUs can run it.

h2ovl-mississippi-800m

h2ovl-mississippi-800m specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Pro-Llama-3-8B

Hermes-2-Pro-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Pro-Mistral-7B

Hermes-2-Pro-Mistral-7B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Theta-Llama-3-70B

Hermes-2-Theta-Llama-3-70B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Theta-Llama-3-8B

Hermes-2-Theta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-3-Llama-3.1-70B

Hermes-3-Llama-3.1-70B specs, VRAM requirements, and which GPUs can run it.

Hermes-3-Llama-3.1-8B

Hermes-3-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-3-Llama-3.2-3B

Hermes-3-Llama-3.2-3B specs, VRAM requirements, and which GPUs can run it.

Hermes-4-14B

Hermes-4-14B specs, VRAM requirements, and which GPUs can run it.

Hermes-4-405B

Hermes-4-405B specs, VRAM requirements, and which GPUs can run it.

Hermes-4-70B-FP8

Hermes-4-70B-FP8 specs, VRAM requirements, and which GPUs can run it.

internlm2_5-1_8b-chat

internlm2_5-1_8b-chat specs, VRAM requirements, and which GPUs can run it.

internlm2_5-20b-chat

internlm2_5-20b-chat specs, VRAM requirements, and which GPUs can run it.

internlm2_5-7b

internlm2_5-7b specs, VRAM requirements, and which GPUs can run it.

internlm2_5-7b-chat

internlm2_5-7b-chat specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-1_8b

internlm2-chat-1_8b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-20b

internlm2-chat-20b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-7b-sft

internlm2-chat-7b-sft specs, VRAM requirements, and which GPUs can run it.

internlm2-math-7b

internlm2-math-7b specs, VRAM requirements, and which GPUs can run it.

internlm2-math-plus-7b

internlm2-math-plus-7b specs, VRAM requirements, and which GPUs can run it.

Jamba-tiny-random

Jamba-tiny-random specs, VRAM requirements, and which GPUs can run it.

Jan-v3-4B-base-instruct

Jan-v3-4B-base-instruct specs, VRAM requirements, and which GPUs can run it.

japanese-gpt-neox-small

japanese-gpt-neox-small specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-2-base-1_6b

japanese-stablelm-2-base-1_6b specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-2-instruct-1_6b

japanese-stablelm-2-instruct-1_6b specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-3b-4e1t-instruct

japanese-stablelm-3b-4e1t-instruct specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-base-beta-70b

japanese-stablelm-base-beta-70b specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-base-gamma-7b

japanese-stablelm-base-gamma-7b specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-instruct-beta-70b

japanese-stablelm-instruct-beta-70b specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-instruct-beta-7b

japanese-stablelm-instruct-beta-7b specs, VRAM requirements, and which GPUs can run it.

japanese-stablelm-instruct-gamma-7b

japanese-stablelm-instruct-gamma-7b specs, VRAM requirements, and which GPUs can run it.

Karnak

Karnak specs, VRAM requirements, and which GPUs can run it.

KD-Tinker

KD-Tinker specs, VRAM requirements, and which GPUs can run it.

Kimi-K2-Instruct-0905

Kimi-K2-Instruct-0905 specs, VRAM requirements, and which GPUs can run it.

L3.3-GeneticLemonade-Final-v2-70B

L3.3-GeneticLemonade-Final-v2-70B specs, VRAM requirements, and which GPUs can run it.

LFM2-1.2B

LFM2-1.2B specs, VRAM requirements, and which GPUs can run it.

LFM2-24B-A2B

LFM2-24B-A2B specs, VRAM requirements, and which GPUs can run it.

LFM2-24B-A2B-MLX-4bit

LFM2-24B-A2B-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

LFM2-24B-A2B-MLX-6bit

LFM2-24B-A2B-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

LFM2-24B-A2B-MLX-8bit

LFM2-24B-A2B-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

LFM2-8B-A1B

LFM2-8B-A1B specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct

LFM2.5-1.2B-Instruct specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-4bit

LFM2.5-1.2B-Instruct-MLX-4bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-6bit

LFM2.5-1.2B-Instruct-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Instruct-MLX-8bit

LFM2.5-1.2B-Instruct-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

LFM2.5-1.2B-Thinking

LFM2.5-1.2B-Thinking specs, VRAM requirements, and which GPUs can run it.

Llama-2-7b-hf

Llama-2-7b-hf specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1

Llama-3_3-Nemotron-Super-49B-v1 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Llama-3_3-Nemotron-Super-49B-v1_5-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Llama-3_3-Nemotron-Super-49B-v1-FP8

Llama-3_3-Nemotron-Super-49B-v1-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-405B-FP8

Llama-3.1-405B-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-405B-Instruct

Llama-3.1-405B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-405B-Instruct-FP8

Llama-3.1-405B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-70B-Instruct

Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-8B-Instruct

Llama-3.1-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-8B-Instruct-FP8

Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Nemotron-70B-Instruct-HF

Llama-3.1-Nemotron-70B-Instruct-HF specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Nemotron-Nano-4B-v1.1

Llama-3.1-Nemotron-Nano-4B-v1.1 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Nemotron-Nano-8B-v1

Llama-3.1-Nemotron-Nano-8B-v1 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Nemotron-Safety-Guard-8B-v3

Llama-3.1-Nemotron-Safety-Guard-8B-v3 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Tulu-3-70B-DPO

Llama-3.1-Tulu-3-70B-DPO specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Tulu-3-8B-SFT

Llama-3.1-Tulu-3-8B-SFT specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B

Llama-3.2-1B specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct

Llama-3.2-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct-FP8

Llama-3.2-1B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-1B-Instruct-FP8-dynamic

Llama-3.2-1B-Instruct-FP8-dynamic specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-3B

Llama-3.2-3B specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-3B-Instruct

Llama-3.2-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Llama-3.3-70B-Instruct

Llama-3.3-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

llama-3.3-70b-instruct-awq

llama-3.3-70b-instruct-awq specs, VRAM requirements, and which GPUs can run it.

llama-300M-v3-original

llama-300M-v3-original specs, VRAM requirements, and which GPUs can run it.

llama-300M-v5-isolate

llama-300M-v5-isolate specs, VRAM requirements, and which GPUs can run it.

llama-300M-v5-window_2

llama-300M-v5-window_2 specs, VRAM requirements, and which GPUs can run it.

llama-300M-v5-window_4

llama-300M-v5-window_4 specs, VRAM requirements, and which GPUs can run it.

llama-600M-v4-isolate

llama-600M-v4-isolate specs, VRAM requirements, and which GPUs can run it.

llama-7b

llama-7b specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-1B

Llama-Guard-3-1B specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-8B

Llama-Guard-3-8B specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-8B-INT8

Llama-Guard-3-8B-INT8 specs, VRAM requirements, and which GPUs can run it.

LlamaGuard-7b

LlamaGuard-7b specs, VRAM requirements, and which GPUs can run it.

llm-jp-3-3.7b-instruct

llm-jp-3-3.7b-instruct specs, VRAM requirements, and which GPUs can run it.

LocoOperator-4B

LocoOperator-4B specs, VRAM requirements, and which GPUs can run it.

madlad400-8b-lm

madlad400-8b-lm specs, VRAM requirements, and which GPUs can run it.

maira-2

maira-2 specs, VRAM requirements, and which GPUs can run it.

MediPhi

MediPhi specs, VRAM requirements, and which GPUs can run it.

MediPhi-Clinical

MediPhi-Clinical specs, VRAM requirements, and which GPUs can run it.

MediPhi-Instruct

MediPhi-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-70B

Meta-Llama-3-70B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-70B-Instruct

Meta-Llama-3-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-8B

Meta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-8B-Instruct

Meta-Llama-3-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-70B

Meta-Llama-3.1-70B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-70B-Instruct

Meta-Llama-3.1-70B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B

Meta-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct

Meta-Llama-3.1-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct-bnb-4bit

Meta-Llama-3.1-8B-Instruct-bnb-4bit specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct-FP8

Meta-Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-Guard-2-8B

Meta-Llama-Guard-2-8B specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2-AWQ

MiniMax-M2-AWQ specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2.5

MiniMax-M2.5 specs, VRAM requirements, and which GPUs can run it.

MiniMax-M2.5-NVFP4

MiniMax-M2.5-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Mistral-7B-Instruct-v0.2

Mistral-7B-Instruct-v0.2 specs, VRAM requirements, and which GPUs can run it.

Mistral-7B-v0.1

Mistral-7B-v0.1 specs, VRAM requirements, and which GPUs can run it.

mistral-7b-v0.3-bnb-4bit

mistral-7b-v0.3-bnb-4bit specs, VRAM requirements, and which GPUs can run it.

Mistral-NeMo-Minitron-8B-Instruct

Mistral-NeMo-Minitron-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Mistral-Small-24B-Instruct-2501-AWQ

Mistral-Small-24B-Instruct-2501-AWQ specs, VRAM requirements, and which GPUs can run it.

Mixtral-8x7B-Instruct-v0.1-GPTQ

Mixtral-8x7B-Instruct-v0.1-GPTQ specs, VRAM requirements, and which GPUs can run it.

Nanbeige4.1-3B

Nanbeige4.1-3B specs, VRAM requirements, and which GPUs can run it.

Nanbeige4.1-3B-heretic

Nanbeige4.1-3B-heretic specs, VRAM requirements, and which GPUs can run it.

Nemotron-3-Nano-30B-A3B

Nemotron-3-Nano-30B-A3B specs, VRAM requirements, and which GPUs can run it.

Nemotron-Cascade-2-30B-A3B

Nemotron-Cascade-2-30B-A3B specs, VRAM requirements, and which GPUs can run it.

Nemotron-Flash-1B

Nemotron-Flash-1B specs, VRAM requirements, and which GPUs can run it.

Nemotron-Flash-3B

Nemotron-Flash-3B specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-4B-Base-8K

Nemotron-H-4B-Base-8K specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-4B-Instruct-128K

Nemotron-H-4B-Instruct-128K specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-8B-Base-8K

Nemotron-H-8B-Base-8K specs, VRAM requirements, and which GPUs can run it.

NextCoder-14B

NextCoder-14B specs, VRAM requirements, and which GPUs can run it.

NextCoder-7B

NextCoder-7B specs, VRAM requirements, and which GPUs can run it.

nmt_21

nmt_21 specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mistral-7B-DPO

Nous-Hermes-2-Mistral-7B-DPO specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mixtral-8x7B-DPO

Nous-Hermes-2-Mixtral-8x7B-DPO specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-SOLAR-10.7B

Nous-Hermes-2-SOLAR-10.7B specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-llama-2-7b

Nous-Hermes-llama-2-7b specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-Llama2-13b

Nous-Hermes-Llama2-13b specs, VRAM requirements, and which GPUs can run it.

NousCoder-14B

NousCoder-14B specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Nano-4B-FP8

NVIDIA-Nemotron-3-Nano-4B-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16

NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2

NVIDIA-Nemotron-Nano-9B-v2 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-Base

NVIDIA-Nemotron-Nano-9B-v2-Base specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-FP8

NVIDIA-Nemotron-Nano-9B-v2-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-Japanese

NVIDIA-Nemotron-Nano-9B-v2-Japanese specs, VRAM requirements, and which GPUs can run it.

OLMo-1B

OLMo-1B specs, VRAM requirements, and which GPUs can run it.

OLMo-1B-0724-hf

OLMo-1B-0724-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-1B-hf

OLMo-1B-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B

OLMo-2-0325-32B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0325-32B-Instruct

OLMo-2-0325-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B

OLMo-2-0425-1B specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B-Instruct

OLMo-2-0425-1B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-0425-1B-RLVR1

OLMo-2-0425-1B-RLVR1 specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-13B-DPO

OLMo-2-1124-13B-DPO specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-13B-Instruct

OLMo-2-1124-13B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-7B-Instruct

OLMo-2-1124-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1025-7B

Olmo-3-1025-7B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1125-32B

Olmo-3-1125-32B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-32B-Think

Olmo-3-32B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3-32B-Think-DPO

Olmo-3-32B-Think-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct

Olmo-3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct-DPO

Olmo-3-7B-Instruct-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct-SFT

Olmo-3-7B-Instruct-SFT specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-RL-Zero-Math

Olmo-3-7B-RL-Zero-Math specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think

Olmo-3-7B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think-DPO

Olmo-3-7B-Think-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think-SFT

Olmo-3-7B-Think-SFT specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-32B-Think

Olmo-3.1-32B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-7B-RL-Zero-Math

Olmo-3.1-7B-RL-Zero-Math specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-0424-hf

OLMo-7B-0424-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-0724-hf

OLMo-7B-0724-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-0724-Instruct-hf

OLMo-7B-0724-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-hf

OLMo-7B-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-Instruct

OLMo-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-SFT-hf

OLMo-7B-SFT-hf specs, VRAM requirements, and which GPUs can run it.

Olmo-Hybrid-Instruct-DPO-7B

Olmo-Hybrid-Instruct-DPO-7B specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0125

OLMoE-1B-7B-0125 specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0125-Instruct

OLMoE-1B-7B-0125-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0924-Instruct

OLMoE-1B-7B-0924-Instruct specs, VRAM requirements, and which GPUs can run it.

OpenReasoning-Nemotron-1.5B

OpenReasoning-Nemotron-1.5B specs, VRAM requirements, and which GPUs can run it.

OptiMind-SFT

OptiMind-SFT specs, VRAM requirements, and which GPUs can run it.

OTel-LLM-1B-IT

OTel-LLM-1B-IT specs, VRAM requirements, and which GPUs can run it.

OTel-LLM-270M-IT

OTel-LLM-270M-IT specs, VRAM requirements, and which GPUs can run it.

phi-1

phi-1 specs, VRAM requirements, and which GPUs can run it.

phi-1_5

phi-1_5 specs, VRAM requirements, and which GPUs can run it.

phi-2

phi-2 specs, VRAM requirements, and which GPUs can run it.

Phi-3-medium-4k-instruct

Phi-3-medium-4k-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-3-mini-4k-instruct

Phi-3-mini-4k-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-3-mini-4k-instruct-gptq-4bit

Phi-3-mini-4k-instruct-gptq-4bit specs, VRAM requirements, and which GPUs can run it.

Phi-3-small-8k-instruct

Phi-3-small-8k-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-3.5-mini-instruct

Phi-3.5-mini-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-mini-MoE-instruct

Phi-mini-MoE-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-tiny-MoE-instruct

Phi-tiny-MoE-instruct specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-1.3b

polyglot-ko-1.3b specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-12.8b

polyglot-ko-12.8b specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-5.8b

polyglot-ko-5.8b specs, VRAM requirements, and which GPUs can run it.

PowerMoE-3b

PowerMoE-3b specs, VRAM requirements, and which GPUs can run it.

pythia-1.4b

pythia-1.4b specs, VRAM requirements, and which GPUs can run it.

pythia-1.4b-deduped

pythia-1.4b-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-12b

pythia-12b specs, VRAM requirements, and which GPUs can run it.

pythia-14m

pythia-14m specs, VRAM requirements, and which GPUs can run it.

pythia-14m-deduped

pythia-14m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-160m-deduped

pythia-160m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-160m-deduped-v0

pythia-160m-deduped-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-160m-seed1

pythia-160m-seed1 specs, VRAM requirements, and which GPUs can run it.

pythia-160m-seed2

pythia-160m-seed2 specs, VRAM requirements, and which GPUs can run it.

pythia-160m-seed3

pythia-160m-seed3 specs, VRAM requirements, and which GPUs can run it.

pythia-160m-v0

pythia-160m-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-1b

pythia-1b specs, VRAM requirements, and which GPUs can run it.

pythia-1b-deduped-v0

pythia-1b-deduped-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-1b-v0

pythia-1b-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-2.8b-deduped

pythia-2.8b-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-2.8b-deduped-v0

pythia-2.8b-deduped-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-2.8b-v0

pythia-2.8b-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-31m

pythia-31m specs, VRAM requirements, and which GPUs can run it.

pythia-31m-deduped

pythia-31m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-410m

pythia-410m specs, VRAM requirements, and which GPUs can run it.

pythia-410m-deduped

pythia-410m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-410m-deduped-v0

pythia-410m-deduped-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-410m-v0

pythia-410m-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-6.9b

pythia-6.9b specs, VRAM requirements, and which GPUs can run it.

pythia-70m-deduped

pythia-70m-deduped specs, VRAM requirements, and which GPUs can run it.

pythia-70m-deduped-v0

pythia-70m-deduped-v0 specs, VRAM requirements, and which GPUs can run it.

pythia-70m-v0

pythia-70m-v0 specs, VRAM requirements, and which GPUs can run it.

Qwen1.5-110B-Chat-AWQ

Qwen1.5-110B-Chat-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2-0.5B-Instruct

Qwen2-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2-1.5B-Instruct

Qwen2-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2-7B-Instruct

Qwen2-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-0.5B

Qwen2.5-0.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-0.5B-Instruct

Qwen2.5-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B

Qwen2.5-1.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-Instruct

Qwen2.5-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-Instruct-AWQ

Qwen2.5-1.5B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-1.5B-quantized.w8a8

Qwen2.5-1.5B-quantized.w8a8 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-14B-Instruct-AWQ

Qwen2.5-14B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B

Qwen2.5-32B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B-Instruct-AWQ

Qwen2.5-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-32B-Instruct-GPTQ-Int4

Qwen2.5-32B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-3B

Qwen2.5-3B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-3B-Instruct

Qwen2.5-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct

Qwen2.5-72B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct-abliterated

Qwen2.5-72B-Instruct-abliterated specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-72B-Instruct-AWQ

Qwen2.5-72B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-7B

Qwen2.5-7B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-7B-Instruct

Qwen2.5-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-0.5B-Instruct

Qwen2.5-Coder-0.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-1.5B-Instruct

Qwen2.5-Coder-1.5B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-14B-Instruct

Qwen2.5-Coder-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct

Qwen2.5-Coder-32B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-32B-Instruct-AWQ

Qwen2.5-Coder-32B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B

Qwen2.5-Coder-7B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct

Qwen2.5-Coder-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct-AWQ

Qwen2.5-Coder-7B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct-GPTQ-Int4

Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Math-1.5B

Qwen2.5-Math-1.5B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-VL-7B-Instruct-NVFP4

Qwen2.5-VL-7B-Instruct-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-0.6B

Qwen3-0.6B specs, VRAM requirements, and which GPUs can run it.

Qwen3-0.6B-FP8

Qwen3-0.6B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-1.7B

Qwen3-1.7B specs, VRAM requirements, and which GPUs can run it.

Qwen3-1.7B-Base

Qwen3-1.7B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-1.7B-GPTQ-Int8

Qwen3-1.7B-GPTQ-Int8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-FP8

Qwen3-14B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-Instruct

Qwen3-14B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-NVFP4

Qwen3-14B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B

Qwen3-235B-A22B specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B-Instruct-2507-FP8

Qwen3-235B-A22B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-235B-A22B-NVFP4

Qwen3-235B-A22B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-Instruct-2507-FP8

Qwen3-30B-A3B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-NVFP4

Qwen3-30B-A3B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-30B-A3B-Thinking-2507

Qwen3-30B-A3B-Thinking-2507 specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-AWQ

Qwen3-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-32B-NVFP4

Qwen3-32B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-AWQ

Qwen3-4B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-Base

Qwen3-4B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-Instruct-2507

Qwen3-4B-Instruct-2507 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-Instruct-2507-FP8

Qwen3-4B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-SafeRL

Qwen3-4B-SafeRL specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-Thinking-2507

Qwen3-4B-Thinking-2507 specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B

Qwen3-8B specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-AWQ

Qwen3-8B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-Base

Qwen3-8B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-FP8

Qwen3-8B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-NVFP4

Qwen3-8B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-30B-A3B-Instruct

Qwen3-Coder-30B-A3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-30B-A3B-Instruct-FP8

Qwen3-Coder-30B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-30B-A3B-Instruct-gptq-8bit

Qwen3-Coder-30B-A3B-Instruct-gptq-8bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next

Qwen3-Coder-Next specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-8bit

Qwen3-Coder-Next-8bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-AWQ-4bit

Qwen3-Coder-Next-AWQ-4bit specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-Base

Qwen3-Coder-Next-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-Coder-Next-FP8

Qwen3-Coder-Next-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-Next-80B-A3B-Instruct

Qwen3-Next-80B-A3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen3-Next-80B-A3B-Instruct-FP8

Qwen3-Next-80B-A3B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-VL-30B-A3B-Instruct-AWQ

Qwen3-VL-30B-A3B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-27B-Text-NVFP4-MTP

Qwen3.5-27B-Text-NVFP4-MTP specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-35B-A3B-Text-qx64-hi-mlx

Qwen3.5-35B-A3B-Text-qx64-hi-mlx specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-4B-Safety-Thinking

Qwen3.5-4B-Safety-Thinking specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-9B-abliterated

Qwen3.5-9B-abliterated specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-0.6B

Qwen3Guard-Gen-0.6B specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-4B

Qwen3Guard-Gen-4B specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-8B

Qwen3Guard-Gen-8B specs, VRAM requirements, and which GPUs can run it.

QwQ-32B-AWQ

QwQ-32B-AWQ specs, VRAM requirements, and which GPUs can run it.

recurrentgemma-2b

recurrentgemma-2b specs, VRAM requirements, and which GPUs can run it.

recurrentgemma-9b

recurrentgemma-9b specs, VRAM requirements, and which GPUs can run it.

recurrentgemma-9b-it

recurrentgemma-9b-it specs, VRAM requirements, and which GPUs can run it.

Ring-2.5-1T

Ring-2.5-1T specs, VRAM requirements, and which GPUs can run it.

saiga_llama3_8b

saiga_llama3_8b specs, VRAM requirements, and which GPUs can run it.

sarvam-105b-uncensored

sarvam-105b-uncensored specs, VRAM requirements, and which GPUs can run it.

shieldgemma-27b

shieldgemma-27b specs, VRAM requirements, and which GPUs can run it.

SmolLM-135M-Instruct

SmolLM-135M-Instruct specs, VRAM requirements, and which GPUs can run it.

SmolLM2-135M

SmolLM2-135M specs, VRAM requirements, and which GPUs can run it.

SmolLM2-135M-Instruct

SmolLM2-135M-Instruct specs, VRAM requirements, and which GPUs can run it.

SOLAR-10.7B-Instruct-v1.0

SOLAR-10.7B-Instruct-v1.0 specs, VRAM requirements, and which GPUs can run it.

SOLAR-10.7B-v1.0

SOLAR-10.7B-v1.0 specs, VRAM requirements, and which GPUs can run it.

stable-code-3b

stable-code-3b specs, VRAM requirements, and which GPUs can run it.

StableBeluga-13B

StableBeluga-13B specs, VRAM requirements, and which GPUs can run it.

StableBeluga-7B

StableBeluga-7B specs, VRAM requirements, and which GPUs can run it.

StableBeluga1-Delta

StableBeluga1-Delta specs, VRAM requirements, and which GPUs can run it.

stablecode-completion-alpha-3b-4k

stablecode-completion-alpha-3b-4k specs, VRAM requirements, and which GPUs can run it.

stablelm-2-1_6b

stablelm-2-1_6b specs, VRAM requirements, and which GPUs can run it.

stablelm-2-1_6b-chat

stablelm-2-1_6b-chat specs, VRAM requirements, and which GPUs can run it.

stablelm-2-12b

stablelm-2-12b specs, VRAM requirements, and which GPUs can run it.

stablelm-2-zephyr-1_6b

stablelm-2-zephyr-1_6b specs, VRAM requirements, and which GPUs can run it.

stablelm-3b-4e1t

stablelm-3b-4e1t specs, VRAM requirements, and which GPUs can run it.

stablelm-base-alpha-7b-v2

stablelm-base-alpha-7b-v2 specs, VRAM requirements, and which GPUs can run it.

stablelm-zephyr-3b

stablelm-zephyr-3b specs, VRAM requirements, and which GPUs can run it.

starchat-alpha

starchat-alpha specs, VRAM requirements, and which GPUs can run it.

starchat-beta

starchat-beta specs, VRAM requirements, and which GPUs can run it.

Starling-LM-7B-beta

Starling-LM-7B-beta specs, VRAM requirements, and which GPUs can run it.

steerling-8b

steerling-8b specs, VRAM requirements, and which GPUs can run it.

Step-3.5-Flash

Step-3.5-Flash specs, VRAM requirements, and which GPUs can run it.

Step-3.5-Flash-FP8

Step-3.5-Flash-FP8 specs, VRAM requirements, and which GPUs can run it.

stories15M_MOE

stories15M_MOE specs, VRAM requirements, and which GPUs can run it.

Strand-Rust-Coder-14B-v1

Strand-Rust-Coder-14B-v1 specs, VRAM requirements, and which GPUs can run it.

tiny-aya-global

tiny-aya-global specs, VRAM requirements, and which GPUs can run it.

tiny-random-Gemma2ForCausalLM

tiny-random-Gemma2ForCausalLM specs, VRAM requirements, and which GPUs can run it.

tiny-random-stablelm-2

tiny-random-stablelm-2 specs, VRAM requirements, and which GPUs can run it.

TinyLlama-1.1B-Chat-v0.3-GPTQ

TinyLlama-1.1B-Chat-v0.3-GPTQ specs, VRAM requirements, and which GPUs can run it.

TinyLlama-1.1B-Chat-v1.0

TinyLlama-1.1B-Chat-v1.0 specs, VRAM requirements, and which GPUs can run it.

TinySolar-248m-4k

TinySolar-248m-4k specs, VRAM requirements, and which GPUs can run it.

tinyteapot

tinyteapot specs, VRAM requirements, and which GPUs can run it.

tulu-2-dpo-70b

tulu-2-dpo-70b specs, VRAM requirements, and which GPUs can run it.

Turkish-Gemma-9b-T1

Turkish-Gemma-9b-T1 specs, VRAM requirements, and which GPUs can run it.

txgemma-27b-chat

txgemma-27b-chat specs, VRAM requirements, and which GPUs can run it.

txgemma-27b-predict

txgemma-27b-predict specs, VRAM requirements, and which GPUs can run it.

txgemma-2b-predict

txgemma-2b-predict specs, VRAM requirements, and which GPUs can run it.

txgemma-9b-chat

txgemma-9b-chat specs, VRAM requirements, and which GPUs can run it.

UserLM-8b

UserLM-8b specs, VRAM requirements, and which GPUs can run it.

vaultgemma-1b

vaultgemma-1b specs, VRAM requirements, and which GPUs can run it.

wildguard

wildguard specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B

Yi-1.5-34B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-32K

Yi-1.5-34B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat

Yi-1.5-34B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-34B-Chat-16K

Yi-1.5-34B-Chat-16K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-6B

Yi-1.5-6B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-6B-Chat

Yi-1.5-6B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B

Yi-1.5-9B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-32K

Yi-1.5-9B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-Chat

Yi-1.5-9B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-Chat-16K

Yi-1.5-9B-Chat-16K specs, VRAM requirements, and which GPUs can run it.

Yi-34B

Yi-34B specs, VRAM requirements, and which GPUs can run it.

Yi-34B-Chat-8bits

Yi-34B-Chat-8bits specs, VRAM requirements, and which GPUs can run it.

Yi-6B

Yi-6B specs, VRAM requirements, and which GPUs can run it.

Yi-6B-200K

Yi-6B-200K specs, VRAM requirements, and which GPUs can run it.

Yi-6B-Chat

Yi-6B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-6B-Chat-4bits

Yi-6B-Chat-4bits specs, VRAM requirements, and which GPUs can run it.

Yi-9B

Yi-9B specs, VRAM requirements, and which GPUs can run it.

Yi-9B-200K

Yi-9B-200K specs, VRAM requirements, and which GPUs can run it.

Yi-Coder-9B

Yi-Coder-9B specs, VRAM requirements, and which GPUs can run it.

Yi-Coder-9B-Chat

Yi-Coder-9B-Chat specs, VRAM requirements, and which GPUs can run it.

zephyr-7b-alpha

zephyr-7b-alpha specs, VRAM requirements, and which GPUs can run it.

zephyr-7b-beta

zephyr-7b-beta specs, VRAM requirements, and which GPUs can run it.

zephyr-7b-gemma-sft-v0.1

zephyr-7b-gemma-sft-v0.1 specs, VRAM requirements, and which GPUs can run it.

zephyr-orpo-141b-A35b-v0.1

zephyr-orpo-141b-A35b-v0.1 specs, VRAM requirements, and which GPUs can run it.