Medium

bloom-3b

bloom-3b specs, VRAM requirements, and which GPUs can run it.

bloom-7b1

bloom-7b1 specs, VRAM requirements, and which GPUs can run it.

bloomz-3b

bloomz-3b specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1

bloomz-7b1 specs, VRAM requirements, and which GPUs can run it.

bloomz-7b1-mt

bloomz-7b1-mt specs, VRAM requirements, and which GPUs can run it.

CodeLlama-7b-Instruct-hf

CodeLlama-7b-Instruct-hf specs, VRAM requirements, and which GPUs can run it.

deep-ignorance-unfiltered

deep-ignorance-unfiltered specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-6.7b-base

deepseek-coder-6.7b-base specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-6.7b-instruct

deepseek-coder-6.7b-instruct specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-7b-base-v1.5

deepseek-coder-7b-base-v1.5 specs, VRAM requirements, and which GPUs can run it.

deepseek-coder-7b-instruct-v1.5

deepseek-coder-7b-instruct-v1.5 specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-0528-Qwen3-8B

DeepSeek-R1-0528-Qwen3-8B specs, VRAM requirements, and which GPUs can run it.

DeepSeek-R1-Distill-Qwen-7B

DeepSeek-R1-Distill-Qwen-7B specs, VRAM requirements, and which GPUs can run it.

falcon-11B

falcon-11B specs, VRAM requirements, and which GPUs can run it.

falcon-7b-instruct

falcon-7b-instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-3B-Base

Falcon-H1-3B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-3B-Instruct

Falcon-H1-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-7B-Base

Falcon-H1-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon-H1-7B-Instruct

Falcon-H1-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

falcon-mamba-7b-instruct

falcon-mamba-7b-instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-10B-Base

Falcon3-10B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-3B-Base

Falcon3-3B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-3B-Instruct

Falcon3-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Base

Falcon3-7B-Base specs, VRAM requirements, and which GPUs can run it.

Falcon3-7B-Instruct

Falcon3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Flex-reddit-2x7B-1T

Flex-reddit-2x7B-1T specs, VRAM requirements, and which GPUs can run it.

gemma-1.1-7b-it

gemma-1.1-7b-it specs, VRAM requirements, and which GPUs can run it.

gemma-2-9b-it

gemma-2-9b-it specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-MLX-6bit

GLM-4.7-Flash-MLX-6bit specs, VRAM requirements, and which GPUs can run it.

GLM-4.7-Flash-MLX-8bit

GLM-4.7-Flash-MLX-8bit specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Pro-Llama-3-8B

Hermes-2-Pro-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Pro-Mistral-7B

Hermes-2-Pro-Mistral-7B specs, VRAM requirements, and which GPUs can run it.

Hermes-2-Theta-Llama-3-8B

Hermes-2-Theta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Hermes-3-Llama-3.1-8B

Hermes-3-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.

internlm2_5-7b

internlm2_5-7b specs, VRAM requirements, and which GPUs can run it.

internlm2-chat-7b-sft

internlm2-chat-7b-sft specs, VRAM requirements, and which GPUs can run it.

Jan-v3-4B-base-instruct

Jan-v3-4B-base-instruct specs, VRAM requirements, and which GPUs can run it.

LFM2-8B-A1B

LFM2-8B-A1B specs, VRAM requirements, and which GPUs can run it.

Llama 3.1 8B

Llama 3.1 8B specs, VRAM requirements, and which GPUs can run it. The go-to small model for local inference.

Llama-2-7b-hf

Llama-2-7b-hf specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-8B-Instruct-FP8

Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Llama-3.1-Tulu-3-8B-SFT

Llama-3.1-Tulu-3-8B-SFT specs, VRAM requirements, and which GPUs can run it.

Llama-3.2-3B

Llama-3.2-3B specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-8B

Llama-Guard-3-8B specs, VRAM requirements, and which GPUs can run it.

Llama-Guard-3-8B-INT8

Llama-Guard-3-8B-INT8 specs, VRAM requirements, and which GPUs can run it.

LlamaGuard-7b

LlamaGuard-7b specs, VRAM requirements, and which GPUs can run it.

llm-jp-3-3.7b-instruct

llm-jp-3-3.7b-instruct specs, VRAM requirements, and which GPUs can run it.

LocoOperator-4B

LocoOperator-4B specs, VRAM requirements, and which GPUs can run it.

maira-2

maira-2 specs, VRAM requirements, and which GPUs can run it.

MediPhi-Clinical

MediPhi-Clinical specs, VRAM requirements, and which GPUs can run it.

MediPhi-Instruct

MediPhi-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-8B

Meta-Llama-3-8B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3-8B-Instruct

Meta-Llama-3-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B

Meta-Llama-3.1-8B specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct

Meta-Llama-3.1-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct-bnb-4bit

Meta-Llama-3.1-8B-Instruct-bnb-4bit specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-3.1-8B-Instruct-FP8

Meta-Llama-3.1-8B-Instruct-FP8 specs, VRAM requirements, and which GPUs can run it.

Meta-Llama-Guard-2-8B

Meta-Llama-Guard-2-8B specs, VRAM requirements, and which GPUs can run it.

Mistral 7B

Mistral 7B specs, VRAM requirements, and which GPUs can run it. Efficient and fast for everyday tasks.

Mistral-7B-Instruct-v0.2

Mistral-7B-Instruct-v0.2 specs, VRAM requirements, and which GPUs can run it.

mistral-7b-v0.3-bnb-4bit

mistral-7b-v0.3-bnb-4bit specs, VRAM requirements, and which GPUs can run it.

Mistral-NeMo-Minitron-8B-Instruct

Mistral-NeMo-Minitron-8B-Instruct specs, VRAM requirements, and which GPUs can run it.

Nanbeige4.1-3B

Nanbeige4.1-3B specs, VRAM requirements, and which GPUs can run it.

Nanbeige4.1-3B-heretic

Nanbeige4.1-3B-heretic specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-4B-Base-8K

Nemotron-H-4B-Base-8K specs, VRAM requirements, and which GPUs can run it.

Nemotron-H-4B-Instruct-128K

Nemotron-H-4B-Instruct-128K specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-Mistral-7B-DPO

Nous-Hermes-2-Mistral-7B-DPO specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-2-SOLAR-10.7B

Nous-Hermes-2-SOLAR-10.7B specs, VRAM requirements, and which GPUs can run it.

Nous-Hermes-llama-2-7b

Nous-Hermes-llama-2-7b specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2

NVIDIA-Nemotron-Nano-9B-v2 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-Base

NVIDIA-Nemotron-Nano-9B-v2-Base specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-FP8

NVIDIA-Nemotron-Nano-9B-v2-FP8 specs, VRAM requirements, and which GPUs can run it.

NVIDIA-Nemotron-Nano-9B-v2-Japanese

NVIDIA-Nemotron-Nano-9B-v2-Japanese specs, VRAM requirements, and which GPUs can run it.

OLMo-2-1124-7B-Instruct

OLMo-2-1124-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-1025-7B

Olmo-3-1025-7B specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct

Olmo-3-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct-DPO

Olmo-3-7B-Instruct-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Instruct-SFT

Olmo-3-7B-Instruct-SFT specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think

Olmo-3-7B-Think specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think-DPO

Olmo-3-7B-Think-DPO specs, VRAM requirements, and which GPUs can run it.

Olmo-3-7B-Think-SFT

Olmo-3-7B-Think-SFT specs, VRAM requirements, and which GPUs can run it.

Olmo-3.1-7B-RL-Zero-Math

Olmo-3.1-7B-RL-Zero-Math specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-0724-hf

OLMo-7B-0724-hf specs, VRAM requirements, and which GPUs can run it.

OLMo-7B-hf

OLMo-7B-hf specs, VRAM requirements, and which GPUs can run it.

Olmo-Hybrid-Instruct-DPO-7B

Olmo-Hybrid-Instruct-DPO-7B specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0125

OLMoE-1B-7B-0125 specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0125-Instruct

OLMoE-1B-7B-0125-Instruct specs, VRAM requirements, and which GPUs can run it.

OLMoE-1B-7B-0924-Instruct

OLMoE-1B-7B-0924-Instruct specs, VRAM requirements, and which GPUs can run it.

Phi-3-mini-4k-instruct-gptq-4bit

Phi-3-mini-4k-instruct-gptq-4bit specs, VRAM requirements, and which GPUs can run it.

Phi-3-small-8k-instruct

Phi-3-small-8k-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-mini-MoE-instruct

Phi-mini-MoE-instruct specs, VRAM requirements, and which GPUs can run it.

Phi-tiny-MoE-instruct

Phi-tiny-MoE-instruct specs, VRAM requirements, and which GPUs can run it.

polyglot-ko-5.8b

polyglot-ko-5.8b specs, VRAM requirements, and which GPUs can run it.

pythia-12b

pythia-12b specs, VRAM requirements, and which GPUs can run it.

pythia-6.9b

pythia-6.9b specs, VRAM requirements, and which GPUs can run it.

Qwen2-7B-Instruct

Qwen2-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-3B

Qwen2.5-3B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-3B-Instruct

Qwen2.5-3B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-7B

Qwen2.5-7B specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-7B-Instruct

Qwen2.5-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct

Qwen2.5-Coder-7B-Instruct specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct-AWQ

Qwen2.5-Coder-7B-Instruct-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-Coder-7B-Instruct-GPTQ-Int4

Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 specs, VRAM requirements, and which GPUs can run it.

Qwen2.5-VL-7B-Instruct-NVFP4

Qwen2.5-VL-7B-Instruct-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-14B-NVFP4

Qwen3-14B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-AWQ

Qwen3-4B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-Instruct-2507-FP8

Qwen3-4B-Instruct-2507-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-4B-SafeRL

Qwen3-4B-SafeRL specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-AWQ

Qwen3-8B-AWQ specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-Base

Qwen3-8B-Base specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-FP8

Qwen3-8B-FP8 specs, VRAM requirements, and which GPUs can run it.

Qwen3-8B-NVFP4

Qwen3-8B-NVFP4 specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-4B-Safety-Thinking

Qwen3.5-4B-Safety-Thinking specs, VRAM requirements, and which GPUs can run it.

Qwen3.5-9B-abliterated

Qwen3.5-9B-abliterated specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-4B

Qwen3Guard-Gen-4B specs, VRAM requirements, and which GPUs can run it.

Qwen3Guard-Gen-8B

Qwen3Guard-Gen-8B specs, VRAM requirements, and which GPUs can run it.

saiga_llama3_8b

saiga_llama3_8b specs, VRAM requirements, and which GPUs can run it.

SOLAR-10.7B-v1.0

SOLAR-10.7B-v1.0 specs, VRAM requirements, and which GPUs can run it.

stablelm-base-alpha-7b-v2

stablelm-base-alpha-7b-v2 specs, VRAM requirements, and which GPUs can run it.

Starling-LM-7B-beta

Starling-LM-7B-beta specs, VRAM requirements, and which GPUs can run it.

steerling-8b

steerling-8b specs, VRAM requirements, and which GPUs can run it.

tiny-aya-global

tiny-aya-global specs, VRAM requirements, and which GPUs can run it.

wildguard

wildguard specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-6B

Yi-1.5-6B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-6B-Chat

Yi-1.5-6B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B

Yi-1.5-9B specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-32K

Yi-1.5-9B-32K specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-Chat

Yi-1.5-9B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-1.5-9B-Chat-16K

Yi-1.5-9B-Chat-16K specs, VRAM requirements, and which GPUs can run it.

Yi-6B

Yi-6B specs, VRAM requirements, and which GPUs can run it.

Yi-6B-200K

Yi-6B-200K specs, VRAM requirements, and which GPUs can run it.

Yi-6B-Chat

Yi-6B-Chat specs, VRAM requirements, and which GPUs can run it.

Yi-9B

Yi-9B specs, VRAM requirements, and which GPUs can run it.

Yi-9B-200K

Yi-9B-200K specs, VRAM requirements, and which GPUs can run it.

Yi-Coder-9B

Yi-Coder-9B specs, VRAM requirements, and which GPUs can run it.

Yi-Coder-9B-Chat

Yi-Coder-9B-Chat specs, VRAM requirements, and which GPUs can run it.

zephyr-7b-beta

zephyr-7b-beta specs, VRAM requirements, and which GPUs can run it.