Key Models
The following models are among the most commonly used with LLM Compressor: Llama 4, Qwen3, Kimi-K2, and Mistral Large 3. Each model page contains quantization examples with tested configurations and recommended parameters.
- Llama 4: Meta's Llama 4 Scout multimodal model.
- Qwen3: Qwen3-VL MoE vision-language model.
- Kimi-K2: Kimi-K2 Thinking model.
- Mistral Large 3: Mistral's 675B parameter model.