Skip to content

Key Models

The following models are among the most commonly used with LLM Compressor: Llama 4, Qwen3, Kimi-K2, and Mistral Large 3. Each model page contains quantization examples with tested configurations and recommended parameters.

  • Llama 4


    Meta's Llama 4 Scout multimodal model.

    Llama 4

  • Qwen3


    Qwen3-VL MoE vision-language model.

    Qwen3

  • Kimi-K2


    Kimi-K2 Thinking model.

    Kimi-K2

  • Mistral Large 3


    Mistral's 675B parameter model.

    Mistral Large 3