GGUF

gemma 3 1B QAT

google

gemma

Google's smallest Gemma 3 model in new quantization format that preserves bfloat16 quality

Model info

Model

gemma 3 1B QAT

Author

google

Arch

gemma

Parameters

1B

Format

gguf

Size on disk

about 720.43 MB

Download and run gemma 3 1B QAT

Open in LM Studio to view download options

Download gemma-3-1b-it-qat from the terminal

Download the model using lms — LM Studio's developer CLI.

lms get gemma-3-1b-it-qat

Call gemma-3-1b-it-qat from your code

curl http://localhost:1234/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemma-3-1b-it-qat",
    "messages": [
      { "role": "system", "content": "Always answer in rhymes." },
      { "role": "user", "content": "Introduce yourself." }
    ],
    "temperature": 0.7,
    "max_tokens": -1,
    "stream": true
  }'

Next Steps: Build! 🔨

Learn more


lmmy