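Yes. Hugging Face hosts most of the Gemma 3 models (1B, 4B, 12B, and 27B), and these can be loaded into vLLM at 16-bit precision (for example, bfloat16). Strictly speaking, this is the checkpoints' 16-bit weight format rather than quantization.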
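As a rough illustration, the sketch below shows one way to load one of these checkpoints into vLLM at 16-bit precision. It makes several assumptions not stated in the answer above: that the instruction-tuned repositories are named `google/gemma-3-<size>-it`, that access to those gated repositories has already been granted on Hugging Face, and that `bfloat16` is the 16-bit format wanted. The helper names `GEMMA3_REPOS`, `engine_kwargs`, and `generate` are hypothetical and exist only for this example.

```python
# Assumed Hugging Face repo IDs for the instruction-tuned Gemma 3 checkpoints.
# Verify the names on the model cards; the repositories are license-gated.
GEMMA3_REPOS = {
    "1B": "google/gemma-3-1b-it",
    "4B": "google/gemma-3-4b-it",
    "12B": "google/gemma-3-12b-it",
    "27B": "google/gemma-3-27b-it",
}


def engine_kwargs(size: str, dtype: str = "bfloat16") -> dict:
    """Build keyword arguments for vllm.LLM for a given Gemma 3 size."""
    if size not in GEMMA3_REPOS:
        raise ValueError(f"unknown Gemma 3 size: {size!r}")
    # dtype="bfloat16" asks vLLM to hold the weights in 16-bit precision.
    return {"model": GEMMA3_REPOS[size], "dtype": dtype}


def generate(size: str, prompt: str, max_tokens: int = 64) -> str:
    """Load the model with vLLM and return one completion for the prompt."""
    # Imported lazily so the helpers above work without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(**engine_kwargs(size))
    params = SamplingParams(temperature=0.7, max_tokens=max_tokens)
    outputs = llm.generate([prompt], params)
    # Each request yields a RequestOutput; take its first completion's text.
    return outputs[0].outputs[0].text
```

On a machine with vLLM installed and a suitable GPU, calling `generate("1B", "Explain what vLLM does in one sentence.")` would download the weights on first use and return the generated text. The larger sizes need correspondingly more GPU memory.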
