yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF Text Generation โข 12B โข Updated 16 days ago โข 641k โข 2.6k
view reply I tested this https://huggingface.co/unsloth/gemma-4-12B-it-qat-GGUF on an RTX 4060 (8GB VRAM) using https://github.com/AtomicBot-ai/atomic-llama-cpp-turboquant, and it worked perfectly. I even used the assistant for MTP https://huggingface.co/Janvitos/gemma-4-12B-it-qat-assistant-MTP-Q8_0-GGUF/tree/main and everything loaded into VRAM.