
Rafael Medeiros

RafaelOM
1

AI & ML interests

None yet

Recent Activity

replied to danielhanchen's post 27 days ago
Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs. Google's new model, Gemma 4 12B Unified, supports image, audio, and 256K context. You can run and train the model via Unsloth Studio. GGUF: https://huggingface.co/unsloth/gemma-4-12b-it-GGUF Guide: https://unsloth.ai/docs/models/gemma-4

Organizations

None yet

liked a model 19 days ago

yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF

Text Generation • 12B • Updated 16 days ago • 641k • 2.6k
replied to danielhanchen's post 27 days ago

I tested https://huggingface.co/unsloth/gemma-4-12B-it-qat-GGUF on an RTX 4060 (8GB VRAM) using https://github.com/AtomicBot-ai/atomic-llama-cpp-turboquant, and it worked perfectly. I also used the MTP assistant model, https://huggingface.co/Janvitos/gemma-4-12B-it-qat-assistant-MTP-Q8_0-GGUF/tree/main, and everything loaded into VRAM.
