RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic Text Generation • 561B • Updated 5 days ago • 1.17k • 1
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-block Text Generation • 561B • Updated 5 days ago • 873
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16 Text Generation • 565B • Updated 5 days ago • 890 • 3
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16 Text Generation • 565B • Updated 5 days ago • 890 • 3
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-block Text Generation • 561B • Updated 5 days ago • 873
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic Text Generation • 561B • Updated 5 days ago • 1.17k • 1
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-noise Image-Text-to-Text • 32B • Updated 24 days ago • 63
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-hybrid Image-Text-to-Text • 32B • Updated 24 days ago • 64
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-heuristic Image-Text-to-Text • 32B • Updated 24 days ago • 115
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-noise Image-Text-to-Text • 30B • Updated 24 days ago • 77
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-hybrid Image-Text-to-Text • 30B • Updated 24 days ago • 62
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-heuristic Image-Text-to-Text • 30B • Updated 24 days ago • 82
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-noise Image-Text-to-Text • 28B • Updated 24 days ago • 46
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated 24 days ago • 130
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-heuristic Image-Text-to-Text • 28B • Updated 24 days ago • 68
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-noise Image-Text-to-Text • 26B • Updated 24 days ago • 45
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-hybrid Image-Text-to-Text • 26B • Updated 24 days ago • 72