-
Whisper Realtime Transcription (Gradio UI)
👂4Transcribe audio in realtime - Gradio UI version
-
DeepSeek R1 Distill Qwen 1.5B Demo Q8
🔥10DeepSeek R1 Distill Qwen 1.5B Demo GGUF(Q8) Fully in CPU
-
Chain of Draft: Thinking Faster by Writing Less
Paper • 2502.18600 • Published • 50 -
Llama-4-Maverick-17B Research
🏃88Llama-4-Maverick-17B + Real Time Deep Research
Matricardi Fabio
FM-1976
AI & ML interests
control system engineering, AI, LLM with python. ThePoorGPUguy on substack
Recent Activity
liked a model 5 days ago
mradermacher/Tool-Star-Qwen-3B-GGUF liked a model 5 days ago
mradermacher/Tool-Star-Qwen-1.5B-GGUF liked a model 5 days ago
mradermacher/DeepSeek-R1-Distill-Llama-3B-tools-GGUFOrganizations
None yet
PAPERS
-
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 166 -
A Survey of Small Language Models
Paper • 2410.20011 • Published • 46 -
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 43 -
Chain of Draft: Thinking Faster by Writing Less
Paper • 2502.18600 • Published • 50
SMALL-TINY
A Collection of Small native Models
Image Creation
Good and working HF spaces to create images with Diffusion models
- Running on ZeroAgentsFeatured2.01k
Stable Diffusion 3.5 Large
🏃2.01kGenerate images with SD3.5
- Running on ZeroAgentsFeatured9.47k
FLUX.1 [dev]
🖥9.47kGenerate images from text prompts
- Running on ZeroAgentsFeatured5.07k
FLUX.1 [Schnell]
🏎5.07kGenerate images from text prompts instantly
- Running on ZeroAgents1.79k
DALLE 3 XL v2
🔥1.79kGenerate high‑resolution images from your text prompt
Playgrounds
- PausedFeatured904
Zephyr Chat
🪁904Chat with an AI model
- Running on ZeroAgentsFeatured126
Qwen VL
⚡126Ask questions about any image
- Runtime errorAgentsFeatured178
Zero2Story
📖178Create a custom story with characters and plot
- Running on CPU Upgrade7.49k
MTEB Leaderboard
📊7.49kEmbedding Leaderboard
GRADIO examples
- Runtime errorAgents4
Whisper Realtime Transcription (Gradio UI)
👂4Transcribe audio in realtime - Gradio UI version
- Build error10
DeepSeek R1 Distill Qwen 1.5B Demo Q8
🔥10DeepSeek R1 Distill Qwen 1.5B Demo GGUF(Q8) Fully in CPU
-
Chain of Draft: Thinking Faster by Writing Less
Paper • 2502.18600 • Published • 50 - RunningAgents88
Llama-4-Maverick-17B Research
🏃88Llama-4-Maverick-17B + Real Time Deep Research
Image Creation
Good and working HF spaces to create images with Diffusion models
- Running on ZeroAgentsFeatured2.01k
Stable Diffusion 3.5 Large
🏃2.01kGenerate images with SD3.5
- Running on ZeroAgentsFeatured9.47k
FLUX.1 [dev]
🖥9.47kGenerate images from text prompts
- Running on ZeroAgentsFeatured5.07k
FLUX.1 [Schnell]
🏎5.07kGenerate images from text prompts instantly
- Running on ZeroAgents1.79k
DALLE 3 XL v2
🔥1.79kGenerate high‑resolution images from your text prompt
PAPERS
-
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 166 -
A Survey of Small Language Models
Paper • 2410.20011 • Published • 46 -
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper • 2412.11768 • Published • 43 -
Chain of Draft: Thinking Faster by Writing Less
Paper • 2502.18600 • Published • 50
Playgrounds
- PausedFeatured904
Zephyr Chat
🪁904Chat with an AI model
- Running on ZeroAgentsFeatured126
Qwen VL
⚡126Ask questions about any image
- Runtime errorAgentsFeatured178
Zero2Story
📖178Create a custom story with characters and plot
- Running on CPU Upgrade7.49k
MTEB Leaderboard
📊7.49kEmbedding Leaderboard
SMALL-TINY
A Collection of Small native Models