Alex
M0nteCarl0
AI & ML interests
NLP, CV, information security ML, FinTech ML
Recent Activity
liked a model 2 days ago
google/functiongemma-270m-it
liked a model 7 days ago
openbmb/MiniCPM-V-4.6
new activity 9 days ago
Qwen/Qwen3-ASR-0.6B: How to use it on Jetson Orin Nano?
Organizations
None yet
Diffusion models
- SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices
  Paper • 2601.08303 • Published • 21
- SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers
  Paper • 2411.10510 • Published • 9
- Dynamic Chunking Diffusion Transformer
  Paper • 2603.06351 • Published • 16
- Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
  Paper • 2603.06577 • Published • 50
Voice cloning & TTS
llm
llm attentions
- Star Attention: Efficient LLM Inference over Long Sequences
  Paper • 2411.17116 • Published • 53
- MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens
  Paper • 2603.23516 • Published • 50
- TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
  Paper • 2604.04921 • Published • 114
- Let ViT Speak: Generative Language-Image Pre-training
  Paper • 2605.00809 • Published • 33
Video gen
3D reconstruct
Rag
- Agent Learning via Early Experience
  Paper • 2510.08558 • Published • 276
- DeepCode: Open Agentic Coding
  Paper • 2512.07921 • Published • 35
- Reinforcement Learning for Self-Improving Agent with Skill Library
  Paper • 2512.17102 • Published • 42
- Hyperagents
  Paper • 2603.19461 • Published • 51
3d
Clusterisation