Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Yuhao Dong
THUdyh
Recent Activity
upvoted a paper 9 days ago
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding
authored a paper 17 days ago
From Pixels to Words -- Towards Native One-Vision Models at Scale
upvoted a paper 18 days ago
GEM: Generative Supervision Helps Embodied Intelligence