Daniel MILLER

graysonwal73

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 3 days ago

LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling

Paper • 2606.18023 • Published 8 days ago • 202

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

Paper • 2606.14700 • Published 12 days ago • 18

upvoted a paper 4 days ago

Looped World Models

Paper • 2606.18208 • Published 8 days ago • 455

upvoted a paper 20 days ago

Learning A Unified Risk Map for Autonomous Driving in Partially Observable Environments

Paper • 2605.22189 • Published May 21 • 8

upvoted 6 papers about 1 month ago

TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation

Paper • 2605.22355 • Published May 21 • 179

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published May 14 • 147

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published May 13 • 274

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published May 11 • 79

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 88

IntentGrasp: A Comprehensive Benchmark for Intent Understanding

Paper • 2605.06832 • Published May 7 • 8

upvoted 2 papers about 2 months ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published May 6 • 106

Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising

Paper • 2604.26694 • Published Apr 29 • 6

upvoted 4 papers 2 months ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 244

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 328

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 508

MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping

Paper • 2604.08364 • Published Apr 9 • 103

upvoted a paper 3 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

upvoted a paper 4 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198