3

Hang

Hang14

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

Skip a Layer or Loop It? Learning Program-of-Layers in LLMs

upvoted a paper 12 months ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

upvoted a paper over 1 year ago

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

Organizations

None yet

models 0

None public yet

datasets 0

None public yet