Pengcheng Wang

PengchengW

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Lean4Agent: Formal Modeling and Verification for Agent Workflow and Trajectory

upvoted a paper about 1 month ago

Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning

upvoted a paper about 2 months ago

Recursive Multi-Agent Systems

View all activity

Organizations

upvoted a paper 12 days ago

Lean4Agent: Formal Modeling and Verification for Agent Workflow and Trajectory

Paper • 2606.06523 • Published 21 days ago • 6

upvoted a paper about 1 month ago

Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning

Paper • 2605.22642 • Published May 21 • 35

upvoted 2 papers about 2 months ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 284

Entropy-Regularized Process Reward Model

Paper • 2412.11006 • Published Dec 15, 2024 • 1

authored 2 papers 2 months ago

Entropy-Regularized Process Reward Model

Paper • 2412.11006 • Published Dec 15, 2024 • 1

Active Prompting with Chain-of-Thought for Large Language Models

Paper • 2302.12246 • Published Feb 23, 2023

upvoted a paper 2 months ago

AgentSPEX: An Agent SPecification and EXecution Language

Paper • 2604.13346 • Published Apr 14 • 167

updated a model 8 months ago

PengchengW/anlp-hw2-outputs

Updated Nov 4, 2025

published a model 8 months ago

PengchengW/anlp-hw2-outputs

Updated Nov 4, 2025

upvoted 2 papers 8 months ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published Oct 14, 2025 • 28

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published Oct 13, 2025 • 26

updated a model 9 months ago

PengchengW/adv-nlp-hw1-pw29

Text Classification • 22.7M • Updated Oct 3, 2025 • 5

published a model 9 months ago

PengchengW/adv-nlp-hw1-pw29

Text Classification • 22.7M • Updated Oct 3, 2025 • 5

upvoted a paper 10 months ago

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3, 2025 • 23

upvoted a paper 12 months ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

Paper • 2506.18945 • Published Jun 23, 2025 • 43

upvoted a paper about 1 year ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

Pengcheng Wang

AI & ML interests

Recent Activity

Organizations

PengchengW's activity