
Zhenyu Hou
Language Model; Agent; Reinforcement Learning. My recent work centers on post-training for reasoning and alignment.
- Beijing, China
- Tsinghua University
- Google Scholar
- GitHub
