T1: Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
Published in International Conference on Machine Learning (ICML), 2025
This work investigates reinforcement-learning-based reasoning enhancement and compute scaling at inference time.
Recommended citation: Hu, W., Xu, C., Hou, Z., et al. (2025). "T1: Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling." ICML.
Download Paper
