T1: Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling

Published in International Conference on Machine Learning (ICML), 2025

This work investigates reinforcement-learning-based reasoning enhancement and compute scaling at inference time.

Recommended citation: Hu, W., Xu, C., Hou, Z., et al. (2025). "T1: Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling." ICML.
Download Paper