🏫 I’m a second-year CS PhD student at UW-Madison. Before that, I completed my B.Eng in Computer Science (IEEE Honor Class) at Shanghai Jiao Tong University, where I was fortunate to work with Prof. Pengfei Liu and Prof. Junxian He.
♍️ My research focuses on large language models. I’m driven by the curiosity to understand how complex systems work.
🚀 As a research intern at Microsoft, I currently focus on agentic RL training.
🎓 Educations
- 2024.09 - (now), PhD in CS, University of Wisconsin-Madison.
- 2020.09 - 2024.06, B.Eng in CS (IEEE Honor Class), Shanghai Jiao Tong University.
- 2023.02 - 2023.07, Exchange Student in CS, EPFL.
📖 Selected Publications
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
Z. Cai, W. Xiao, H. Sun, C. Luo, Y. Zhang, K. Wan, et al.
PDF | Github | NeurIPS, 2025
Dissecting Human and LLM Preferences
J. Li, F. Zhou, S. Sun, Y. Zhang, H. Zhao, P. Liu
PDF | Github | ACL, 2024
Pygmtools: A Python Graph Matching Toolkit
R. Wang, Z. Guo, W. Pan, J. Ma, Y. Zhang, et al.
PDF | Github | JMLR, 2024
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
Y. Huang*, Y. Bai*, Z. Zhu, J. Zhang, J. Zhang, T. Su, J. Liu, C. Lv, Y. Zhang, J. Lei, Y. Fu, M. Sun, J. He
PDF | Github | NeurIPS (Datasets and Benchmarks track), 2023
🧑🌾 Work Experience
-
Microsoft, 2025.6-Now
-
Amazon Web Services (AWS) Shanghai AI Lab, 2024.3-2024.8
-
Shanghai AI Lab, 2023.7-2023.10
🏆 Awards
-
Shanghai Outstanding Graduate (top 5%), 2024
-
Ubiquant Scholarship (awarded to 10 students across SJTU), 2023
-
SJTU Outstanding Student Scholarship (top 10%), 2020-2023
-
Longfor Foundation Scholarship (top 5%), 2021