🏫 I’m a second-year CS PhD student at UW-Madison. Before that, I completed my B.Eng in Computer Science (IEEE Honor Class) at Shanghai Jiao Tong University, where I was fortunate to work with Prof. Pengfei Liu and Prof. Junxian He.

♍️ My research focuses on large language models. I’m driven by the curiosity to understand how complex systems work.

🚀 As a research intern at Microsoft, I currently focus on agentic RL training.

🎓 Educations

  • 2024.09 - (now), PhD in CS, University of Wisconsin-Madison.
  • 2020.09 - 2024.06, B.Eng in CS (IEEE Honor Class), Shanghai Jiao Tong University.
  • 2023.02 - 2023.07, Exchange Student in CS, EPFL.

📖 Selected Publications

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
Z. Cai, W. Xiao, H. Sun, C. Luo, Y. Zhang, K. Wan, et al.
PDF | Github | NeurIPS, 2025

Dissecting Human and LLM Preferences
J. Li, F. Zhou, S. Sun, Y. Zhang, H. Zhao, P. Liu
PDF | Github | ACL, 2024

Extending LLMs’ Context Window with 100 Samples
Y. Zhang, J. Li, P. Liu
PDF | Github | arXiv, 2024

Pygmtools: A Python Graph Matching Toolkit
R. Wang, Z. Guo, W. Pan, J. Ma, Y. Zhang, et al.
PDF | Github | JMLR, 2024

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
Y. Huang*, Y. Bai*, Z. Zhu, J. Zhang, J. Zhang, T. Su, J. Liu, C. Lv, Y. Zhang, J. Lei, Y. Fu, M. Sun, J. He
PDF | Github | NeurIPS (Datasets and Benchmarks track), 2023

🧑‍🌾 Work Experience

  • Microsoft, 2025.6-Now

  • Amazon Web Services (AWS) Shanghai AI Lab, 2024.3-2024.8

  • Shanghai AI Lab, 2023.7-2023.10

🏆 Awards

  • Shanghai Outstanding Graduate (top 5%), 2024

  • Ubiquant Scholarship (awarded to 10 students across SJTU), 2023

  • SJTU Outstanding Student Scholarship (top 10%), 2020-2023

  • Longfor Foundation Scholarship (top 5%), 2021