My name is Jingyan Shen. I am a first-year CS Ph.D. student at New York University, advised by Prof. Matus Telgarsky and Prof. Pavel Izmailov. Previously, I earned dual master's degrees from Tsinghua University and Columbia University. Prior to that, I completed my Bachelor’s degree at Wuhan University, majoring in Statistics.

My recent research focuses on LLM post-training, particularly on advancing the reasoning capabilities of large language models through reinforcement learning. I am broadly interested in data-centric machine learning and statistical machine learning.

I am also fortunate to work with many great scholars and mentors, and I’m deeply grateful for their guidance.

📝 Preprints and Publications

(†: equal contribution)

Large Language Model Reasoning:

  • When Reasoning Meets Its Laws

    Junyu Zhang†, Yifan Sun†, Tianang Leng†, Jingyan Shen†, Ziyin Liu, Paul Pu Liang, Huan Zhang
    Efficient Reasoning Workshop at NeurIPS 2025 (Oral Presentation, Best Paper Nomination) [Paper] [Website]

  • Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay

    Yifan Sun†, Jingyan Shen†, Yibin Wang†, Tianyu Chen, Zhendong Wang, Mingyuan Zhou, Huan Zhang
    NeurIPS 2025 [Paper]

Large Language Model Alignment:

  • MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

    Jingyan Shen†, Jiarui Yao†, Rui Yang†, Yifan Sun, Feng Luo, Rui Pan, Tong Zhang, Han Zhao
    EMNLP 2025 (Main) Outstanding Paper Award 🏆 [Paper]

  • Conformal Tail Risk Control for Large Language Model Alignment

    Catherine Chen, Jingyan Shen, Zhun Deng, Lihua Lei
    ICML 2025 [Paper]

  • Rethinking Diverse Human Preference Learning through Principal Component Analysis

    Feng Luo, Rui Yang, Hao Sun, Chunyuan Deng, Jiarui Yao, Jingyan Shen, Huan Zhang, Hanjie Chen
    ACL 2025 (Findings) [Paper]

Data-centric Machine Learning:

  • TimeInf: Time Series Data Contribution via Influence Functions

    Yizi Zhang†, Jingyan Shen†, Xiaoxue Xiong†, Yongchan Kwon
    ICLR 2025 [Paper] [Code]

  • 2D-OOB: Attributing Data Contribution through Joint Valuation Framework

    Yifan Sun†, Jingyan Shen†, Yongchan Kwon
    NeurIPS 2024 [Paper] [Code]

🎖 Honors and Awards

  • Outstanding Graduate Student (Top 1%), Tsinghua University, 2024
  • Excellent Graduate Thesis Award, Tsinghua University, 2024
  • Graduate Fellowship, Columbia University, 2023
  • Outstanding Undergraduate Student, Wuhan University, 2021
  • National Scholarship for Undergraduates, Ministry of Education of China, 2018

💻 Industry Experience

  • 2024.02 - 2025.06, Full-time machine learning engineer at Pinterest

🌟 Misc

I value the diversity and richness of life, and I hold a deep respect for beauty and purity in all their forms. Beyond research, I enjoy playing table tennis and tennis, playing drums, losing myself in a good book, or discovering new places through travel. My recent favorite novels include Flowers for Algernon and My Name Is Red. I am also a big fan of Stephen Sondheim’s musicals, particularly Company and Sunday in the Park with George. Born in Guangzhou, I’m picky about food and I particularly enjoy Japanese and Cantonese cuisine for their focus on fresh ingredients and natural flavors.