About me

I earned my Ph.D. from RPI in December 2024 under the supervision of Dr. Tianyi Chen. I was fortunate to join Dr. Tianyi Chen’s group as the first Ph.D. student and to grow with the research group until my graduation. At the institute, my research primarily focused on optimization algorihthm for machine learning and reinforcement learning, with applications to LLM post-training and more.

After graduation, I joined Ant Group’s top talent program Ant Star, where I primarily work on LLM post-training algorithms with strong safety guarantees. Prior to this, I worked in IBM Research AI as a research intern mentored by Pin-Yu Chen and under the management of Payel Das. I also worked on offline reinforcement learning algorithms as a research scientist intern at IBM Research AI mentored by Songtao Lu and Xiaodong Cui.

News and highlights

Selected works

  • On Penalty-based Bilevel Gradient Descent Method
    Han Shen, Quan Xiao, Tianyi Chen
    conference version in ICML 2023, extended work in Mathematical Programming. [MAPR]

  • SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
    Han Shen, Pin-Yu Chen, Payel Das, Tianyi Chen
    ICLR 2025. [arxiv]

  • Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning
    Heshan Fernando*, Han Shen*, Parikshit Ram, Yi Zhou, Horst Samulowitz, Nathalie Baracaldo, Tianyi Chen
    *equal contribution, new preprint. [arxiv]

  • Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
    Han Shen, Zhuoran Yang, Tianyi Chen
    conference version in ICML 2024, extended work to appear in JMLR [arxiv]

  • Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Approach
    Heshan D. Fernando, Han Shen, Miao Liu, Subhajit Chaudhury, Keerthiram Murugesan, Tianyi Chen
    ICLR 2023 oral. [arxiv]

Services

Reviewer/program committee for

  • Advances in Neural Information Processing Systems (NeurIPS)
  • International Conference on Machine Learning (ICML)
  • International Conference on Learning Representation (ICLR)
  • International Conference on Artificial Intelligence and Statistic (AISTATS)
  • Annual AAAI Conference on Artificial Intelligence (AAAI)
  • IEEE Transactions on Signal Processing (TSP)

Industry experiences

Ant Group. (CH) Present

  • Working as AI algorithm engineer in Ant Star talent program.

IBM Research AI. (US) 05.2024 - 08.2024

IBM Research AI. (US) 05.2021 - 08.2021