Publications

See my Google Scholar page for an up-to-date list of publications.

Conference publications

  • Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
    Han Shen, Zhuoran Yang, Tianyi Chen
    ICML 2024. [arxiv]

  • Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization
    A.F.M. Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen
    ICASSP 2024.

  • A Method For Bilevel Optimization With Convex Lower-level Problem
    Han Shen, Santiago Paternain, Gaowen Liu, Ramana Kompella, Tianyi Chen
    ICASSP 2024.

  • On Penalty-based Bilevel Gradient Descent Method
    Han Shen, Tianyi Chen
    ICML 2023. Extended version accepted to Mathematical Programming. [PMLR]

  • Alternating Projected SGD for Equality-constrained Bilevel Optimization
    Quan Xiao, Han Shen, Wotao Yin, Tianyi Chen
    AISTATS 2023. [arxiv]

  • A Single-timescale Analysis for Stochastic Approximation with Multiple Coupled Sequences
    Han Shen, Tianyi Chen
    NeurIPS 2022 (oral). [arxiv]

  • Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Approach
    Heshan D. Fernando, Han Shen, Miao Liu, Subhajit Chaudhury, Keerthiram Murugesan, Tianyi Chen
    ICLR 2022 (oral). [arxiv]

  • Distributed Offline Policy Optimization Over Batch Data
    Han Shen, Songtao Lu, Xiaodong Cui, Tianyi Chen
    AISTATS 2022. [html]

Journal publications

  • On Penalty-based Bilevel Gradient Descent Method
    Han Shen, Quan Xiao, Tianyi Chen
    Extended version of our ICML 2023 paper. Accepted to Mathematical Programming. [arxiv]

  • Towards Understanding Asynchronous Advantage Actor-critic: Convergence and Linear Speedup
    Han Shen, Kaiqing Zhang, Mingyi Hong, Tianyi Chen
    IEEE Transactions on Signal Processing. [arxiv]

  • Adaptive Temporal Difference Learning with Linear Function Approximation
    Tao Sun, Han Shen, Tianyi Chen, Dongsheng Li
    IEEE Transactions on Pattern Analysis and Machine Intelligence. [arxiv]

  • Byzantine-resilient Decentralized Policy Evaluation with Linear Function Approximation
    Zhaoxian Wu, Han Shen, Tianyi Chen, Qing Ling
    IEEE Transactions on Signal Processing. [arxiv]