Publications
See an updated list of publications in my Google scholar page.
Conference publications
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen, Zhuoran Yang, Tianyi Chen
ICML 2024. [arxiv]Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization
A.F.M. Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen
ICASSP 2024.A Method For Bilevel Optimization With Convex Lower-level Problem
Han Shen, Santiago Paternain, Gaowen Liu, Ramana Kompella, Tianyi Chen
ICASSP 2024.On Penalty-based Bilevel Gradient Descent Method
Han Shen, Tianyi Chen
ICML 2023. Extended version accepted to Mathematical Programming. [PMLR]Alternating projected SGD for equality-constrained bilevel optimization
Quan Xiao, Han Shen, Wotao Yin, Tianyi Chen
AISTATS 2023. [arxiv]A Single-timescale Analysis for Stochastic Approximation with Multiple Coupled Sequences
Han Shen, Tianyi Chen
NeurIPS 2022 (oral). [arxiv]Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Approach
Heshan D. Fernando, Han Shen, Miao Liu, Subhajit Chaudhury, Keerthiram Murugesan, Tianyi Chen
ICLR 2022 (oral). [arxiv]Distributed Offline Policy Optimization Over Batch Data
Han Shen, Songtao Lu, Xiaodong Cui, Tianyi Chen
AISTATS 2022. [html]
Journal publications
On Penalty-based Bilevel Gradient Descent Method
Han Shen, Quan Xiao, Tianyi Chen
Extended work of our ICML 2023 publication. Accepted in Mathematical Programming. [arxiv]Towards Understanding Asynchronous Advantage Actor-critic: Convergence and Linear Speedup
Han Shen, Kaiqing Zhang, Mingyi Hong, Tianyi Chen
IEEE Transactions on Signal Processing. [arxiv]Adaptive Temporal Difference Learning with Linear Function Approximation
Tao Sun, Han Shen, Tianyi Chen, Dongsheng Li
IEEE Transactions on Pattern Analysis and Machine Intelligence. [arxiv]Byzantine-resilient Decentralized Policy Evaluation with Linear Function Approximation
Zhaoxian Wu, Han Shen, Tianyi Chen, Qing Ling
IEEE Transactions on Signal Processing. [arxiv]