Projected Policy Gradient Converges in a Finite Number of Iterations
Jiacai Liu, Wenye Li, and Ke Wei. Projected Policy Gradient Converges in a Finite Number of Iterations. arXiv:2311.0110, 2023.
Ph.D in Reinforcement Learning, Fudan University, (2023-2026)
Supervisor : Ke Wei
M.S. in Statistics, Fudan University, (2021 - 2023)
B.M. in Shanghai University of Finance and Economic, (2017 - 2021)
I rank 1st in GaoKao in Guangyuan City, Sichuan Province
Jiacai Liu, Wenye Li, and Ke Wei. Projected Policy Gradient Converges in a Finite Number of Iterations. arXiv:2311.0110, 2023.
Jiacai Liu, Jinchi Chen, and Ke Wei. On the Linear Convergence of Policy Gradient under Hadamard Parameterization. arXiv:2305.19575, 2023.
Presentation at Applied Math Ph.D. Seminar, Rm1801,Guanghua East Tower