Zhao , MingmingLi , Yongfeng and Wen , Zaiwen. (2022). A Stochastic Trust-Region Framework for Policy Optimization. Journal of Computational Mathematics. 40 (6). 1004-1030. doi:10.4208/jcm.2104-m2021-0007