CVaR-Constrained Policy Optimization for Safe Reinforcement Learning.
Zhang Q, Leng S, Ma X, Liu Q, Wang X, Liang B, Liu Y, Yang J.
IEEE Trans Neural Netw Learn Syst. 2024 Feb 23;PP. doi: 10.1109/TNNLS.2023.3331304. Online ahead of print.
PMID: 38393836