no code implementations • 24 May 2024 • Yinuo Wang, Likun Wang, YuXuan Jiang, Wenjun Zou, Tong Liu, Xujie Song, Wenxuan Wang, Liming Xiao, Jiang Wu, Jingliang Duan, Shengbo Eben Li
This algorithm conceptualizes the reverse process of the diffusion model as a novel policy function and leverages the capability of the diffusion model to fit multimodal distributions, thereby enhancing the representational capacity of the policy.
1 code implementation • 19 Mar 2024 • Wenjun Zou, Yao Lyu, Jie Li, Yujie Yang, Shengbo Eben Li, Jingliang Duan, Xianyuan Zhan, Jingjing Liu, Yaqin Zhang, Keqiang Li
Safe reinforcement learning (RL) offers advanced solutions to constrained optimal control problems.
1 code implementation • 14 Oct 2022 • Dongjie Yu, Wenjun Zou, Yujie Yang, Haitong Ma, Shengbo Eben Li, Jingliang Duan, Jianyu Chen
Furthermore, we build a safe RL framework to resolve constraints required by the DRC and its corresponding shield policy.
Model-based Reinforcement Learning reinforcement-learning +2