Search Results for author: Zhengbang Zhu

Found 11 papers, 4 papers with code

Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning

no code implementations29 May 2024 Hanye Zhao, Xiaoshen Han, Zhengbang Zhu, Minghuan Liu, Yong Yu, Weinan Zhang

We propose Dynamics Diffusion, short as DyDiff, which can inject information from the learning policy to DMs iteratively.

DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching

no code implementations4 Feb 2024 Guanghe Li, Yixiang Shan, Zhengbang Zhu, Ting Long, Weinan Zhang

In offline reinforcement learning (RL), the performance of the learned policy highly depends on the quality of offline datasets.

D4RL Data Augmentation +4

MADiff: Offline Multi-agent Learning with Diffusion Models

1 code implementation27 May 2023 Zhengbang Zhu, Minghuan Liu, Liyuan Mao, Bingyi Kang, Minkai Xu, Yong Yu, Stefano Ermon, Weinan Zhang

MADiff is realized with an attention-based diffusion model to model the complex coordination among behaviors of multiple agents.

Offline RL Trajectory Prediction

Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents

no code implementations18 Dec 2022 Minghuan Liu, Zhengbang Zhu, Menghui Zhu, Yuzheng Zhuang, Weinan Zhang, Jianye Hao

In reinforcement learning applications like robotics, agents usually need to deal with various input/output features when specified with different state/action spaces by their developers or physical restrictions.

Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems

no code implementations11 Oct 2022 Zhengbang Zhu, Rongjun Qin, JunJie Huang, Xinyi Dai, Yang Yu, Yong Yu, Weinan Zhang

The increase in the measured performance, however, can have two possible attributions: a better understanding of user preferences, and a more proactive ability to utilize human bounded rationality to seduce user over-consumption.

Benchmarking Sequential Recommendation

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization

2 code implementations4 Mar 2022 Minghuan Liu, Zhengbang Zhu, Yuzheng Zhuang, Weinan Zhang, Jianye Hao, Yong Yu, Jun Wang

Recent progress in state-only imitation learning extends the scope of applicability of imitation learning to real-world settings by relieving the need for observing expert actions.

Imitation Learning Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.