RL for Planning and Planning for RL – Machine Learning Blog | ML@CMU | Carnegie Mellon University

RL for Planning and Planning for RL – Machine Learning Blog | ML@CMU | Carnegie Mellon University

The figure above illustrates the method:(a) Goal-conditioned RL often fails to reach distant goals, but can successfully reach the goal if starting nearby (inside the green region). (b) Our goal is to use observations in our replay buffer (yellow squares) as waypoints leading to the goal. (c) We aut

8 mentions: @DAIBuilds@towards_AI@stanford__ai
Date: 2020/02/13 15:00

Related Entries

Read more COTA: Improving Uber Customer Care with NLP & Machine Learning
0 users, 0 mentions 2018/06/11 19:29
Read more GitHub - asavinov/lambdo: Feature engineering and machine learning: together at last!
3 users, 23 mentions 2018/12/05 22:45
Read more GitHub - slundberg/shap: A unified approach to explain the output of any machine learning model.
0 users, 0 mentions 2018/06/27 10:28
Read more Variational Autoencoder in Tensorflow - facial expression low dimensional embedding - Machine learni...
0 users, 0 mentions 2018/04/22 03:40
Read more Proceedings of Machine Learning Research
0 users, 8 mentions 2019/05/25 08:18