2020/04/18
-----
// 強化學習演進路線 [7]。
-----
References
[1] 深度强化学习cs294 Lecture5: Policy Gradients Introduction_人工智能_无所知的博客-CSDN博客
https://blog.csdn.net/qq_25037903/article/details/84573048
[2] CS 285
http://rail.eecs.berkeley.edu/deeprlcourse/
[3] CS294-112 Fa18 9/5/18 - YouTube
https://m.youtube.com/watch?v=XGmd3wcyDg8&list=PLkFD6_40KJIxJMR-j5A1mkxK26gh_qg37&index=21
[4] 强化学习系列(十三):Policy Gradient Methods_网络_LagrangeSK的博客-CSDN博客
https://blog.csdn.net/lagrangesk/article/details/82865578
[5] Teaching - David Silver
https://www.davidsilver.uk/teaching/
[6] RL Course by David Silver - Lecture 7: Policy Gradient Methods - YouTube
https://m.youtube.com/watch?v=KHZVXao4qXs
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.