Thursday, February 25, 2021

Reinforcement Learning (4): Policy Gradient

2020/04/18

-----


// The evolution of reinforcement learning [7].

-----

References

[1] Deep Reinforcement Learning CS294 Lecture 5: Policy Gradients Introduction - 无所知's blog, CSDN
https://blog.csdn.net/qq_25037903/article/details/84573048

[2] CS 285
http://rail.eecs.berkeley.edu/deeprlcourse/

[3] CS294-112 Fa18 9/5/18 - YouTube
https://m.youtube.com/watch?v=XGmd3wcyDg8&list=PLkFD6_40KJIxJMR-j5A1mkxK26gh_qg37&index=21

[4] Reinforcement Learning Series (13): Policy Gradient Methods - LagrangeSK's blog, CSDN
https://blog.csdn.net/lagrangesk/article/details/82865578

[5] Teaching - David Silver
https://www.davidsilver.uk/teaching/

[6] RL Course by David Silver - Lecture 7: Policy Gradient Methods - YouTube
https://m.youtube.com/watch?v=KHZVXao4qXs

[7] The Evolution of Reinforcement Learning - Zhihu
https://zhuanlan.zhihu.com/p/49429128

-----

