Michael Lubinsky's homepage

Programming, Math and Physics

View My GitHub Profile

https://habr.com/ru/companies/otus/articles/803041/

https://discord.com/invite/BCfHWaSMkr Aleksei Petrenko

https://arxiv.org/pdf/2304.00803.pdf A Tutorial Introduction to Reinforcement Learning

https://arxiv.org/pdf/2211.17217.pdf Tutorial on Neural Networks and Gradient-free Training

Tutorial on the Spectral Theory of Markov Chains https://arxiv.org/pdf/2207.02296.pdf

https://habr.com/ru/articles/789218/

https://www.youtube.com/watch?v=2pWv7GOvuf0

https://jonathan-hui.medium.com/rl-deep-reinforcement-learning-series-833319a95530

https://sites.ualberta.ca/~szepesva/RLBook.html

https://web.mit.edu/dimitrib/www/RLbook.html

https://www.manning.com/books/grokking-deep-reinforcement-learning

https://github.com/seungeunrho/minimalRL/blob/master/sac.py

https://stable-baselines3.readthedocs.io/en/master/

https://spinningup.openai.com/en/latest/

https://data-science-blog.com/blog/2021/07/31/my-elaborate-study-notes-on-reinforcement-learning/

Stanford CS330: Deep multitasking and meta learning:

https://www.youtube.com/watch?v=0rZtSwNOTQo&list=PLoROMvodv4rMC6zfYmnD7UG3LVvwaITY5

https://habr.com/ru/post/599923/

https://github.com/ShangtongZhang/reinforcement-learning-an-introduction

https://www.youtube.com/watch?v=i7q8bISGwMQ Steve Brunton

https://www.youtube.com/watch?v=0MNVhXEX9to

https://corecursive.com/061-reinforcement-learning/

http://incompleteideas.net/book/RLbook2018.pdf

https://news.ycombinator.com/item?id=19191746

http://incompleteideas.net/book/the-book.html . Sutton Barto book

https://www.newworldai.com/reinforcement-learning-course-deepmind/ . Course by Deep Mind

https://www.reddit.com/r/MachineLearning/comments/m0r7pe/d_best_book_hardcopy_for_rl_with_code/

http://benjamin-recht.github.io/2018/06/25/outsider-rl/

https://github.com/dennybritz/reinforcement-learning

https://github.com/marload/deep-rl-tf2

https://habr.com/ru/post/483078/

https://rlpyt.readthedocs.io/en/latest/

https://arxiv.org/abs/1911.08265 . Algo for Go, Chess, etc wich beats AlphaZero

https://www.youtube.com/watch?v=gTNNXi9ApVU

https://github.com/aikorea/awesome-rl

https://github.com/PacktPublishing/Deep-Reinforcement-Learning-Hands-On

https://habr.com/ru/post/443240/ . Q-learning

https://habr.com/ru/post/439674/ RL

https://habr.com/ru/company/hsespb/blog/444428/

http://www.argmin.net/2018/06/25/outsider-rl/

http://louiskirsch.com/maps/reinforcement-learning

https://spinningup.openai.com/en/latest/spinningup/keypapers.html

https://news.ycombinator.com/item?id=18412809

https://github.com/andri27-ts/60_Days_RL_Challenge

https://xaviergeerinck.com/markov-property-chain-reward-decision

https://mpatacchiola.github.io/blog/2016/12/09/dissecting-reinforcement-learning.html

https://github.com/Pulkit-Khandelwal/Reinforcement-Learning-Notebooks

https://habrahabr.ru/post/350328/ Reinforced learning

https://oneraynyday.github.io/ml/2018/05/06/Reinforcement-Learning-MDPs/

http://www.datasciencecentral.com/profiles/blogs/under-the-hood-with-reinforcement-learning-understanding-basic-rl

https://github.com/higgsfield/RL-Adventure-2

https://blog.insightdatascience.com/reinforcement-learning-from-scratch-819b65f074d8

https://news.ycombinator.com/item?id=17257143