Policy Gradient Theorem Explained - Reinforcement Learning
59,490
Published 2020-11-22
Policy gradient methods are used in many of the current state-of-the-art reinforcement learning algorithms, and I think it is likely that policy gradient methods will play an important role in advancing the field of RL. I'm excited to continue exploring this field and sharing what I learn along the way.
Join our Discord community:
discord.gg/cdQhRgw
Connect with me:
Twitter - twitter.com/elliotwaite
Instagram - www.instagram.com/elliotwaite
Facebook - www.facebook.com/elliotwaite
LinkedIn - www.linkedin.com/in/elliotwaite
Kazukii - Return
soundcloud.com/ohthatkazuki
open.spotify.com/artist/5d07MpiIaNmmEMTq79KAga
youtube.com/user/OfficialKazuki
All Comments (21)
-
Thank you, John Cena.
-
This is the most intuitive derivation of Policy Gradient Theorem on the internet. Thank you for being my teacher! Every RL intro class should begin with this video.
-
This is a unique and awesome channel. Seeing these cool animations really helps to build an intuition. Thank you for uploading.
-
it's so entertaining and intriguing, thanks for your work, Elliot!
-
Really great video! I enjoyed the graphical representation of the gradient. Can't wait to see more in-depth policy gradient related videos.
-
One of the best! I'll have to watch the video a few times, but it's already helped me figure things out more intuitively. The example and animations are excellent. Great work.
-
This is the only video you need to learn the intuition and basics of Reinforcement Learning. Amazingly done! Thanks!
-
Hey Elliot, Loved this explanation so much, man! Keep up the awesome work. I'm a beginner and I feel RL is often looked upon as a difficult paradigm due to the heavy math in there, but people like you are a blessing, for putting the ideas in such an outstanding fashion :)
-
This was a very helpful video for me, thank you Elliot! The toy problem with the robot was so helpful to visualize everything, especially showing the state-action sequences with the probabilities.
-
This is awesome! Thank you for this detailed explanation!
-
Great Video! Using graphics and combining the logical explanation with pseudocode was really helpful to me. Most of the time you only see one or the other.
-
Love it, it has never been this clear to me as a visual person, thank you. We need a lot more, keep it up. Subscribed 100%.
-
Best video on the subject. Thank you, Elliot, for creating such thorough content.
-
One of the best Videos I have ever watched for RL AI! Great work and thanks a lot.
-
This is simply the best video I have seen on this topic this year!
-
This is pure gold! Thank you so much for the time, hard work, and energy you put into this video. It's highly appreciated.
-
A perfect video, I have never seen anything so good. Thanks from Brazil.
-
Thank you! It is extremely helpful to see a detailed worked example. Your video is very much appreciated.
-
Superb, bro. This is something that is missing in most of the other videos. This really helps build the intuition for beginners. Keep building such videos, thanks a ton.