L1 MDPs, Exact Solution Methods, Max-ent RL (Foundations of Deep RL Series)

53,475
Published 2021-08-24

All comments (21)
  • @prerakmathur1431
    This guy is seriously the god of reinforcement learning. He and Andrew Ng have single-handedly transformed ML. Kudos to you, Pieter.
  • @Prokage
    Thank you for everything you've done for the field over the years, Dr. Abbeel.
  • @Shah_Khan
    Thank you, Pieter, for bringing the latest lecture series on Deep RL. I was looking for just that.
  • @OK-bt6lu
    This was the best video lecture intro to deep RL that I have ever watched. Thanks a lot for sharing Prof. Abbeel! Please post more :)
  • Nice explanation about RL, Pieter! Will be watching your updates closer now
  • This is some of the best content I've seen in a long time. Congratulations, and thank you so much!
  • @itepsilon
    Thanks so much for sharing! Awesome!
  • @junghwanro4829
    Thank you for the great lecture. It was super helpful even after taking an RL course.
  • @user-zf5mf9lo6n
    Hi Pieter, thank you very much for this great lecture! I found a mistake on P54 of the slide attached. For the policy evaluation expression, the item in the last bracket should be " s' " instead of "s".
  • @blackdeutrium746
    Hi Professor, if I want to make a robot similar to the walking robot you made and showed, what do I have to learn? I'm quite interested in deep reinforcement learning.
  • Especially liked the 'intuition' part! What would be the best way to get more in-depth on some of the "prerequisites" for RL?
  • @eonr
    I believe there's a mistake at 51:01. The last term in the last two equations should be the value function of s' instead of s.