L1 MDPs, Exact Solution Methods, Max-ent RL (Foundations of Deep RL Series)

53,475

1,028 0

Publicado 2021-08-24

Lecture 1 of a 6-lecture series on the Foundations of Deep RL
Topic: MDPs, Exact Solution Methods, Max-ent RL
Instructor: Pieter Abbeel

Slides: www.dropbox.com/s/f9xyrdkpqugtrvq/l1-mdps-exact-me…

Todos los comentarios (21)

@prerakmathur1431 hace 2 años

This guy is seriously the god of reinforcement learning. He and Andrew Ng have single handedly transformed ML. Kudos to you Pieter.
@unionsafetymatch hace 2 años

I don't believe what I've stumbled upon. This is amazing!
@Prokage hace 2 años

Thank you for everything you've done for the field over the years, Dr. Abbeel.
@Shah_Khan hace 2 años

Thank you Pieter for briniging the latest lecture series on Deep RL. I was looking just for that.
@OK-bt6lu hace 1 año

This was the best video lecture intro to deep RL that I have ever watched. Thanks a lot for sharing Prof. Abbeel! Please post more :)
@henriquepett2124 hace 1 año

Nice explanation about RL, Pieter! Will be watching your updates closer now
@danielmoreiradesousa185 hace 4 meses

This is one of the best content I've seen in a long time, congratulations and thank you so much!
@itepsilon hace 2 años

Thanks so much for sharing! Awesome!
@junghwanro4829 hace 9 meses

Thank you for the great lecture. It was super helpful even after taking a RL course.
@user-zf5mf9lo6n hace 2 años

Hi Pieter, thank you very much for this great lecture! I found a mistake on P54 of the slide attached. For the policy evaluation expression, the item in the last bracket should be " s' " instead of "s".
@BruinChang hace 2 años

Much thanks, no other words.
@hongkyulee9724 hace 4 meses

This lecture is my first and best RL lecture. ❤❤
@guoshenli4193 hace 2 años

great lecture, so much thanks!!!!
@user-or7ji5hv8y hace 2 años

Awesome!
@datascience6104 hace 2 años

Thanks for sharing 👍
@user-we5so5xm5y hace 2 años

Thank you!
@goldfishjy95 hace 2 años

omg thank you so much!!!!
@blackdeutrium746 hace 2 años

Hi proffessor , the walking robot you just made and showed if I wanna make a similar type of robot what I have to learn ? I quite interested in deep reinforcement learning
@offthepathworks9171 hace 6 meses

Especially liked the 'intuition' part! What would be the best way to get more in-depth on some of the "prerequisites" for RL?
@eonr hace 1 año

I believe there's a mistake at 51:01 . The last term in the last two equations should be the value function of s' instead of s.