Policy Gradient Methods Image source

Policy Gradient Methods Image source

SO
Author: kittie-lecroy
| Published: 2019-03-20 | 494 Views

Sources Stanford CS 231n Berkeley Deep RL course David Silvers RL course Policy Gradient Methods Instead of indirectly representing the policy using Qvalues it can be more efficient to parameterize and learn it directly

Embed this Presentation

Available Downloads

Download Notice

Download Presentation The PPT/PDF document "Policy Gradient Methods Image source" is the property of its rightful owner. Permission is granted to download and print the materials on this website for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.

Presentation Transcript