/
Policy Gradient Methods Image source Policy Gradient Methods Image source

Policy Gradient Methods Image source - PowerPoint Presentation

kittie-lecroy
kittie-lecroy . @kittie-lecroy
Follow
346 views
Uploaded On 2019-03-20

Policy Gradient Methods Image source - PPT Presentation

Sources Stanford CS 231n Berkeley Deep RL course David Silvers RL course Policy Gradient Methods Instead of indirectly representing the policy using Qvalues it can be more efficient to parameterize and learn it directly ID: 758314

gradient policy action function policy gradient function action asynchronous current actions stochastic learning advantage probability critic actor trajectory estimate methods deep update

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "Policy Gradient Methods Image source" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript