Deterministic Policy Gradient Algorithms. David Silver (david@deepmind.com), DeepMind Technologies, London, UK; Guy Lever (guy.lever@ucl.ac.uk), University College London, UK; Nicolas Heess, Thomas Degris, Daan Wi

Author: tatyana-admore | Published: 2014-12-14 | 747 Views

The deterministic policy gradient has a particularly appealing form: it is the expected gradient of the action-value function. This simple form means that the deterministic policy gradient can be estimated much more efficiently than the usual sto
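To make "the expected gradient of the action-value function" concrete, here is a minimal sketch in plain Python. Everything in it is an illustrative assumption rather than something taken from the presentation: the linear policy `mu`, the hand-written quadratic action-value function behind `dq_da`, the hypothetical target weights `w`, and the step size. It averages, over sampled states, the policy's parameter gradient multiplied by the action gradient of Q evaluated at the policy's own action, then follows that estimate by gradient ascent.

```python
import random


def mu(theta, s):
    """Assumed deterministic linear policy: a = theta . s (scalar action)."""
    return sum(t * x for t, x in zip(theta, s))


def dq_da(s, a, w):
    """Action gradient of an assumed critic Q(s, a) = -(a - w . s)^2."""
    return -2.0 * (a - mu(w, s))


def dpg_estimate(theta, states, w):
    """Sample average of grad_theta mu(s) * grad_a Q(s, a) at a = mu(s)."""
    grad = [0.0] * len(theta)
    for s in states:
        g = dq_da(s, mu(theta, s), w)
        for i, x in enumerate(s):
            # For a linear policy, grad_theta mu(s) is simply s.
            grad[i] += x * g
    return [g / len(states) for g in grad]


random.seed(0)
w = [1.0, -2.0]  # hypothetical weights defining the best action w . s
states = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(500)]

theta = [0.0, 0.0]
for _ in range(200):
    grad = dpg_estimate(theta, states, w)
    theta = [t + 0.1 * g for t, g in zip(theta, grad)]  # gradient ascent step
```

In this toy setting no sampling over actions is needed: each state contributes one gradient evaluated at the single action the policy selects, and `theta` moves toward `w`. A real agent would have to learn the critic instead of assuming its form.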

Presentation Transcript