/
Alekh Agarwal, Microsoft Research NYC Alekh Agarwal, Microsoft Research NYC

Alekh Agarwal, Microsoft Research NYC - PowerPoint Presentation

liane-varnes
liane-varnes . @liane-varnes
Follow
357 views
Uploaded On 2018-10-05

Alekh Agarwal, Microsoft Research NYC - PPT Presentation

A Deployable Decision Service Contextual Bandit Learning Observe the state of the world aka context Choose an action Obtain feedback on the chosen action Repeat Goal Optimize feedback eg maximize reward for chosen actions ID: 684746

context learning service policy learning context policy service decision user library join server reward client api model contexts online rewards decisions offline

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "Alekh Agarwal, Microsoft Research NYC" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript