Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
(Generation)
(Reinforcement Learning)
Line 11: Line 11:
 
* Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
 
* Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
 
* Device Placement Optimization with Reinforcement Learning, Azalia Mirhoseini et al. https://arxiv.org/pdf/1706.04972.pdf
 
* Device Placement Optimization with Reinforcement Learning, Azalia Mirhoseini et al. https://arxiv.org/pdf/1706.04972.pdf
 +
* Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning, Gu et al., NIPS 2017 https://arxiv.org/abs/1706.00387
  
 
===Learning (General)===
 
===Learning (General)===

Revision as of 11:30, 5 September 2017

Relational Learning and Reasoning

Reinforcement Learning

Learning (General)

Generation

Dialog