Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
Line 12: Line 12:
 
* Device Placement Optimization with Reinforcement Learning, Azalia Mirhoseini et al. https://arxiv.org/pdf/1706.04972.pdf
 
* Device Placement Optimization with Reinforcement Learning, Azalia Mirhoseini et al. https://arxiv.org/pdf/1706.04972.pdf
 
* Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning, Gu et al., NIPS 2017 https://arxiv.org/abs/1706.00387
 
* Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning, Gu et al., NIPS 2017 https://arxiv.org/abs/1706.00387
 +
* SEQUENCE LEVEL TRAINING WITH RECURRENT NEURAL NETWORKS https://arxiv.org/pdf/1511.06732.pdf
  
 
===Learning (General)===
 
===Learning (General)===

Revision as of 10:31, 8 September 2017

Relational Learning and Reasoning

Reinforcement Learning

Learning (General)

Generation

Dialog