Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
(Reinforcement Learning)
(Reinforcement Learning)
Line 16: Line 16:
 
===Reinforcement Learning===
 
===Reinforcement Learning===
 
* FeUdal Networks for Hierarchical Reinforcement Learning, Vezhnevets et al., ICML 2017 https://arxiv.org/pdf/1703.01161.pdf
 
* FeUdal Networks for Hierarchical Reinforcement Learning, Vezhnevets et al., ICML 2017 https://arxiv.org/pdf/1703.01161.pdf
 +
* Deep reinforcement learning from human preferences, Christiano et al., https://arxiv.org/pdf/1706.03741.pdf
 
* Deep Reinforcement Learning that Matters, Henderson et al., arxiv https://arxiv.org/pdf/1709.06560.pdf
 
* Deep Reinforcement Learning that Matters, Henderson et al., arxiv https://arxiv.org/pdf/1709.06560.pdf
 
* Robust Imitation of Diverse Behaviors, Wang et al. 2017, https://arxiv.org/pdf/1707.02747.pdf
 
* Robust Imitation of Diverse Behaviors, Wang et al. 2017, https://arxiv.org/pdf/1707.02747.pdf

Revision as of 15:59, 26 September 2017

  • 09/26:
    • Mahnaz Summer research presentation: Reinforced Pointer-Generator Network for Abstractive Summarization.
    • Xin: FeUdal Networks for Hierarchical Reinforcement Learning, Vezhnevets et al., ICML 2017 https://arxiv.org/pdf/1703.01161.pdf

Word Embeddings

Relational Learning and Reasoning

Reinforcement Learning

Learning (General)

Generation

Dialog

NLP for Computational Social Science