Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
Line 16: Line 16:
 
*10/10:
 
*10/10:
 
** Generating Sentences by Editing Prototypes, Guu et al., arxiv https://arxiv.org/abs/1709.08878
 
** Generating Sentences by Editing Prototypes, Guu et al., arxiv https://arxiv.org/abs/1709.08878
** Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
+
** Ethan: Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, Lowe et al., NIPS 2017 https://arxiv.org/abs/1706.02275
  
 
===Word Embeddings===
 
===Word Embeddings===
Line 29: Line 29:
  
 
===Reinforcement Learning===
 
===Reinforcement Learning===
* FeUdal Networks for Hierarchical Reinforcement Learning, Vezhnevets et al., ICML 2017 https://arxiv.org/pdf/1703.01161.pdf
+
* Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
 
* Deep reinforcement learning from human preferences, Christiano et al., https://arxiv.org/pdf/1706.03741.pdf
 
* Deep reinforcement learning from human preferences, Christiano et al., https://arxiv.org/pdf/1706.03741.pdf
 
* Deep Reinforcement Learning that Matters, Henderson et al., arxiv https://arxiv.org/pdf/1709.06560.pdf
 
* Deep Reinforcement Learning that Matters, Henderson et al., arxiv https://arxiv.org/pdf/1709.06560.pdf

Revision as of 15:33, 3 October 2017

Time: Tuesday 5-6pm. Location: HFH 1132.

If you registered this class, you should contact the instructor to lead the discussion of one paper below. If you don't lead the discussion, you will then need to write a 3-page final report in NIPS 2017 style, comparing any two of the papers below.

  • 09/26:
    • Mahnaz Summer research presentation: Reinforced Pointer-Generator Network for Abstractive Summarization.
    • Xin: FeUdal Networks for Hierarchical Reinforcement Learning, Vezhnevets et al., ICML 2017 https://arxiv.org/pdf/1703.01161.pdf

Word Embeddings

Relational Learning and Reasoning

Reinforcement Learning

Learning (General)

Generation

Dialog

NLP for Computational Social Science