Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
Line 44: Line 44:
 
*11/28:
 
*11/28:
 
** Adam: Generalization in Deep Learning, https://arxiv.org/pdf/1710.05468.pdf  
 
** Adam: Generalization in Deep Learning, https://arxiv.org/pdf/1710.05468.pdf  
 +
** Austin: Deep Reinforcement Learning that Matters, Henderson et al., arxiv https://arxiv.org/pdf/1709.06560.pdf
  
 
*12/05: No meeting, NIPS conference.
 
*12/05: No meeting, NIPS conference.
Line 57: Line 58:
 
===Reinforcement Learning===
 
===Reinforcement Learning===
 
* Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
 
* Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
* Deep Reinforcement Learning that Matters, Henderson et al., arxiv https://arxiv.org/pdf/1709.06560.pdf
 
 
* Robust Imitation of Diverse Behaviors, Wang et al. 2017, https://arxiv.org/pdf/1707.02747.pdf
 
* Robust Imitation of Diverse Behaviors, Wang et al. 2017, https://arxiv.org/pdf/1707.02747.pdf
 
* Programmable Agents, Denil et al., https://arxiv.org/pdf/1706.06383v1.pdf
 
* Programmable Agents, Denil et al., https://arxiv.org/pdf/1706.06383v1.pdf

Revision as of 22:18, 24 October 2017

Time: Tuesday 5-6pm. Location: HFH 1132.

If you registered this class, you should contact the instructor to lead the discussion of one paper below. If you don't lead the discussion, you will then need to write a 3-page final report in NIPS 2017 style, comparing any two of the papers below.

  • 09/26:
    • Mahnaz Summer research presentation: Reinforced Pointer-Generator Network for Abstractive Summarization.
    • Xin: FeUdal Networks for Hierarchical Reinforcement Learning, Vezhnevets et al., ICML 2017 https://arxiv.org/pdf/1703.01161.pdf
  • 12/05: No meeting, NIPS conference.
  • 12/12: No meeting, NAACL deadline.

Word Embeddings

Relational Learning and Reasoning

Reinforcement Learning

Generation

Dialog

NLP for Computational Social Science