Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
 
Line 43: Line 43:
  
 
*11/28:
 
*11/28:
 +
** Austin: Deep Reinforcement Learning that Matters, Henderson et al., arxiv https://arxiv.org/pdf/1709.06560.pdf
 
** Adam: Generalization in Deep Learning, https://arxiv.org/pdf/1710.05468.pdf  
 
** Adam: Generalization in Deep Learning, https://arxiv.org/pdf/1710.05468.pdf  
  
Line 56: Line 57:
  
 
===Reinforcement Learning===
 
===Reinforcement Learning===
* Deep Reinforcement Learning that Matters, Henderson et al., arxiv https://arxiv.org/pdf/1709.06560.pdf
 
 
* Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
 
* Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
 
* Robust Imitation of Diverse Behaviors, Wang et al. 2017, https://arxiv.org/pdf/1707.02747.pdf
 
* Robust Imitation of Diverse Behaviors, Wang et al. 2017, https://arxiv.org/pdf/1707.02747.pdf

Latest revision as of 18:00, 28 November 2017

Time: Tuesday 5-6pm. Location: HFH 1132.

If you registered this class, you should contact the instructor to lead the discussion of one paper below. If you don't lead the discussion, you will then need to write a 3-page final report in NIPS 2017 style, comparing any two of the papers below. Due: 12/18, 23:59pm PT to william@cs.ucsb.edu.

  • 09/26:
    • Mahnaz Summer research presentation: Reinforced Pointer-Generator Network for Abstractive Summarization.
    • Xin: FeUdal Networks for Hierarchical Reinforcement Learning, Vezhnevets et al., ICML 2017 https://arxiv.org/pdf/1703.01161.pdf
  • 12/05: No meeting, NIPS conference.
  • 12/12: No meeting, NAACL deadline.

Word Embeddings

Relational Learning and Reasoning

Reinforcement Learning

Generation

Dialog

NLP for Computational Social Science