Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
Line 13: Line 13:
 
**Vivek: The strange geometry of skip-gram with negative sampling, David Mimno and Laure Thompson, http://aclweb.org/anthology/D17-1307
 
**Vivek: The strange geometry of skip-gram with negative sampling, David Mimno and Laure Thompson, http://aclweb.org/anthology/D17-1307
 
**Yijun: Latent Intention Dialogue Models, Wen et al., ICML 2017  https://arxiv.org/pdf/1705.10229.pdf
 
**Yijun: Latent Intention Dialogue Models, Wen et al., ICML 2017  https://arxiv.org/pdf/1705.10229.pdf
 +
 +
*10/10:
 +
** Generating Sentences by Editing Prototypes, Guu et al., arxiv https://arxiv.org/abs/1709.08878
 +
** Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
  
 
===Word Embeddings===
 
===Word Embeddings===
Line 34: Line 38:
 
* Reinforcement Learning with Deep Energy-Based Policies  Haarnoja et al, ICML 2017 http://proceedings.mlr.press/v70/haarnoja17a/haarnoja17a.pdf
 
* Reinforcement Learning with Deep Energy-Based Policies  Haarnoja et al, ICML 2017 http://proceedings.mlr.press/v70/haarnoja17a/haarnoja17a.pdf
 
* Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, Kulkarni et al., 2016, https://arxiv.org/pdf/1604.06057.pdf
 
* Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation, Kulkarni et al., 2016, https://arxiv.org/pdf/1604.06057.pdf
* Modular Multitask Reinforcement Learning with Policy Sketches, Andreas et al., ICML 2017 https://arxiv.org/pdf/1611.01796.pdf
 
 
* Device Placement Optimization with Reinforcement Learning, Azalia Mirhoseini et al. https://arxiv.org/pdf/1706.04972.pdf
 
* Device Placement Optimization with Reinforcement Learning, Azalia Mirhoseini et al. https://arxiv.org/pdf/1706.04972.pdf
 
* Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning, Gu et al., NIPS 2017 https://arxiv.org/abs/1706.00387
 
* Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning, Gu et al., NIPS 2017 https://arxiv.org/abs/1706.00387

Revision as of 21:31, 2 October 2017

Time: Tuesday 5-6pm. Location: HFH 1132.

If you registered this class, you should contact the instructor to lead the discussion of one paper below. If you don't lead the discussion, you will then need to write a 3-page final report in NIPS 2017 style, comparing any two of the papers below.

  • 09/26:
    • Mahnaz Summer research presentation: Reinforced Pointer-Generator Network for Abstractive Summarization.
    • Xin: FeUdal Networks for Hierarchical Reinforcement Learning, Vezhnevets et al., ICML 2017 https://arxiv.org/pdf/1703.01161.pdf

Word Embeddings

Relational Learning and Reasoning

Reinforcement Learning

Learning (General)

Generation

Dialog

NLP for Computational Social Science