Difference between revisions of "Winter 2018 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
(Reinforcement Learning)
Line 41: Line 41:
 
* Reinforcement Learning with Deep Energy-Based Policies  Haarnoja et al, ICML 2017 http://proceedings.mlr.press/v70/haarnoja17a/haarnoja17a.pdf
 
* Reinforcement Learning with Deep Energy-Based Policies  Haarnoja et al, ICML 2017 http://proceedings.mlr.press/v70/haarnoja17a/haarnoja17a.pdf
 
* Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning, Gu et al., NIPS 2017 https://arxiv.org/abs/1706.00387
 
* Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning, Gu et al., NIPS 2017 https://arxiv.org/abs/1706.00387
* Sequence Level Training with Recurrent Neural Networks https://arxiv.org/pdf/1511.06732.pdf
 
 
* Distral: Robust multitask reinforcement learning, Teh et al., NIPS 2017 https://nips.cc/Conferences/2017/Schedule?showEvent=9227
 
* Distral: Robust multitask reinforcement learning, Teh et al., NIPS 2017 https://nips.cc/Conferences/2017/Schedule?showEvent=9227
 
* Repeated Inverse Reinforcement Learning, Amin et al., NIPS 2017 https://nips.cc/Conferences/2017/Schedule?showEvent=10107
 
* Repeated Inverse Reinforcement Learning, Amin et al., NIPS 2017 https://nips.cc/Conferences/2017/Schedule?showEvent=10107

Revision as of 15:47, 23 January 2018

Time: Monday 5-6pm, starting 01/22. Location: HFH 1132.

If you registered this class, you should contact the instructor to present one paper *and* be the discussant of one paper below.

  • Presenter: prepare a short summary of no more than 15 mins of presentation.
  • Discussant: by presenting a paper in one session, you automatically become the discussant of the other paper. Please prepare two questions for discussion about the paper.

If you don't present or lead the discussion, you will then need to write a 2-page final report in ICML 2018 style, comparing any two of the papers below. Due: TBD to william@cs.ucsb.edu.

  • 02/12
  • 02/19
  • 02/26
  • 03/05
  • 03/12

Reinforcement Learning

Generation

Dialog

Learning

NLP for Computational Social Science