Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

From courses
Jump to: navigation, search
Line 12: Line 12:
 
* Hindsight Experience Replay, Andrychowicz et al, https://arxiv.org/pdf/1707.01495.pdf
 
* Hindsight Experience Replay, Andrychowicz et al, https://arxiv.org/pdf/1707.01495.pdf
 
* Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management, Su et al., SIGDIAL 2017 https://arxiv.org/pdf/1707.00130.pdf
 
* Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management, Su et al., SIGDIAL 2017 https://arxiv.org/pdf/1707.00130.pdf
 +
* Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning, Peng et al., EMNLP 2017. https://arxiv.org/abs/1704.03084

Revision as of 23:40, 1 August 2017