Difference between revisions of "Fall 2017 CS595I Advanced NLP/ML Seminar"

Line 11:
 
* Programmable Agents, Denil et al., https://arxiv.org/pdf/1706.06383v1.pdf
 
* Hindsight Experience Replay, Andrychowicz et al., https://arxiv.org/pdf/1707.01495.pdf
+ * Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management, Su et al., SIGDIAL 2017, https://arxiv.org/pdf/1707.00130.pdf

Revision as of 14:19, 8 July 2017