Software


Scheduled Policy Optimization for Natural Language Communication with Intelligent Agents

We propose a novel scheduled policy optimization mechanism which dynamically schedules demonstration learning and reinforcement learning and addresses the discrepancy between training and inference in sequence decoding (IJCAI-ECAI 2018).
[Code]   [Paper]  

MojiTalk: Generating Emotional Responses at Scale

We exploit the generating emotional language of leveraging Twitter data that are naturally labeled with emojis. We investigate several conditional variational autoencoders training on conversations, which allow us to use emojis to control the emotion of the generated text (ACL 2018).
[Code]   [Paper]  

No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling

We propose an Adversarial REward Learning (AREL) framework to learn an implicit reward function from human demonstrations, and then optimize policy search with the learned reward function (ACL 2018).
[Code]   [Paper]  

Simple Models for Word Formation in English Slang

We propose generative models for three types of extra-grammatical word formation phenomena abounding in English slang: Blends, Clippings, and Reduplicatives (NAACL-HLT 2018).
[Code]   [Paper]  

KBGAN: Adversarial Learning for Knowledge Graph Embeddings

We introduce KBGAN, an adversarial learning framework to improve the performances of a wide range of existing knowledge graph embedding models (NAACL-HLT 2018).
[Code]   [Paper]  

DeepPath: Reinforcement Learning for Knowledge Graph Reasoning

We describe a novel reinforcement learning framework for learning multi-hop relational paths: we use a policy-based agent with continuous states based on knowledge graph embeddings, which reasons in a KG vector space by sampling the most promising relation to extend its path (EMNLP 2017).
[Code]   [Paper]  

Deep Residual Learning for Weakly-Supervised Relation Extraction

We design a novel convolutional neural network (CNN) with residual learning, and investigate its impacts on the task of distantly supervised noisy relation extraction (EMNLP 2017).
[Code]   [Paper]