Transferable Recursive Neural Networks for Cross-Domain Aspect and Opinion Extraction
Abstract
In fine-grained opinion mining, extracting aspect terms (a.k.a. opinion targets) and opinion terms (a.k.a. opinion expressions) from user-generated texts is the most fundamental task in order to generate structured opinion summarization. Existing studies have shown that the syntactic relations between aspect and opinion words play an important role for aspect and opinion terms extraction. However, most of the works either relied on pre-defined rules or separated relation mining with feature learning. Moreover, these works only focused on single-domain extraction which failed to adapt well to other domains of interest, where only unlabeled data is available. In real-world scenarios, annotated resources are extremely scarce for many domains, motivating knowledge transfer strategies from labeled source domain(s) to any unlabeled target domain. We observe that syntactic relations among target words are not only crucial for single-domain extraction, but also serve as invariant ``pivot'' information to bridge the gap between different domains. In this paper, we construct transferable recursive neural networks based on the dependency tree of each sentence to automatically learn the fine-grained interactions among aspect and opinion words which are invariant across domains. The transferability is built on an auxiliary task and a conditional domain adversarial network to reduce domain shift effectively in word level through syntactic relations. Specifically, the auxiliary task builds structural correspondences across domains by predicting the dependency relation for each path of the dependency tree in the recursive neural network. The conditional domain adversarial network helps to learn domain-invariant hidden representation for each word conditioned on the syntactic structure. In the end, we integrate the recursive neural network with a sequence labeling classifier on top that models contextual influence in the final predictions. Extensive experiments and analysis are conducted to demonstrate the effect of the proposed model and each component on 3 benchmark datasets.