Proceedings of ISP RAS


Sentiment-Based Topic Model for Mining Usability Issues and Failures with User Products

E. Tutubalina (KFU, Kazan)

Abstract

This paper describes an approach to problem phrase extraction from texts that contain user experience with products. During the last decades, consumer products have grown in complexity, and consumer dissatisfaction is increasingly caused by usability problems, in addition to problems with technical failures. Moreover, user reviews from online resources, that describe actual difficulties with products experienced by users, affect on other people's purpose decisions. In this paper, we present two probabilistic graphical models which aim to extract problems with products. We modify Latent Dirichlet Allocation (LDA) to incorporate information about problem phrases with words’ sentiment polarities (negative, neutral or positive). The proposed models learn a distribution over words, associated with topics, both sentiment and problem labels. Topic models were evaluated on reviews of different domains collected from online consumer review platforms. The algorithms achieve a better performance in comparison to several state-of-the-art models in terms of the likelihood of a held-out test and in terms of an accuracy of classification results. Qualitative analysis of the topics discovered using the proposed models indicates the utility of considering sentiment information in users’ critical feedback. Our contribution is that incorporating sentiment and problem information about words with reviews’ topics by the model's asymmetric priors gives an improvement for problem phrase extraction from user reviews published in English and Russian.

Keywords

opinion mining, problem phrases, user reviews, mining defects, topic modeling, LDA, problem phrase extraction

Edition

Proceedings of the Institute for System Programming, vol. 27, issue 4, 2015, pp. 111-128.

ISSN 2220-6426 (Online), ISSN 2079-8156 (Print).

DOI: 10.15514/ISPRAS-2015-27(4)-6

Full text of the paper in pdf (in Russian) Back to the contents of the volume