Article ID Journal Published Year Pages File Type
4948539 Neurocomputing 2016 26 Pages PDF
Abstract
Online social network presents a great opportunity to analyze user behavior and mine the implicit personality traits from the social network data. Considering the personality recognition as a multi-label classification problem, this paper proposes a new probabilistic topic model (PT-LDA model) to predict the personality traits within the framework of Five Factor Model. The proposed model extends the Latent Dirichlet Allocation (LDA) model to integrate the n-gram features into few latent topics and each topic is characterized by not only the multinomial distribution over words but also the Gaussian distributions over personality traits. This paper develops a Gibbs-EM algorithm to solve the proposed model iteratively based on Gibbs sampling and expectation maximization. Quantitative evaluation shows that PT-LDA is more accurate, efficient and robust than several baselines. Our experiment also shows that the proposed model can be used to extract the interpretable topics associated with each personality trait, which provides a new way to uncover user behaviors in online social network.
Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , ,