Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4942989 | Expert Systems with Applications | 2017 | 62 Pages |
Abstract
This paper presents TCharM, a data analytics methodology based on cluster analysis and association rule discovery to gain interesting knowledge from large collections of Twitter data. TCharM explores tweet collections along the three dimensions characterizing tweets (i.e., text content, posting time and place) to support context-aware topic trend analysis. To discover groups of tweets with a good cohesion on the three tweet features, TCharM exploits a novel distance measure (TASTE) which allows driving the clustering task by considering in one step the three tweet features. Association rule analysis is then exploited to concisely describe the cluster content with a set of understandable and significant patterns which reveal underlying correlations among frequent topics, tweeting times and places. TCharM can provide useful information to understand the evolution of people's involvement in different topics, across geographical areas and over time. TCharM find applications in various domains by providing a valuable support in decision making to domain experts. The experimental evaluation performed on real datasets demonstrates the effectiveness of the proposed approach in discovering cohesive clusters and actionable knowledge from Twitter data.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Xin Xiao, Antonio Attanasio, Silvia Chiusano, Tania Cerquitelli,