Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
7379697 | Physica A: Statistical Mechanics and its Applications | 2014 | 9 Pages |
Abstract
With the rapid development of the Internet and the promotion of mobile Internet, microblogs have become a major source and route of transmission for public opinion, including burst topics that are caused by emergencies. To facilitate real time mining of a large range of burst topics, in this paper, we proposed a method to discover burst topics in real time and trace their trends based on the variation trends of word frequencies. First, for the variation trend of the words in microblogs, we adopt a non-homogeneous Poisson process model to fit the data. To represent the heat and trend of the words, we introduce heat degree factor and trend degree factor and realise the real time discovery and trend tracing of the burst topics based on these two factors. Second, to improve the computing performance, this paper was based on the Storm stream computing framework for real time computing. Finally, the experimental results indicate that by adjusting the observation window size and trend degree threshold, topics with different cycles and different burst strengths can be discovered.
Related Topics
Physical Sciences and Engineering
Mathematics
Mathematical Physics
Authors
Shihang Huang, Ying Liu, Depeng Dang,