Cost-effective online trending topic detection and popularity prediction in microblogging

Abstract

Identifying topic trends on microblogging services such as Twitter and estimating those topics’ future popularity have great academic and business value, especially when the operations can be done in real time. For any third party, however, capturing and processing such huge volumes of real-time data in microblogs are almost infeasible tasks, as there always exist API (Application Program Interface) request limits, monitoring and computing budgets, as well as timeliness requirements. To deal with these challenges, we propose a cost-effective system framework with algorithms that can automatically select a subset of representative users in microblogging networks in offline, under given cost constraints. Then the proposed system can online monitor and utilize only these selected users’ real-time microposts to detect the overall trending topics and predict their future popularity among the whole microblogging network. Therefore, our proposed system framework is practical for real-time usage as it avoids the high cost in capturing and processing full real-time data, while not compromising detection and prediction performance under given cost constraints. Experiments with real microblogs dataset show that by tracking only 500 users out of 0.6 million users and processing no more than 30,000 microposts daily, about 92% trending topics could be detected and predicted by the proposed system and, on average, more than 10 hours earlier than they appear in official trends lists.

Publication DOI: https://doi.org/10.1145/3001833
Divisions: College of Engineering & Physical Sciences > Adaptive communications networks research group
Uncontrolled Keywords: cost,microblogging,prediction,topic detection,Information Systems,General Business,Management and Accounting,Computer Science Applications
Publication ISSN: 1558-2868
Last Modified: 30 Oct 2024 08:12
Date Deposited: 17 Jan 2017 13:05
Full Text Link:
Related URLs: http://www.scop ... tnerID=8YFLogxK (Scopus URL)
PURE Output Type: Article
Published Date: 2017-06-09
Published Online Date: 2017-01-04
Accepted Date: 2016-09-21
Authors: Miao, Zhongchen
Chen, Kai
Fang, Yi
He, Jianhua (ORCID Profile 0000-0002-5738-8507)
Zhou, Yi
Zhang, Wenjun
Zha, Hongyuan

Download

[img]

Version: Accepted Version


Export / Share Citation


Statistics

Additional statistics for this record