• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Modeling and Evaluating Information Diffusion for Spam Detection in Micro-blogging Networks

Vol. 9, No. 8, August 30, 2015
10.3837/tiis.2015.08.014, Download Paper (Free):

Abstract

Spam has become one of the top threats of micro-blogging networks as the representations of rumor spreading, advertisement abusing and malware distribution. With the increasing popularity of micro-blogging, the problems will exacerbate. Prior detection tools are either designed for specific types of spams or not robust enough. Spammers may escape easily from being detected by adjusting their behaviors. In this paper, we present a novel model to quantitatively evaluate information diffusion in micro-blogging networks. Under this model, we found that spam posts differ wildly from the non-spam ones. First, the propagations of non-spam posts mostly result from their followers, but those of spam posts are mainly from strangers. Second, the non-spam posts relatively last longer than the spam posts. Besides, the non-spam posts always get their first reposts/comments much sooner than the spam posts. With the features defined in our model, we propose an RBF-based approach to detect spams. Different from the previous works, in which the features are extracted from individual profiles or contents, the diffusion features are not determined by any single user but the crowd. Thus, our method is more robust because any single user。ッs behavior changes will not affect the effectiveness. Besides, although the spams vary in types and forms, they。ッre propagated in the same way, so our method is effective for all types of spams. With the real data crawled from the leading micro-blogging services of China, we are able to evaluate the effectiveness of our model. The experiment results show that our model can achieve high accuracy both in precision and recall.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
K. Chen, P. Zhu, L. Chen, Y. Xiong, "Modeling and Evaluating Information Diffusion for Spam Detection in Micro-blogging Networks," KSII Transactions on Internet and Information Systems, vol. 9, no. 8, pp. 3005-3027, 2015. DOI: 10.3837/tiis.2015.08.014.

[ACM Style]
Kan Chen, Peidong Zhu, Liang Chen, and Yueshan Xiong. 2015. Modeling and Evaluating Information Diffusion for Spam Detection in Micro-blogging Networks. KSII Transactions on Internet and Information Systems, 9, 8, (2015), 3005-3027. DOI: 10.3837/tiis.2015.08.014.

[BibTeX Style]
@article{tiis:20860, title="Modeling and Evaluating Information Diffusion for Spam Detection in Micro-blogging Networks", author="Kan Chen and Peidong Zhu and Liang Chen and Yueshan Xiong and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2015.08.014}, volume={9}, number={8}, year="2015", month={August}, pages={3005-3027}}