Faculty Publications

A self-supervised approach to comment spam detection based on content analysis

A. Bhattarai, University of Memphis
D. Dasgupta, University of Memphis

Abstract

This paper studies the problems and threats posed by a type of spam in the blogosphere called blog comment spam. It explores the challenges introduced by comment spam, and much of the analysis generalizes to other kinds of short-text spam. The authors analyze several high-level features of spam and legitimate comments, derived from the content of the blog postings. They use these features to cluster the data, separately for each feature, with the K-Means clustering algorithm. They also apply self-supervised learning, which can classify spam and legitimate comments automatically. Compared with existing solutions, this approach demonstrates more flexibility and adaptability to its environment, as it requires minimal human intervention. A preliminary evaluation of the proposed spam detection system shows promising results. Copyright © 2011, IGI Global.
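The idea of clustering each feature separately and then labeling comments without human input can be illustrated with a minimal sketch. This is not the authors' implementation: the feature names (`link_count`, `post_similarity`), the assumption that each feature has a known "spam-like" direction, and the majority-vote rule are all hypothetical choices made only for illustration, and the abstract does not state which features or combination rule the paper uses.

```python
from statistics import mean


def kmeans_1d(values, iters=20):
    """Two-cluster K-Means on a single feature.

    Returns one cluster index per value: 1 for the cluster with the
    higher centroid, 0 for the cluster with the lower centroid.
    """
    lo, hi = min(values), max(values)  # initial centroids
    for _ in range(iters):
        # Assign each value to its nearest centroid (ties go to the low one).
        assign = [1 if abs(v - hi) < abs(v - lo) else 0 for v in values]
        low = [v for v, a in zip(values, assign) if a == 0]
        high = [v for v, a in zip(values, assign) if a == 1]
        new_lo = mean(low) if low else lo
        new_hi = mean(high) if high else hi
        if (new_lo, new_hi) == (lo, hi):
            break  # centroids stopped moving
        lo, hi = new_lo, new_hi
    return assign


def self_label(comments, high_means_spam):
    """Label comments by voting across per-feature clusterings.

    comments: list of dicts mapping feature name to a number.
    high_means_spam: dict mapping feature name to True if larger values
        are assumed to be more spam-like (an assumption of this sketch).
    Returns "spam", "legitimate", or None (tie) for each comment.
    """
    votes = [0] * len(comments)
    for name, spam_is_high in high_means_spam.items():
        clusters = kmeans_1d([c[name] for c in comments])
        for i, cluster in enumerate(clusters):
            spam_vote = (cluster == 1) == spam_is_high
            votes[i] += 1 if spam_vote else -1
    return ["spam" if v > 0 else "legitimate" if v < 0 else None
            for v in votes]


# Hypothetical toy data: number of links in the comment and a
# similarity score between the comment and the blog post.
comments = [
    {"link_count": 0, "post_similarity": 0.8},
    {"link_count": 1, "post_similarity": 0.7},
    {"link_count": 6, "post_similarity": 0.1},
    {"link_count": 8, "post_similarity": 0.05},
]
orientation = {"link_count": True, "post_similarity": False}
labels = self_label(comments, orientation)
```

In a self-supervised setup, labels produced this way could serve as training data for a conventional classifier, so that no manually labeled comments are needed; whether the paper does exactly this is not stated in the abstract.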

Publication Title

International Journal of Information Security and Privacy

Recommended Citation

Bhattarai, A., & Dasgupta, D. (2011). A self-supervised approach to comment spam detection based on content analysis. International Journal of Information Security and Privacy, 5 (1), 14-32. https://doi.org/10.4018/jisp.2011010102

