Fraudulent review detection model focusing on emotional expressions and explicit aspects: investigating the potential of feature engineering
Journal article   Peer reviewed

Ajay Kumar, Ram D. Gopal, Ravi Shankar and Kim Hua Tan
Decision Support Systems
01/04/2022

Abstract

Reading customer reviews before purchasing items online has become common practice; however, some companies use machine learning (ML) algorithms to generate false reviews in order to create positive brand images of their own products and negative images of competitors' offerings. Existing techniques use review content to identify fraudulent reviewers; however, spammers have become more intelligent, learning from their mistakes and changing their tactics in order to evade detection. Thus, investigating how fraudulent accounts generate fake positive reviews for themselves or fake negative reviews for competitors, and establishing the need for ML classifiers to identify fraudulent reviews, is more important than ever. In this research, we present a novel feature engineering approach in which we (1) extract several “review-centric” and “reviewer-centric” features from a dataset; (2) combine the cumulative effects of the feature distributions into a unified model that represents the overall behavior of fraudulent reviewers; (3) investigate the role of effective data pre-processing in improving detection accuracy; and (4) develop a probabilistic approach to detect fraudulent reviewers by learning a novel M-SMOTE model over a derived balanced dataset and feature distributions, which outperforms other ML models. Through these novel findings, our study contributes to the literature on digital platforms and fraudulent review detection, with significant managerial and theoretical implications.

Keywords: Online reviews; Digital platforms; Review manipulation; Machine learning; Opinion spamming; Feature engineering
pdf
DSS_Kumar_202204
Restricted Access
url
https://doi.org/10.1016/j.dss.2021.113728
Published (Version of record) Open

Details

InCites Highlights

These are selected metrics from InCites Benchmarking & Analytics tool, related to this contribution

Collaboration types
Domestic collaboration
International collaboration
Citation topics
4 Electrical Engineering, Electronics & Computer Science
4.48 Knowledge Engineering & Representation
4.48.672 Natural Language Processing
Web of Science research areas
Computer Science, Artificial Intelligence
Computer Science, Information Systems
Operations Research & Management Science