Predicting Fake Online Reviews: A Comprehensive Study of Supervised and Semi-Supervised Learning Models

Kondragunta Rama Krishnaiah

doi:10.17762/turcomat.v14i03.13997

Published: Jul 18, 2023

Keywords:

data mining, text mining model, fake online reviews, supervised learning, semi-supervised learning

Abstract

Online reviews wield significant influence in today's business and commerce landscape: consumers rely heavily on user reviews when deciding which products to buy online. This reliance has encouraged opportunistic individuals and groups to manipulate product reviews for their own benefit. To combat this problem, this paper introduces several text mining models, both semi-supervised and supervised, for detecting fake online reviews, and compares their effectiveness on a dataset known as the "Gold Standard." The semi-supervised models are expectation maximization-based naive Bayes (EM-NB) and expectation maximization-based support vector machine (EM-SVM); the supervised models are naive Bayes (NB) and support vector machine (SVM). Features are extracted from the reviews with the term frequency-inverse document frequency (TF-IDF) method, and the resulting TF-IDF features are used to train all four models. In simulations, the proposed supervised SVM model outperformed the EM-NB, EM-SVM, and supervised NB models at detecting fake online reviews. This outcome highlights the potential of supervised learning for identifying fraudulent reviews, which in turn bolsters the credibility of online reviews and helps consumers make informed decisions.
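
The abstract does not give implementation details, so the following is only a minimal sketch of how TF-IDF features might feed an EM-based naive Bayes classifier. It assumes a smoothed IDF, a multinomial naive Bayes model with Laplace smoothing, and soft labels on the unlabeled reviews. All function names (tfidf_vectors, fit_nb, em_nb, fake_probability) and the toy reviews are hypothetical; they are not taken from the paper or from the "Gold Standard" dataset.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (idf table, list of {term: tf-idf weight}) for tokenized docs."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Assumed smoothed IDF: log((1 + n) / (1 + df)) + 1
    idf = {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs]
    return idf, vectors


def fit_nb(vectors, resp, vocab, alpha=1.0):
    """M-step: fit multinomial NB from soft labels resp[i] = P(fake | doc i)."""
    prior, cond = {}, {}
    for c in (0, 1):  # 0 = genuine, 1 = fake
        weights = [r if c == 1 else 1.0 - r for r in resp]
        prior[c] = math.log((sum(weights) + alpha) / (len(weights) + 2 * alpha))
        mass = {t: alpha for t in vocab}  # Laplace smoothing
        for vec, w in zip(vectors, weights):
            for t, x in vec.items():
                mass[t] += w * x
        total = sum(mass.values())
        cond[c] = {t: math.log(m / total) for t, m in mass.items()}
    return prior, cond


def predict_proba(model, vec):
    """Return P(fake | vec) under the fitted model."""
    prior, cond = model
    scores = {
        c: prior[c] + sum(x * cond[c][t] for t, x in vec.items() if t in cond[c])
        for c in prior
    }
    top = max(scores.values())  # subtract the max for numerical stability
    z = {c: math.exp(s - top) for c, s in scores.items()}
    return z[1] / (z[0] + z[1])


def em_nb(labeled, labels, unlabeled, iters=10):
    """Train NB on labeled docs, then refine it with EM over unlabeled docs."""
    idf, vecs = tfidf_vectors(labeled + unlabeled)
    lab_vecs, unl_vecs = vecs[:len(labeled)], vecs[len(labeled):]
    hard = [float(y) for y in labels]
    model = fit_nb(lab_vecs, hard, idf)
    for _ in range(iters):
        soft = [predict_proba(model, v) for v in unl_vecs]   # E-step
        model = fit_nb(lab_vecs + unl_vecs, hard + soft, idf)  # M-step
    return idf, model


def fake_probability(idf, model, tokens):
    """Score a new tokenized review; terms outside the vocabulary are ignored."""
    vec = {t: c * idf[t] for t, c in Counter(tokens).items() if t in idf}
    return predict_proba(model, vec)


# Toy data for illustration only (label 1 = fake, 0 = genuine).
labeled = [
    ["amazing", "best", "hotel", "ever", "must", "stay"],
    ["room", "was", "clean", "staff", "helpful"],
]
labels = [1, 0]
unlabeled = [
    ["best", "amazing", "ever", "luxury"],
    ["clean", "room", "quiet", "staff"],
]

idf, model = em_nb(labeled, labels, unlabeled)
print(fake_probability(idf, model, ["amazing", "best", "ever"]))
print(fake_probability(idf, model, ["clean", "staff", "room"]))
```

A supervised NB baseline corresponds to calling fit_nb on the labeled vectors alone, with no EM loop. The EM-SVM and supervised SVM models named in the abstract would replace the naive Bayes steps with an SVM and are not shown here.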

How to Cite

Kondragunta Rama Krishnaiah. (2023). Predicting Fake Online Reviews: A Comprehensive Study of Supervised and Semi-Supervised Learning Models. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 14(03), 392–399. https://doi.org/10.17762/turcomat.v14i03.13997

Issue

Vol. 14 No. 03 (2023)

Section

Research Articles

You are free to:

Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
Adapt — remix, transform, and build upon the material for any purpose, even commercially.
The licensor cannot revoke these freedoms as long as you follow the license terms.

Under the following terms:

Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.

Notices:

You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation.

No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.
