Predicting Fraudulent Job Ads With Machine Learning

ISSN (Online) 2581-9429
IJARSCT
International Journal of Advanced Research in Science, Communication and Technology (IJARSCT)
Volume 3, Issue 2, April 2023

Impact Factor: 7.301
Hitesh Ahire1, Aashish Kumar Singh2, Arpit Bhorkar3, Shrushti Daware4, Prof. P. A. Deole5
Students, Department of Computer Engineering1,2,3,4
Assistant Professor, Department of Computer Engineering5
Smt. Kashibai Navale College of Engineering, Pune, Maharashtra, India
Abstract: Online recruitment fraud (ORF) is becoming an important issue in the domain of cybercrime. Companies
find it easier to hire people over the internet than through traditional channels, but this convenience has
also attracted scammers. In this project, we propose a solution for detecting ORF. We present our results,
building on previous models and methodologies, for an ORF detection model trained on the Jobs_Frauds.csv
dataset, which we selected from Kaggle. The algorithms used are Dummy Classifier, Random Forest Classifier,
Support Vector Machine, Gradient Boosting, Naïve Bayes classifiers, XGBoost, SGD Classifier, Passive
Aggressive and KNN. We measured the accuracy of the different prediction models and found that Passive
Aggressive (98.12%) and Gaussian Naïve Bayes (96.72%) give the highest accuracy. Through this project, we
have tried to create an accurate way of detecting fraudulent job posts.
Keywords: Passive Aggressive, Naïve Bayes, SVM, Job Ads.
I. INTRODUCTION
Most modern organizations use the web and social media platforms for employee recruitment and job opening
advertisements. Online job posts are easily accessed by interested job-seekers. Unfortunately, this trend creates an
opportunity for criminals to exploit job seekers with fake job offers.
The job advertisement and recruitment process has been improved by online advertising, but criminals also use
these ads as a means of committing fraud. A system that can automatically identify fake job ads from their
attributes will reduce the chances of job seekers falling victim to scams. This project aims to develop a
machine learning classifier that distinguishes fake job advertisements from real ones.
II. PROBLEM STATEMENT

To help job seekers avoid fraudulent job posts on the internet, this paper proposes an automated tool based on
machine learning classification techniques. Different classifiers are used to check posts on the web for fraud,
and their results are compared to identify the best employment scam detection model. The tool helps detect fake
job posts among an enormous number of posts. Two major types of classifiers, gradient boosting and naïve Bayes,
are considered for fraudulent job post detection. Experimental results indicate that naïve Bayes and gradient
boosting classifiers detect scams better than the other classifiers.
III. MOTIVATION
• Most modern organisations use the web and social media platforms for employee recruitment and job opening
advertisements.
• Online job posts are easily accessed by interested job seekers, hence their popularity.
• Unfortunately, this trend creates an opportunity for criminals to exploit job seekers with fake job offers.
• Criminals extract personal information to be used in nefarious activities.
Copyright to IJARSCT DOI: 10.48175/IJARSCT-9123 238

www.ijarsct.co.in
IV. SYSTEM REQUIREMENTS

4.1 Hardware Requirements
• Processor: Intel i3 or better
• Disk space: at least 3 GB for the Python IDE
• RAM: 4 GB or more
4.2 Software Requirements
• Python IDE
• Streamlit (online tool for deployment)
V. METHODOLOGY
Module 1 - (Ads acquisition)
• We use a dataset from Kaggle for classification.
• Job ads are taken from an external or online source.
• Currently, we use a dataset of around 17,880 job ads from an open dataset platform (Kaggle).
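The acquisition step can be sketched roughly as follows, assuming the Kaggle file is a plain CSV read with pandas. The label column name `fraudulent`, the helper `load_job_ads` and the small in-memory sample are hypothetical stand-ins. With the real data, the argument would be the path to Jobs_Frauds.csv.

```python
import io

import pandas as pd


def load_job_ads(source) -> pd.DataFrame:
    """Read job ads from a CSV path or a file-like object."""
    return pd.read_csv(source)


# Hypothetical three-column sample standing in for Jobs_Frauds.csv.
sample_csv = io.StringIO(
    "title,description,fraudulent\n"
    "Data Analyst,Analyse sales data for a retail team,0\n"
    "Home Typist,Earn money fast with no interview,1\n"
    "Nurse,Full-time position at a city hospital,0\n"
    "Office Clerk,General office duties,0\n"
)

ads = load_job_ads(sample_csv)
print(ads.shape)                         # number of rows and columns loaded
print(ads["fraudulent"].value_counts())  # count of real (0) and fake (1) ads
```

Checking the class counts right after loading is useful because fraud datasets are usually dominated by genuine ads.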
Module 2 - (Preprocessing)
• The dataset contained some features with many null values and others with few or none. A 60% null
threshold was used to identify the columns to drop from the dataset.
• Remaining null values were imputed with empty strings.
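The two preprocessing steps above might look like the following minimal sketch in pandas. It assumes that a column is dropped when more than 60% of its values are null (the text does not say whether the bound is inclusive) and that the remaining nulls sit in text columns. The function name `preprocess` and the toy column names are illustrative only.

```python
import pandas as pd


def preprocess(df: pd.DataFrame, null_threshold: float = 0.60) -> pd.DataFrame:
    """Drop columns with too many nulls, then fill remaining nulls with ''."""
    null_share = df.isna().mean()  # fraction of missing values per column
    # Assumption: a column is dropped when MORE than 60% of its values are null.
    keep = null_share[null_share <= null_threshold].index
    return df.loc[:, keep].fillna("")


# Hypothetical toy frame; the column names are illustrative only.
toy = pd.DataFrame(
    {
        "title": ["Analyst", "Typist", "Nurse", "Clerk", "Agent"],
        "description": ["Analyse data", None, "Hospital role", "Office duties", "Sales"],
        "salary_range": [None, None, None, "1000-2000", None],
        "fraudulent": [0, 1, 0, 0, 1],
    }
)

cleaned = preprocess(toy)
print(list(cleaned.columns))
```

In the toy frame, `salary_range` is 80% null and is dropped, while the single missing `description` is replaced by an empty string.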
[Figure: Use case diagram]
[Figure: Sequence diagram]
Proposed System:
Algorithms used:
• Dummy Classifier
• Random Forest
• Support Vector Classifier (SVC)
• Gaussian Naïve Bayes
• Passive Aggressive
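A comparison of the listed algorithms could be set up along the lines below with scikit-learn. This is only a sketch under stated assumptions: the paper does not describe its feature extraction, so TF-IDF over the ad text is an assumption, the toy ads and labels are invented, and the hyperparameters are library defaults rather than the authors' settings.

```python
from sklearn.dummy import DummyClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import PassiveAggressiveClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC

# Invented toy ads standing in for the Kaggle data (0 = real, 1 = fake).
real_ads = [
    f"{role} position with interview, office location and salary details"
    for role in ("engineer", "teacher", "nurse", "analyst", "designer", "accountant")
] * 2
fake_ads = [
    f"earn money fast as {role}, no interview, send bank details now"
    for role in ("assistant", "typist", "agent", "clerk")
] * 2
texts = real_ads + fake_ads
labels = [0] * len(real_ads) + [1] * len(fake_ads)

train_texts, test_texts, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, stratify=labels, random_state=0
)

# Assumption: TF-IDF features; dense arrays because GaussianNB needs them.
vectorizer = TfidfVectorizer()
X_train = vectorizer.fit_transform(train_texts).toarray()
X_test = vectorizer.transform(test_texts).toarray()

models = {
    "Dummy Classifier": DummyClassifier(strategy="most_frequent"),
    "Random Forest": RandomForestClassifier(random_state=0),
    "SVC": SVC(),
    "Gaussian Naive Bayes": GaussianNB(),
    "Passive Aggressive": PassiveAggressiveClassifier(random_state=0),
}

# Fit each model on the training split and score it on the held-out split.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = accuracy_score(y_test, model.predict(X_test))

for name, accuracy in scores.items():
    print(f"{name}: {accuracy:.2%}")
```

The Dummy Classifier always predicts the majority class, so its accuracy serves as the baseline that the other models need to beat.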
VI. CONCLUSION
In our research, we compared the performance of different machine learning algorithms, namely Dummy Classifier,
Random Forest Classifier, Support Vector Machine, Gradient Boosting, Naïve Bayes classifiers, XGBoost, SGD
Classifier, Passive Aggressive and KNN, and determined which algorithm works best on our dataset. We used not
only the most common algorithms but also some more recent ones, to see how well they perform. The accuracy of
all our algorithms lies between 96% and 98%. The main purpose of our model is to help job seekers identify fake
jobs and protect themselves from fraudsters. Our model can also be used by online job recruitment sites to
detect fraudulent job posts. We have identified that some features play an important role in determining
whether a job is fake or not. In future, we plan to incorporate these features into our dataset and thereby try
to further increase the accuracy of the algorithms.