Predicting Fraudulant Job Ads With Machine Learning
Predicting Fraudulant Job Ads With Machine Learning
Predicting Fraudulant Job Ads With Machine Learning
IJARSCT
International Journal of Advanced Research in Science, Communication and Technology (IJARSCT)
Abstract: Online Recruitment frauds are becoming an important issue in cyber-crime region. Companies
find it easier to hire people with the help of the internet rather than the old traditional way. But it has
greatly attracted scammers. In this project, we have proposed a solution on how to detect ORF. We have
presented our results based on the previous model and the methodologies, to create the ORF detection
model where we have used Jobs_Frauds.csv . We have selected this dataset from Kaggle . Furthermore,
Dummy Classifier, Random Forest Classifier, Support Vector Machine, Gradient Boosting, Naïve Bayes
Classifiers, XG Boost, SGD classifier, Passive Aggressive and KNN are the algorithms that have been used.
We have found the accuracy of different prediction models, where Passive Aggressive (98.12%) and
Gaussian Naïve Bayes (96.72%) give the highest accuracy. Through this project, we tried to create a
precise way for detecting fraudulent hiring posts.
I. INTRODUCTION
Most modern organizations use the web and social media platforms for employee recruitment and job opening
advertisements. Online job posts are easily accessed by interested job-seekers. Unfortunately, this trend creates an
opportunity for criminals to exploit job seekers with fake job offers.
The Job advertisement and recruitment process has been improved by using online advertisements. These ads are being
used by criminals as a means of committing fraud. A system that can automatically identify fake job ads using their
attributes will reduce the chances of job seekers falling a victim to scams. This project aims to develop a machine
learning classifier to flag fake and real job advertisements
III. MOTIVATION
Most modern organisations use the web and social media platforms for employee recruitment and job opening
advertisements.
Online job posts are easily accessed by interested job-seekers, hence its popularity.
Unfortunately, this trend creates an opportunity for criminals to exploit job seekers with fake job offers.
Criminals extract personal information to be used in nefarious activities.
V. METHODOLOGY
Module l - (Ads acquisition)
We are going to use dataset from Kaggle for classification.
Jobs will be taken from external source or online.
Currently we are going to take a dataset around 17880 jobs from an open source dataset platform(Kaggle).
Module 2 - (Preprocessing )
The dataset was made up of features with a lot of null values and some with little or nno null values. A 60%
threshold for null was used to identify the columns that were dropped from the dataset.
Null values were imputed with empty strings.
Use Case:
Sequence Diagram
Proposed System:
Algorithms used:
Dummy Classifier
Random Forest
SVC
Gaussian Naïve Bayes
Passive Aggressive
VI. CONCLUSION
In our research we have compared the performance of different machine learning algorithms like Dummy Classifier,
Random Forest Classifier, Support Vector Machine, Gradient Boosting, Naïve Bayes Classifiers, XG Boost, SGD
classifier, Passive Aggressive and KNN and determined which algorithm works best on our dataset. We did not only
use the most common algorithms but also some latest ones so that we can determine how well do they work. We have
seen that range of accuracy of all our algorithms lie between 96% to 98%. The main purpose for our model is to help
the job seekers identify which jobs are fake and save themselves from fraudsters. Our model can also be used by
different online job recruitment sites to detect fraudulent jobs. We have identified that some features play an important
role in determining whether a job is fake or not. In future, we plan to incorporate these features in our dataset and thus
try to further increase the accuracy range of the algorithms.
REFERENCES
[1]. Ibrahim M. Nasser; Amjad H. Alzaanin; Ashraf Yunis Maghari, Online Recruitment Fraud Detection using
ANN, 2021
[2]. Hridita Tabassum; Gitanjali Ghosh; Afra Atika; Amitabha Chakrabarty, Detecting Online Recruitment Fraud
Using Machine Learning,2021
[3]. Sangeeta Lal; Rishabh Jiaswal; Neetu Sardana; Ayushi Verma; Amanpreet Kaur; Rahul Mourya,
ORFDetector: Ensemble Learning Based Online Recruitment Fraud Detection, 2019
[4]. Asad Mehboob & M. S. I. Malik, Smart Fraud Detection Framework for Job Recruitments, 2020
[5]. Bandar Alghamdi, Fahad Alharby, An Intelligent Model for Online Recruitment Fraud Detection, 2019