Final Project

ADA 442 Statistical Learning | Classification

Author: Dr. Hakan Emekci

Created: 13 Nov 2023
Version: v23.01.01
Final Project Assignment
Objective
The objective of this project is to build a machine learning model that predicts
whether a client of a bank will subscribe to a term deposit. The dataset
used for this project is the Bank Marketing Data Set, which can be found
at https://archive.ics.uci.edu/ml/datasets/Bank+Marketing. Use the “bank-additional.csv”
file, which contains 10% of the examples (4119), randomly selected from the full
dataset, and 20 inputs.
The data is related to the direct marketing campaigns of a Portuguese banking
institution, which were based on phone calls. The classification goal is to predict
whether the client will subscribe to a term deposit (variable y). Often, more than
one contact with the same client was required in order to assess whether the
product (bank term deposit) would be subscribed (‘yes’) or not (‘no’).
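As a starting point for loading the data, here is a minimal sketch. It assumes the UCI file is semicolon-separated and that missing categorical values are coded as the string "unknown", as the dataset description states; check both against your own copy. The function name `load_bank_data` and the three sample rows are hypothetical and only illustrate the call.

```python
import io

import pandas as pd


def load_bank_data(source):
    """Read the bank marketing CSV and return (features, binary target)."""
    # Assumption: fields are separated by ';' rather than ','.
    df = pd.read_csv(source, sep=";")
    # Map the 'yes'/'no' target column y to 1/0.
    y = df["y"].map({"yes": 1, "no": 0})
    X = df.drop(columns=["y"])
    return X, y


# Tiny in-memory stand-in with made-up rows (not taken from the real file).
sample = io.StringIO(
    'age;job;duration;y\n'
    '30;"blue-collar";487;"no"\n'
    '39;"services";346;"no"\n'
    '45;"unknown";120;"yes"\n'
)
X, y = load_bank_data(sample)

# Count the placeholder used for missing categories in one column.
unknown_jobs = int((X["job"] == "unknown").sum())
print(X.shape, y.tolist(), unknown_jobs)
```

With the real file, the call would be `load_bank_data("bank-additional.csv")`, assuming the file sits in the working directory.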

Source:
[Moro et al., 2014] S. Moro, P. Cortez and P. Rita. A Data-Driven Approach to
Predict the Success of Bank Telemarketing. Decision Support Systems, Elsevier,
62:22-31, June 2014

Requirements
Your project should meet the following requirements:
1. Data cleaning: Perform necessary data cleaning operations to make sure
the data is in a suitable format for analysis.
2. Data preprocessing: Perform necessary data preprocessing operations such
as feature scaling, encoding categorical variables, etc.
3. Feature selection: Use feature selection techniques to select the most
relevant features for the model.
4. Model selection: Compare the performance of at least three different
models (e.g., logistic regression, random forest, neural network) and choose
the best one based on evaluation metrics.
5. Hyperparameter tuning: Tune the hyperparameters of the selected model
to improve its performance.
6. Evaluation: Evaluate the performance of the final model using appropriate
evaluation metrics.
7. Deployment: Deploy the final model using streamlit and create a web
interface for the model.
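Requirements 2 to 6 can be chained in scikit-learn roughly as follows. This is a sketch under stated assumptions, not a reference solution: the data is synthetic, the column names are stand-ins, the three candidate models and their parameter grids are arbitrary choices, and F1 is used as the comparison metric only as an example.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report
from sklearn.model_selection import GridSearchCV, cross_val_score, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Synthetic stand-in data (made-up values, not the real dataset).
rng = np.random.default_rng(0)
n = 400
X = pd.DataFrame({
    "age": rng.integers(18, 80, n),
    "duration": rng.integers(0, 1000, n),
    "job": rng.choice(["admin.", "services", "technician"], n),
})
# Target loosely tied to one feature so the models have something to learn.
y = (X["duration"] + rng.normal(0, 200, n) > 600).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Preprocessing: scale numeric columns, one-hot encode categorical ones.
preprocess = ColumnTransformer([
    ("num", StandardScaler(), ["age", "duration"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["job"]),
])


def build_pipeline(model):
    """Preprocessing, univariate feature selection, then the classifier."""
    return Pipeline([
        ("prep", preprocess),
        ("select", SelectKBest(f_classif, k=3)),
        ("model", model),
    ])


# Model selection: compare three candidates with 5-fold cross-validation.
candidates = {
    "logreg": LogisticRegression(max_iter=1000),
    "forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "knn": KNeighborsClassifier(),
}
scores = {
    name: cross_val_score(
        build_pipeline(model), X_train, y_train, cv=5, scoring="f1"
    ).mean()
    for name, model in candidates.items()
}
best_name = max(scores, key=scores.get)

# Hyperparameter tuning: small illustrative grid for the winning model.
param_grids = {
    "logreg": {"model__C": [0.1, 1.0, 10.0]},
    "forest": {"model__max_depth": [3, None]},
    "knn": {"model__n_neighbors": [3, 5, 11]},
}
search = GridSearchCV(
    build_pipeline(candidates[best_name]), param_grids[best_name], cv=5, scoring="f1"
)
search.fit(X_train, y_train)

# Evaluation: report precision, recall and F1 on the held-out split.
test_predictions = search.predict(X_test)
print(best_name, search.best_params_)
print(classification_report(y_test, test_predictions))
```

Keeping preprocessing and feature selection inside the pipeline means they are refitted on each training fold, so the cross-validation scores are not inflated by information from the validation fold.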

Grading
The project will be graded based on the following criteria:
• Data Cleaning (10%): The dataset should be thoroughly cleaned, and any
data quality issues should be addressed appropriately.
• Data Preprocessing (10%): The categorical variables should be appropriately
encoded, and numerical variables should be scaled if necessary.
• Feature Engineering (10%): New features should be created where appropriate.
• Model Selection (20%): Several models should be trained, and the
best-performing model should be selected based on appropriate metrics.
Students should evaluate the performance of their model using appropriate
metrics and compare it with other models. The selected model’s
hyperparameters should be tuned using appropriate techniques.
• Creating Pipeline (20%): Students should create a pipeline that covers the
entire process.
• Deployment (30%): The selected model should be deployed using the
Streamlit framework, and the deployed model should be usable by end users.
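For the deployment step, one common approach is to save the whole fitted pipeline to disk and have the app reload it and predict on a single-row DataFrame built from the user's input. The sketch below shows that round trip with joblib. The training rows, column names, and the file name `model.joblib` are made up for illustration, and the Streamlit interface itself is left out.

```python
import os
import tempfile

import joblib
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Made-up training rows standing in for the cleaned dataset.
train = pd.DataFrame({
    "age": [25, 40, 58, 33, 47, 61],
    "job": ["services", "admin.", "retired", "services", "admin.", "retired"],
    "y": [0, 0, 1, 0, 1, 1],
})
features = ["age", "job"]

pipeline = Pipeline([
    ("prep", ColumnTransformer([
        ("num", StandardScaler(), ["age"]),
        ("cat", OneHotEncoder(handle_unknown="ignore"), ["job"]),
    ])),
    ("model", LogisticRegression()),
])
pipeline.fit(train[features], train["y"])

# Save preprocessing and model together so the app needs only one file.
model_path = os.path.join(tempfile.mkdtemp(), "model.joblib")
joblib.dump(pipeline, model_path)

# What the app would do: load the file, build one row from the form, predict.
loaded = joblib.load(model_path)
new_client = pd.DataFrame([{"age": 52, "job": "retired"}])
label = int(loaded.predict(new_client)[0])
probability = float(loaded.predict_proba(new_client)[0, 1])
print(label, round(probability, 3))
```

In a Streamlit script, the values in `new_client` would come from widgets such as `st.number_input` and `st.selectbox`, and the `joblib.load` call could be wrapped in a function decorated with `st.cache_resource` so the file is read only once per session.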

Submissions
Submit a Jupyter Notebook containing the code for the project. Make sure
to include sufficient documentation and comments in your code. Also provide
instructions on how to run and interact with the deployed model in your
report file. Each student should submit three files zipped
in a single file as “Group_0XX.zip” on the LMS:
• Jupyter Notebook (project.ipynb)
• PowerPoint Presentation (presentation.ppt) (Max 5 slides)
• A report (report.pdf) summarizing your findings and recommendations
(Max 2 pages). Give the Streamlit cloud address of your project.
The deadline for submission is 24 December 2023 at 11:59 PM.

Instructions
• The project is open-book and open-internet. You are free to use any
resources available to you, but you are not allowed to collaborate with
other groups or to copy from other sources.
• This is a group project.
• Each student has to implement the required steps and come up with an
optimal solution.
• You are allowed to use any Python libraries for data analysis and modeling.
• You have to provide brief explanations of each step in the notebook and
presentation.
• The notebook should be well documented and easy to follow.
• Your performance will be evaluated based on the grading criteria mentioned
above.
• Academic honesty is expected from each student. Any act of plagiarism or
cheating will not be tolerated and will be reported to the university.
• All submissions must be made on or before the specified deadline. Late
submissions will not be accepted.
• Each group has to prepare a short presentation for their project. Selected
projects will be discussed in class as a demo.

Academic Honesty
• This is a group project. Each group is expected to work independently,
and each member of the group is expected to contribute equally to the project.
• Collaboration between groups is not allowed, and any instance of academic
dishonesty will be reported to the university authorities.
• You may use online resources and libraries, but you must cite them properly
in your code and presentation.
• Your code and presentation must be original and free of plagiarism.

Deadline
The deadline for submission is 24.12.2023 at 11:59 PM. Late submissions will
not be accepted.

Resources
You may find the following resources helpful:
• Our Lecture notes and notebooks on Teams
• [Pandas documentation] (https://pandas.pydata.org/docs/)
• [Scikit-learn documentation] (https://scikit-learn.org/stable/documentation.html)
• [Seaborn documentation] (https://seaborn.pydata.org/documentation.html)
• [Matplotlib documentation] (https://matplotlib.org/stable/contents.html)
• [Streamlit documentation] (https://streamlit.io/documentation)
• [Hyperparameter tuning with GridSearchCV] (https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.GridSearchCV.html)
• [Understanding Precision-Recall in Scikit-Learn] (https://scikit-learn.org/stable/auto_examples/model_selection/plot_precision_recall.html)
• [Data Cleaning and Preprocessing in Python] (https://towardsdatascience.com/data-cleaning-and-preprocessing-techniques-for-your-machine-learning-project-ec50b8b7996b)
• [Machine Learning Pipeline: What is it and how to build one] (https://towardsdatascience.com/machine-learning-pipeline-what-is-it-and-how-to-build-one-7fddc3413e1d)
• [How to Deploy Machine Learning Models with Streamlit] (https://towardsdatascience.com/how-to-deploy-machine-learning-models-with-streamlit-379493145b58)
• [Machine Learning Project Checklist] (https://www.kdnuggets.com/2018/05/general-ml-advice-project-checklist.html)
