Project Description1
Project Description1
Project Description1
Project
100%
Instructions:
This project provides students with the opportunity to learn and apply the following skills;
• Identify real world problem that has data mining and warehousing solution.
• Perform appropriate data mining process to solve problem.
• Illustrate appropriate data warehouse design for the problem.
• Apply data mining software and toolkits to solve the problem.
2. Assessment
CLO 4: Perform data mining process using data mining software and toolkits in a range of
applications. (P4-PLO3)
3. Group Formation
Group yourself into a team of 5 persons per team for this project.
Every team member is expected to contribute and participate actively in the entire process of
completing the project. Sharing of ideas and assistance in the completion of project among
members is required.
Students are required to identify a real-world problem that need solution from data mining
and data warehousing. Students are strictly not allowed to use existing project or work from
any resources to avoid plagiarism.
Assume that your team has been commissioned to initiate a project with the objectives to
determine the existing real-world problem and propose solution on solving the problem by
performing data mining process. You are required to use R software to perform the data
mining process which including the data preparation, analysis or modelling, and evaluation.
At the end, you are required to produce a report which consists of the following sections:
a) Introduction
i. Choose and describe ONE existing real-world problem which can be solved by
using data mining task that is Classification, Clustering, or Outlier Detection. Your
description should also include the background and the importance of solving the
problem.
ii. Describe the proposed data mining method used to solve the problem.
b) Literature Review
i. Review the existing most recent data mining research works which are related
to solving your chosen problem.
ii. Based on your review, choose and describe TWO data mining methods that are
best to solve the problem.
c) Data Preparation
i. Based on the identified problem, use appropriate ONE data set and data pre-
processing methods for preparing the chosen data set for analysis. You are
encouraged to collect the real data sets by your own using questionnaire.
ii. Perform the data preparation process by using R software.
iii. Describe the data set, the data preparation method, and the preparation
process.
d) Data Mining
i. Use the chosen data mining method for exploring, analyzing, and extracting
important information from the prepared data set.
ii. Perform the data mining process based on the chosen method by using R
software.
iii. Describe the data mining method, the resulting data mining model, and any
important information obtained from the mining process.
e) Evaluation
i. You are required to fine tune the parameter setting of the data mining method
in order to achieve high quality of model.
ii. Perform appropriate evaluation on the model resulting from the data mining
process by using R.
iii. Discuss the evaluation on the quality of the model.
f) Data Warehouse
i. Construct a data warehouse that is best to improve the data mining process.
g) Conclusion
i. Provide the summary of the important findings obtained from the project.
The final project report for all Parts should contain the following items:
(a) Cover sheet (Appendix - FORM 1)
(b) Table of Contents
(c) Body of answers
(d) Reference section (Students are required to use Harvard Referencing System format)
(e) Appendices: Plagiarism report
The report must be type-written using MS-Word. You are recommended to format your
report according to the following specification:
Font Style Use Times New Roman for body text. Main headings and sub-
headings should be clearly stated using suitable font styles (e.g.
Arial).
Headers and Appropriate footers and headers should be used to enhance clarity
Footers and presentation.
Page Numbering Ensure that all pages (except cover page) are numbered.
Binding 2-hole plastic binder clip. Use only one side of the paper.
6. Submission Deadline
Project Part 1 report presentation (Item a and b): Week 9 25 May 2023 (Thursday)
Final project report (Part 1 and 2) deadline: Week 14 6 July 2023 (Thursday)
7. Late Submission
In certain circumstances, a student may be allowed to submit the project report late with valid
reason. S/he must inform the lecturer at least one week before the project is due. The
lecturer will evaluate whether the circumstance warrants submitting the project report late,
but no guarantee that the students will not be penalized.
As a general rule, no extension of time will be granted. The project description and its due
dates are normally disclosed in advance to students in order that they will be able to manage
their time according to different course study progress and complete this project on time.
8. Academic Integrity and Plagiarism
Any cheating attempt to cheat, plagiarism, collusion and any other attempts to gain an
unfair advantage in assessment will cause the students concerned to be penalized.
No mark will be given for student who does not contributing in completing the project.
FACULTY OF COMPUTING AND INFORMATICS
Project Title:
Prepare By,