DAS601 Project
DAS601 Project
DAS601 Project
To Read:
The initial phase of the project involves finding a problem that can be addressed
through the application of machine learning techniques. The issue might pertain to
various domains, including science, social science, health, business, finance, and
others. Furthermore, it is imperative to gather a suitable dataset in order to address
the problem effectively. You can utilize the following resources to gather data:
1. https://www.kaggle.com/datasets (Kaggle)
2. https://archive.ics.uci.edu/ (UCI Machine Learning Repository)
3. https://paperswithcode.com/datasets (Papers with Code)
Once the problem has been determined and the necessary data has been gathered,
it is imperative to compose an Introduction chapter. This chapter should consist of
the following paragraphs:
The introduction section of the subsequent papers may serve as a model. Attempt
to comprehend and comprehend the structure and progression of the introduction,
and endeavor to organize your own introduction in a similar manner.
In this section, your task is to delve into the dataset and systematically identify
and document its various characteristics. You are required to compose different
sections of the Methodology and Results chapters. The detailed instructions for
each chapter are as follows:
2. Lastly, determine & document the quantity of data you wish to retain for
training and testing purposes and divide the dataset accordingly. Please
utilize the training dataset exclusively for all subsequent assignments from
this point forward.
After that, you will select the best features among the existing ones and the newly
created ones using the following methods:
Models
This section pertains to the development of machine learning models. Initially, you
must identify a minimum of five (you can choose more) distinct models using your
expertise in machine learning and the results of your literature evaluation.
Subsequently, establish a subsection entitled "Models" and provide a detailed
account of the theories, advantages, and disadvantages of each model, supported
by diagrams and references, and your reasons for choosing them.
Overall Structure
0 Abstract
1 Introduction
2 Methodology
2.1 Dataset
2.2 Feature Engineering
2.3 Model Selection
2.4 Model Evaluation
3 Results
3.1 Data
3.2 Model Selection
3.3 Model Evaluation
3.4 Error Analysis
4 Discussion
5 Conclusion