Machine Learning CS5011 Assignment #2: Dr. B. Ravindran

Download as pdf or txt
Download as pdf or txt
You are on page 1of 8

Machine Learning CS5011

Assignment #2
Submission Date: 07 October 2016

Instructor: Dr. B. Ravindran

Avinash Sharma(CS16D401)

1
Avinash Sharma(CS16D401) Assignment #2

Q1: SVM

You have been provided with training instances for an image classification problem DS2. You
have to train an SVM to classify the test images into either of the following four categories:
coast, forest, inside-city, mountain.
Use the training data to build classification models using the following kernels: Linear, Poly-
nomial, Gaussian and Sigmoid kernel.
Come up with the kernel parameters for the various models. You can use a fraction of data
supplied to do a n-fold cross validation to find the best model parameters.

Optimal Hyperparameter values for different SVC kernals are:

Linear Kernal

C = 20

Polynomial Kernal

C=2
=1
Degree = 4

Linear Kernal

C = 10
= 0.5

Linear Kernal

C = 20
= 0.1

Table 1: Performance Index for SVC using different Kernals

Kernal Accuracy Precision Recall F-measure


Linear 0.6 0.626 0. 0.626
Polynomial 0.65 0.642 0.642 0.642
RBF (Gaussian) 0.7 0.704 0.704 0.704
Sigmoid 0.537 0.578 0.578 0.578

Page 2 of 8
Avinash Sharma(CS16D401) Assignment #2 Q1: SVM

Q2: Neural Network

Implement original back-propagation algorithm. Use DS2 for training your neural network.
Report per-class precision, recall and F-measure on the test data used in Question 1. Now
consider the alternate error function. Derive the gradient descent update rule for this definition
of R. Now train your neural network with this new error function. Report per-class precision,
recall and F-measure on the same test data. What will happen when you vary the value of ?
Vary the value of from 102 to 102 in multiples of 10 and repeat the experiment and report
the results. Can you figure out the effect of in the results? Look at the weights learnt using
the new error function. What do you infer from them?

Table 2: Performance Index for NN using different values

Accuracy Precision Recall F-measure


0 0.6875 0.689 0.6875 0.6775
0.01 0.725 0.742 0.725 0.728
0.1 0.7 0.713 0.7 0.686
1 0.6875 0.718 0.6875 0.656
10 0.25 0.0625 0.25 0.1
100 0.25 0.0625 0.25 0.1

Gradient Descent Weight Update Rule for L2 normalized is given in Eq. 1 and 2.
R
km km (1 )/N (1)
km
R
km km (1 )/N (2)
km

On increasing the accuracy, precision, recall and F-measure are getting decreased. But at 0.1 and 0.01
values the performance in terms of these performance indices is better in comparison to that without the
regularization. This is due to decrease in the variance of the fit. Further, as we are increasing the the
learned weights are getting reduced as it essentially applies penalty on the weights of the network.

Q3: Decision Tree

You need to use Weka for this question. We will use Mushroom dataset from UCI ma-
chine learning repository (https://archive.ics.uci.edu/ml/datasets/Mushroom). This is a 2-
class problem with 8124 instances. Use the last 1124 instances as test data and the rest as
training data.

Convert the data into ARFF format.

Q3: Decision Tree continued on next page. . . Page 3 of 8


Avinash Sharma(CS16D401) Q3: Decision Tree
Assignment
(continued)Q3:
#2 Decision Tree continued on next page. . .

Run J48 Decision Tree algorithm from Weka. Report precision, recall and f1- mea- sure.
What is the effect of MinNumObj on the performance? What happens when you do
reducedErrorPruning?

What are the important features in deciding whether a mushroom is edible or not?

Turn in the Decision Tree learnt by the model (the decision tree with the best perfor-
mance).

Unprunned Tree:

Accuracy = 1

Precision =1

Recall = 1

F1-measue = 1

By default the value of MinNumObj is 2. On incresing its value the performance of tree remains same upto
MinNumObj = 24. After that the performance indices slightly reduces.

Accuracy = 0.994

Precision = 0.985

Recall = 1

F1-measue = 0.993

On using reduced error prunning the performance of tree is not affected by change in confidence interval but
it gets reduced to below mentioned values at 5 folds. Important features are: odor, stalk-shape, spore-print-
color, gill-size, gill-spacing and population.

Q3: Decision Tree (continued)Q3: Decision Tree continued on next page. . . Q3: Decision Tree (continued)Q3:
Decision Tree continued on next page. . .

Q3: Decision Tree (continued)Q3: Decision Tree continued on next page. . . Q3: Decision Tree

Q3: Decision Tree (continued)Q3: Decision Tree continued on next page. . . Page 4 of 8
Avinash Sharma(CS16D401) Q3: Decision Tree
Assignment
(continued)Q3:
#2 Decision Tree continued on next page. . .

Figure 1: Unprunned tree

Q3: Decision Tree (continued)Q3: Decision Tree continued on next page. . . Page 5 of 8
Avinash Sharma(CS16D401) Q3: Decision Tree
Assignment
(continued)Q3:
#2 Decision Tree continued on next page. . .

Figure 2: Prunned Tree at 4 folds

Q3: Decision Tree (continued)Q3: Decision Tree continued on next page. . . Page 6 of 8
Avinash Sharma(CS16D401) Q3: Decision Tree
Assignment
(continued)Q3:
#2 Decision Tree continued on next page. . .

Figure 3: Prunned Tree at 10 folds

Q3: Decision Tree (continued)Q3: Decision Tree continued on next page. . . Page 7 of 8
Avinash Sharma(CS16D401) Q3: Decision Tree
Assignment
(continued)Q3:
#2 Decision Tree continued on next page. . .

Figure 4: Prunned Tree at 70 folds

Q3: Decision Tree (continued)Q3: Decision Tree continued on next page. . . Page 8 of 8

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy