Assignment 3


Q.1 What is 5-fold cross-validation?


a. The original sample is randomly partitioned into 5 equal sub-samples. A single randomly
chosen sub-sample is retained for testing the model, and the remaining 4 are used as
training data.
b. The original data set is randomly split into 20% testing data and 80% training data.
This process is repeated exactly 5 times, and the results are averaged to obtain an
estimate.
c. The original sample is randomly partitioned into 5 equal sub-samples. A single sub-sample
is retained for testing the model, and the remaining 4 are used as training data. This
process is repeated 5 times, with each sub-sample used exactly once as testing data. The
results are then averaged.
d. The original data set is randomly split into 20% testing data and 80% training data.

Ans- c
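
The procedure in option c can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: the helper names `k_fold_indices` and `cross_validate`, the fixed seed, and the toy scoring function are all assumptions made for the example.

```python
import random

def k_fold_indices(n_samples, k=5, seed=0):
    """Randomly partition range(n_samples) into k folds of near-equal size."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index, starting at position i.
    return [indices[i::k] for i in range(k)]

def cross_validate(n_samples, evaluate, k=5, seed=0):
    """Average evaluate(train_idx, test_idx) over k folds.

    Each fold serves as the test set exactly once; the other k-1 folds
    together form the training set.
    """
    folds = k_fold_indices(n_samples, k, seed)
    scores = []
    for i, test_idx in enumerate(folds):
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        scores.append(evaluate(train_idx, test_idx))
    return sum(scores) / k

# Hypothetical scorer for illustration only: the fraction of data held out.
held_out_fraction = cross_validate(
    20, lambda train, test: len(test) / (len(train) + len(test))
)
print(held_out_fraction)
```

In practice `evaluate` would train a model on `train_idx` and return its score on `test_idx`; with 5 folds each round holds out about 20% of the data.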

Note- Use the content below to answer Q.2, Q.3, and Q.4

The table below gives the actual values (Ya) and the predicted values (Yp) for students'
marks in the end-semester examination. The values 1 and 0 indicate that a student scored
greater than 50% or less than 50%, respectively.

Roll no.   Ya   Yp
1          1    1
2          1    0
3          0    0
4          1    0
5          1    1
6          0    1
7          0    0
8          0    1
9          1    0
10         1    1
11         0    0
12         1    0
13         1    1
14         0    1
15         1    0

Q.2 For the above table, calculate the value of precision.


a. 0.4
b. 0.75
c. 0.57
d. 0.25

Ans- c

Q.3 For the above table, calculate the value of recall.

a. 0.25
b. 0.50
c. 0.44
d. 0.80
Ans- c

Q.4 For the above table, calculate the value of accuracy.

a. 0.25
b. 0.46
c. 0.75
d. 0.80
Ans- b
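
The answers to Q.2, Q.3, and Q.4 can be checked by counting the four confusion-matrix cells directly from the table. The sketch below assumes that 1 is the positive class; the variable names are illustrative.

```python
# Actual (Ya) and predicted (Yp) labels for roll numbers 1-15, copied from the table.
ya = [1, 1, 0, 1, 1, 0, 0, 0, 1, 1, 0, 1, 1, 0, 1]
yp = [1, 0, 0, 0, 1, 1, 0, 1, 0, 1, 0, 0, 1, 1, 0]

pairs = list(zip(ya, yp))
tp = sum(1 for a, p in pairs if a == 1 and p == 1)  # true positives
fp = sum(1 for a, p in pairs if a == 0 and p == 1)  # false positives
fn = sum(1 for a, p in pairs if a == 1 and p == 0)  # false negatives
tn = sum(1 for a, p in pairs if a == 0 and p == 0)  # true negatives

precision = tp / (tp + fp)        # of the predicted positives, how many are correct
recall = tp / (tp + fn)           # of the actual positives, how many were found
accuracy = (tp + tn) / len(pairs) # fraction of all predictions that are correct

print(tp, fp, fn, tn)  # 4 3 5 3
print(f"{precision:.2f} {recall:.2f} {accuracy:.2f}")  # 0.57 0.44 0.47
```

Accuracy is 7/15 ≈ 0.467, which rounds to 0.47; option b (0.46) is the truncated value and the closest of the choices offered.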

Q.5 While conducting a study, a researcher wants to classify engaged and unengaged behavior
from the collected data. The researcher uses two classification techniques, logistic
regression (LR) and a decision tree (DT) classifier. The performance metrics of the two
classifiers are as follows:

   Classifier                 Accuracy   TPR   TNR
1  Logistic Regression (LR)   0.68       0.6   0.9
2  Decision Tree (DT)         0.69       0.7   0.2

From the above data, we can conclude that:

a. Classifier 1 (LR) is better
b. Classifier 2 (DT) is better
c. Both will give the same results
d. Data Insufficient

Ans- a
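
One way to see why LR is preferred despite its slightly lower accuracy is to compare balanced accuracy, the mean of TPR and TNR. The question does not name this criterion, so treat it as an assumed tie-breaker for illustration.

```python
# TPR and TNR values copied from the table above.
classifiers = {
    "LR": {"tpr": 0.6, "tnr": 0.9},
    "DT": {"tpr": 0.7, "tnr": 0.2},
}

# Balanced accuracy: the mean of TPR and TNR (an assumed comparison criterion).
balanced = {name: (m["tpr"] + m["tnr"]) / 2 for name, m in classifiers.items()}
best = max(balanced, key=balanced.get)

for name, score in balanced.items():
    print(f"{name}: {score:.2f}")
print("higher balanced accuracy:", best)
```

The two accuracies are nearly identical, but DT's TNR of 0.2 means it misclassifies most of the negative class, which the balanced score makes visible.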

Q.6 Two raters, Rater-1 and Rater-2, rate the engagement of students while they are working
in a TELE. The following table shows their observations.

                      Rater-1 Engaged   Rater-1 not Engaged
Rater-2 Engaged             25                  10
Rater-2 not Engaged         15                  20

Calculate the value of kappa.
Note- Report your answer correct to 2 decimal places.
a. 0.28
b. 0.43
c. 0.50
d. 0.17
Ans- a
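
Cohen's kappa compares the observed agreement with the agreement expected by chance: kappa = (Po - Pe) / (1 - Pe). The sketch below applies that formula to the table in Q.6; the function name `cohens_kappa` is an illustrative choice.

```python
def cohens_kappa(table):
    """Cohen's kappa for a square agreement table (rows: rater 2, columns: rater 1)."""
    n = sum(sum(row) for row in table)
    k = len(table)
    # Observed agreement: proportion of items on the diagonal.
    observed = sum(table[i][i] for i in range(k)) / n
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(k)) for j in range(k)]
    # Chance agreement: sum over categories of the product of marginal proportions.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / n**2
    return (observed - expected) / (1 - expected)

kappa = cohens_kappa([[25, 10], [15, 20]])
print(f"{kappa:.4f}")  # 0.2857
```

Here Po = 45/70 ≈ 0.643 and Pe = (35×40 + 35×30)/70² = 0.5, giving kappa ≈ 0.2857. This rounds to 0.29; option a (0.28) is the truncated value.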

Q.7 The harmonic mean of Precision and Recall is another performance metric known as
a. Cohen’s Kappa
b. F1-score
c. Sensitivity
d. Specificity
Ans- b
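
As a quick check of the definition, the F1-score can be computed for the precision and recall obtained in Q.2 and Q.3 (4/7 and 4/9). The variable names are illustrative.

```python
precision = 4 / 7  # from Q.2: TP / (TP + FP)
recall = 4 / 9     # from Q.3: TP / (TP + FN)

# F1 is the harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)
print(f"{f1:.2f}")  # 0.50
```

The same value follows from the counts directly: 2·TP / (2·TP + FP + FN) = 8 / 16 = 0.5.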

Q.8 ML is not required to be implemented in situations where


a. A personalized learning solution is required
b. Human expertise exists and automation is not required.
c. Rules are difficult to extract
d. Learners’ emotions must be predicted.
Ans- b

Q.9 For the confusion matrix below, the accuracy value is ______
(Report your answer correct to two decimal places)

               Actual A   Actual !A
Predicted A       15         30
Predicted !A      20         10

a. 33%
b. 20%
c. 60%
d. 30%
Ans: a
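
The accuracy in Q.9 is the sum of the diagonal cells divided by the total count. A minimal sketch, assuming the layout given in the question (rows are predictions, columns are actual classes):

```python
# Rows: Predicted A, Predicted !A. Columns: Actual A, Actual !A.
matrix = [
    [15, 30],
    [20, 10],
]

correct = matrix[0][0] + matrix[1][1]  # predicted A & actual A, predicted !A & actual !A
total = sum(sum(row) for row in matrix)
accuracy_q9 = correct / total
print(f"{accuracy_q9:.2%}")  # 33.33%
```
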
Q.10 For imbalanced datasets, which of the following performance metrics is generally used?
a. Precision
b. F-Score
c. Recall
d. Accuracy

Ans: b
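
A small made-up example shows why accuracy can mislead on imbalanced data. The data set below is hypothetical (95 negatives, 5 positives), and the "classifier" simply predicts the majority class every time.

```python
# Hypothetical imbalanced data set: 95 negatives (0) and 5 positives (1).
actual = [0] * 95 + [1] * 5
# A trivial classifier that always predicts the majority class.
predicted = [0] * 100

pairs = list(zip(actual, predicted))
tp = sum(1 for a, p in pairs if a == 1 and p == 1)
fp = sum(1 for a, p in pairs if a == 0 and p == 1)
fn = sum(1 for a, p in pairs if a == 1 and p == 0)
tn = sum(1 for a, p in pairs if a == 0 and p == 0)

accuracy = (tp + tn) / len(pairs)
# Precision, recall, and F-score are taken as 0 when their denominator is 0
# (a common convention, assumed here).
precision = tp / (tp + fp) if tp + fp else 0.0
recall = tp / (tp + fn) if tp + fn else 0.0
f_score = 2 * precision * recall / (precision + recall) if precision + recall else 0.0

print(f"accuracy={accuracy:.2f} f_score={f_score:.2f}")  # accuracy=0.95 f_score=0.00
```

The classifier never finds a positive case, yet its accuracy is 0.95; the F-score of 0 reflects the failure on the minority class.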
