Questions For Chapter 2
5. Assume that we have a data set containing information about 200 individuals. One hundred of these
individuals have purchased life insurance. A supervised data mining session has discovered the
following rule:
How many individuals in the class life insurance = no have credit card insurance and are less than 30
years old?
a. 140
b. 60
c. 42
d. 18
6. Which statement is true about neural network and linear regression models?
a. Both models require input attributes to be numeric.
b. Both models require numeric attributes to range between 0 and 1.
c. The output of both models is a categorical attribute value.
d. Both techniques build models whose output is determined by a linear sum of weighted input
attribute values.
e. More than one of a, b, c, or d is true.
(2.3)
(2.4)
(2.5)
9. The average positive difference between computed and desired outcome values.
a. root mean squared error
b. mean squared error
c. mean absolute error
d. mean positive error
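The standard error measures named among these options can be compared side by side with a minimal sketch. The function name `error_measures` and the sample values are hypothetical and chosen only for illustration; they do not come from the text.

```python
import math


def error_measures(computed, desired):
    """Return (MAE, MSE, RMSE) for paired computed and desired values."""
    if len(computed) != len(desired) or not computed:
        raise ValueError("inputs must be non-empty and of equal length")
    # Signed difference between each computed and desired outcome.
    diffs = [c - d for c, d in zip(computed, desired)]
    # Mean absolute error: average of the absolute differences.
    mae = sum(abs(x) for x in diffs) / len(diffs)
    # Mean squared error: average of the squared differences.
    mse = sum(x * x for x in diffs) / len(diffs)
    # Root mean squared error: square root of the mean squared error.
    rmse = math.sqrt(mse)
    return mae, mse, rmse


# Hypothetical values, for illustration only.
computed = [2.0, 4.0, 6.0, 8.0]
desired = [1.0, 4.0, 8.0, 8.0]
mae, mse, rmse = error_measures(computed, desired)
print(mae, mse, rmse)
```

With these sample values the differences are 1, 0, -2, and 0, so the mean absolute error is 0.75 and the mean squared error is 1.25.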
12. With this method, all available data are partitioned into n fixed-size units. n - 1 of the
units are used for training, whereas the nth unit is the test set.
a. x-prediction
b. stratification
c. cross validation
d. bootstrapping
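The partitioning procedure described in the stem of question 12 can be sketched as follows. The function name `partition_rounds` is hypothetical, and the sketch assumes that each unit takes one turn as the test set and that n divides the data size evenly; neither assumption is stated in the question.

```python
def partition_rounds(data, n):
    """Yield (training, test) pairs over n fixed-size units of data."""
    if n < 2 or len(data) % n != 0:
        raise ValueError("n must be >= 2 and divide len(data) evenly")
    size = len(data) // n
    # Split the data into n consecutive units of equal size.
    units = [data[i * size:(i + 1) * size] for i in range(n)]
    for held_out in range(n):
        # One unit is held out as the test set ...
        test = units[held_out]
        # ... and the remaining n - 1 units are merged for training.
        training = [x for i, unit in enumerate(units) if i != held_out for x in unit]
        yield training, test


# Hypothetical data set of 12 instances split into 4 units.
rounds = list(partition_rounds(list(range(12)), 4))
for training, test in rounds:
    print(len(training), test)
```

In practice the instances are usually shuffled before partitioning; the sketch keeps them in order so that the units are easy to inspect.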
              Computed Decision
           Class 1   Class 2   Class 3
Class 1       10         5         3
Class 2        5        15         3
Class 3        2         2         5
Use the confusion matrix for Model X and confusion matrix for Model Y to answer questions 4 through 6.
Model X                        Model Y
          Accept   Reject                Accept   Reject
Accept      10        5        Accept       6        9
Reject      25       60        Reject      15       70
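Overall accuracy for the two matrices above can be checked with a minimal sketch. It assumes the first block of numbers belongs to Model X and the second to Model Y, and that correct classifications lie on the main diagonal; the function name `accuracy` is hypothetical.

```python
def accuracy(matrix):
    """Fraction of instances on the main diagonal of a square confusion matrix."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total


# Rows and columns are ordered Accept, Reject (assumed layout).
model_x = [[10, 5], [25, 60]]
model_y = [[6, 9], [15, 70]]

acc_x = accuracy(model_x)  # (10 + 60) / 100
acc_y = accuracy(model_y)  # (6 + 70) / 100
print(acc_x, acc_y)
```

Under these assumptions Model X classifies 70 of 100 instances correctly and Model Y classifies 76 of 100 correctly. Accuracy alone does not show how the errors split between false accepts and false rejects, which the off-diagonal cells make visible.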