May Jun 2022
P812 [5870] - 1133
[Total No. of Pages : 2
T.E. (Computer Engineering)
DATA SCIENCE AND BIG DATA ANALYTICS
(2019 Pattern) (Semester - II) (310251)
Time : 2½ Hours] [Max. Marks : 70
Instructions to the candidates:
1) Answer Q.1 or Q.2, Q.3 or Q.4, Q.5 or Q.6, Q.7 or Q.8.
2) Neat diagrams must be drawn wherever necessary.
4) Assume suitable data, if necessary.
Q1) a) What is driving data deluge? Explain with one example. [9]
b) What is data science? Differentiate between Business Intelligence and
Data Science. [9]
OR
Q2) a) What are the sources of Big Data? Explain the model building phase with
example. [9]
discovery phase. Explain with example. [9]
OR
i) Linear Regression
i) Time series Analysis
ii) TF-IDF. [9]
b) What is clustering? With suitable example explain the steps involved in
k-means algorithm. [9]
OR
i) Confusion matrix
ii) AUC-ROC curve [9]
b) Discuss Holdout method and Random Sub Sampling methods. [9]
Q7) a) With a suitable example explain Histogram and explain its usages. [8]
in brief. [9]
OR
Q8) a) With a suitable example explain and draw a Box plot and explain its
b) Describe the challenges of data visualization. Draw box plot and explain