May Jun 2023
May Jun 2023
May Jun 2023
8
23
P490 [Total No. of Pages : 2
ic-
[6003]-711
tat
T.E. (I.T.)
9s
1:3
DATA SCIENCE AND BIG DATA ANALYTICS
02 91
0:4
(2019 Pattern) (Semester-II) (314452)
0
31
Time : 2½ Hours]
3/0 13 [Max. Marks : 70
0
6/2
Instructions to the candidates:
.23 GP
8
C
23
3) Neat diagrams must be drawn wherever necessary.
ic-
4) Assume suitable data, if necessary.
16
tat
8.2
9s
Q1) a) Explain Google file system and its advantages. [10]
.24
1:3
91
b) Explain Hadoop distributed file system [8]
49
0:4
30
OR
31
01
Q2) a) Why map reduce is required in Hadoop? Explain the stages involved in
02
b) Describe the various types of NoSQL Databases with example and also
CE
8
23
.23
ic-
16
Q3) a) Explain Mean, Mode and variance and standard deviation with suitable
tat
8.2
example. [9]
9s
.24
1:3
0:4
OR
30
31
Q4) a) Explain Min-max scaling. For the following dataset carry out min-max
01
02
b) What is data Wrangling? Why do you need it? explain data Wrangling
3/0
methods? [8]
CE
82
.23
OR
.24
49
P.T.O.
8
Q6) a) Explain data visualization with the help of example? What are the
23
advantages of data visualization? [9]
ic-
tat
b) Explain Data Visulization with Tableau. [9]
9s
1:3
02 91
Q7) a) Explain Big Data Analytics Challenges in brief. [9]
0:4
0
b) Explain types of Mobile Analytics. [8]
31
3/0 13
OR
0
6/2
.23 GP
Q8) a) What is Porters valuation creation model? Explain porter’s value chain
analysis. [9]
E
82
8
C
23
b) What is social media analytic? Explain the process of social media data
ic-
analytic. [8]
16
tat
8.2
9s
.24
1:3
91
49
0:4
30
31
01
02
6/2
GP
3/0
CE
82
8
23
.23
ic-
16
tat
8.2
9s
.24
1:3
91
49
0:4
30
31
01
02
6/2
GP
3/0
CE
82
.23
16
8.2
.24
[6003]-711
49