0% found this document useful (0 votes)

5 views6 pages

DATASCIENCE(Unit-1) Question Bank

The document provides answer keys for a Data Science course, covering multiple choice questions and detailed explanations regarding Big Data, data wrangling, NumPy, Pandas, web scraping, and APIs. It includes both theoretical concepts and practical programming tasks related to data manipulation and analysis. Additionally, it outlines the importance of data cleaning, handling missing values, and ethical considerations in data acquisition.

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views6 pages

DATASCIENCE(Unit-1) Question Bank

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

SRM INSTITUTE OF SCIENCE AND TECHNOLOGY

RAMAPURAM
SCHOOL OF COMPUTER SCIENCE ENGINEERING
FACULTY OF ENGINEERING AND TECHNOLOGY
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

21CSS303T-DATA SCIENCE

UNIT-1 ANSWER KEY

PART-A(Multiple Choice Question)

1.Which of the following is NOT a characteristic of Big Data?

a) Veracity
b) Velocity
c) Virtualization
d) Variety

Answer: c) Virtualization

2.What is the purpose of data wrangling in the data science process?

a) To collect raw data
b) To clean and structure data for analysis
c) To visualize insights
d) To build machine learning models

Answer: b) To clean and structure data for analysis

3.Data Science is an interdisciplinary field that combines:

a) Statistics, Computer Science, and Domain Knowledge
b) Biology, Chemistry, and Mathematics
c) Marketing, Sales, and Customer Service
d) Finance, Economics, and Law

Answer: a) Statistics, Computer Science, and Domain Knowledge

4.What is the default data type of a NumPy array if not specified?

a) Integer
b) Float
c) String
d) Boolean

Answer: b) Float

5.Which of the following creates a NumPy array of zeros?

a) np.empty((3,3))
b) np.zeros((3,3))
c) np.ones((3,3))
d) np.full((3,3))
Answer: b) np.zeros((3,3))

6.What will be the output of np.arange(5, 15, 2)?

a) [5, 7, 9, 11, 13]
b) [5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
c) [5, 10, 15]
d) [5, 15]

Answer: a) [5, 7, 9, 11, 13]

7.How can you reshape a NumPy array?

a) array.reshape(rows, columns)
b) array.split(rows, columns)
c) array.combine(rows, columns)
d) array.transpose(rows, columns)

·Answer: a) array.reshape(rows, columns)

8.Which function sorts elements in a NumPy array?

a) np.order()
b) np.arrange()
c) np.sort()
d) np.index()

Answer: c) np.sort()

9.What will np.eye(3) return?

a) A 3x3 identity matrix
b) A 3x3 matrix of zeros
c) A 3x3 matrix with diagonal elements as 3
d) A 3x3 matrix of ones

· Answer: a) A 3x3 identity matrix

10.What is the correct way to create a Pandas Series?

a) pd.Series([1, 2, 3, 4])
b) pd.DataSeries([1, 2, 3, 4])
c) pd.ListSeries([1, 2, 3, 4])
d) pd.series([1, 2, 3, 4])

Answer: a) pd.Series([1, 2, 3, 4])

11.What will df.head(3) return?

a) The first 3 rows of the DataFrame
b) The last 3 rows of the DataFrame
c) The first 3 columns of the DataFrame
d) The summary statistics of the DataFrame

Answer: a) The first 3 rows of the DataFrame

12.How do you drop a row in a Pandas DataFrame?
a) df.drop(index=[row_number])
b) df.remove([row_number])
c) df.delete([row_number])
d) df.clear([row_number])

Answer: a) df.drop(index=[row_number])

13.What will df.sort_values(by='column_name') do?

a) Sort the DataFrame by the values in column_name
b) Rename the column_name
c) Delete column_name
d) Convert the column_name to integers

Answer: a) Sort the DataFrame by the values in column_name

14.How do you check for missing values in a DataFrame?

a) df.isnull()
b) df.isna()
c) df.notnull()
d) Both a and b

Answer: d) Both a and b

15.What will df['column_name'].rank() do?

a) Assign ranks to the values in column_name
b) Count unique values in column_name
c) Sort column_name
d) Drop duplicates from column_name

Answer: a) Assign ranks to the values in column_name

16.What is the purpose of Web Scraping?

a) To extract and collect data from websites
b) To clean and structure datasets
c) To train machine learning models
d) To store data in databases

Answer: a) To extract and collect data from websites

17.Which Python library is commonly used for web scraping?

a) scrapy
b) beautifulsoup4
c) requests
d) All of the above

Answer: d) All of the above

18.What does an API return data in?
a) CSV format
b) JSON or XML format
c) HTML format
d) PDF format

Answer: b) JSON or XML format

19. What function in the requests library is used to fetch data from an API?
a) requests.fetch()
b) requests.call()
c) requests.get()
d) requests.retrieve()

Answer: c) requests.get()

20.What is the purpose of Open Data sources?

a) To provide free access to data for research and analysis
b) To sell data to private companies
c) To restrict data access to specific users
d) To store confidential information

Answer: a) To provide free access to data for research and analysis

PART-B(4 Marks)

1.Explain the four Vs of Big Data with examples.

2.What are the main steps in the Data Science Process? Briefly explain each step.
3.How is Data Science different from Data Analytics? Provide examples.
4.Why is data cleaning important in Data Science? Give two common data cleaning
techniques.
5.What is NumPy? Explain its advantages over Python lists.
6.Write a Python program to create a 3x3 NumPy array with random numbers and
print its shape and size.
7.Explain the difference between shallow copy and deep copy in NumPy with an
example.
8.What is an identity matrix? How can you create it using NumPy?
9.What is a Pandas Series? How is it different from a Python list?
10.Explain the difference between loc[] and iloc[] in Pandas with an example.
11.How can you sort a Pandas DataFrame based on multiple columns? Give an
example.
15.What is Web Scraping? Mention two Python libraries used for it and their
functions.
16.Explain how to extract data from an API using the Python requests library with an
example.
17.What are Open Data Sources? Give two examples of publicly available datasets.
18.What are the ethical considerations in data acquisition? Discuss any two.
19.What are missing values in a dataset? Explain two methods to handle missing
values in Pandas.

Part-C(12 Marks)

1. Explain the Data Science process in detail. Describe each step with relevant
examples and discuss its importance

2.What are the key challenges in working with Big Data? Explain the four Vs of Big
Data and discuss how these challenges are addressed in Data Science.

3.Compare and contrast Data Science, Data Analytics, and Machine Learning.
Provide examples of their applications in real-world scenarios.

4.Explain NumPy arrays and their advantages over Python lists. Provide examples
demonstrating array creation, indexing, and basic operations.

5.Write a Python program that creates a NumPy array and performs the following
operations:

A. Reshape the array

B. Find the maximum and minimum values
C. Sort the array
D. Perform matrix multiplication with another array
Explain each step in detail.

6.Discuss various ways to manipulate the shape of a NumPy array. Explain with
examples of reshaping, flattening, and transposing an array.
7.What is a Pandas DataFrame? Explain its structure with examples. Discuss common
operations like selecting data, filtering rows, and modifying columns.
8.How can missing data be handled in Pandas? Explain different methods such as
dropping, filling, and interpolation with examples.
9.Write a Python program to load a dataset into a Pandas DataFrame and perform the
following tasks:

a) Display basic information and summary statistics

b) Sort the data based on a column
c) Filter specific rows based on conditions
d) Rename columns and reset the index
Explain the output for each operation.

10Explain Web Scraping in detail. Discuss its applications, ethical concerns, and
demonstrate how to scrape data using BeautifulSoup with a Python example.
11.What are APIs, and how are they used in Data Science? Explain the process of
fetching data from an API with an example using the Python requests library.
12.Discuss different data sources used in Data Science. Compare Open Data, APIs,
and Web Scraping in terms of ease of access, reliability, and ethical considerations.

Web Based Transcript and Result Processing System
No ratings yet
Web Based Transcript and Result Processing System
32 pages
What Is BI?: "Fundamentals of Business Analytics" RN Prasad and Seema Acharya
60% (5)
What Is BI?: "Fundamentals of Business Analytics" RN Prasad and Seema Acharya
20 pages
Assignment Unit I and II
No ratings yet
Assignment Unit I and II
3 pages
DATASCIENCE
No ratings yet
DATASCIENCE
2 pages
data science
No ratings yet
data science
10 pages
Data Science Papers
No ratings yet
Data Science Papers
109 pages
Data Science Notes
No ratings yet
Data Science Notes
44 pages
Data Science Using Python
No ratings yet
Data Science Using Python
7 pages
DVW 203105491_6697_Question_Paper
No ratings yet
DVW 203105491_6697_Question_Paper
2 pages
Revision Questions
No ratings yet
Revision Questions
19 pages
Machine Learning Lecture2
No ratings yet
Machine Learning Lecture2
38 pages
OCS353_Review Questions
No ratings yet
OCS353_Review Questions
3 pages
DS MCQ SEMESTER SUGGESSTION
No ratings yet
DS MCQ SEMESTER SUGGESSTION
26 pages
OCS353 Data Science Fundamentals QB_(Common to EEE,Mech,Civil)
No ratings yet
OCS353 Data Science Fundamentals QB_(Common to EEE,Mech,Civil)
7 pages
Ocs353 Dcf
No ratings yet
Ocs353 Dcf
4 pages
DATA SCIENCE QB
No ratings yet
DATA SCIENCE QB
2 pages
DS-DS Lab-1
No ratings yet
DS-DS Lab-1
4 pages
Sac QB 2023-2024
No ratings yet
Sac QB 2023-2024
2 pages
Report
No ratings yet
Report
18 pages
Datascience
No ratings yet
Datascience
8 pages
Data Science QnA
No ratings yet
Data Science QnA
15 pages
Class X HHW
No ratings yet
Class X HHW
2 pages
Sample MCQ Questions
100% (1)
Sample MCQ Questions
26 pages
3rd EXPERIMENT
No ratings yet
3rd EXPERIMENT
13 pages
python2 materials
No ratings yet
python2 materials
27 pages
Constitution
No ratings yet
Constitution
3 pages
ACFrOgAVOh5a8ifEUMpcOMxc3IztRK_ZZXEZdZiFXnlOdgSSKeU4SpPZnDEubXVlFH5lckQPagU7QFAN34VCQT4R5o1fFLDVhR3GZIv5aPM9xCK3tKWBmtN-DLRMHQXh6e6p0_LmrqAbwFZy57LTyqB_pvoltpGb33aGx-_zTdFx4mQD9r8l7qTQtCOlkCN-86YXb82Kyi9rkpTFZlYL
No ratings yet
ACFrOgAVOh5a8ifEUMpcOMxc3IztRK_ZZXEZdZiFXnlOdgSSKeU4SpPZnDEubXVlFH5lckQPagU7QFAN34VCQT4R5o1fFLDVhR3GZIv5aPM9xCK3tKWBmtN-DLRMHQXh6e6p0_LmrqAbwFZy57LTyqB_pvoltpGb33aGx-_zTdFx4mQD9r8l7qTQtCOlkCN-86YXb82Kyi9rkpTFZlYL
2 pages
Syllabus AIML
No ratings yet
Syllabus AIML
14 pages
Set-D_CT2_answerKey
No ratings yet
Set-D_CT2_answerKey
11 pages
data science lab exp lis
No ratings yet
data science lab exp lis
72 pages
Data Science Workshop - Day 1
No ratings yet
Data Science Workshop - Day 1
80 pages
FDS - 1 SOLVED
No ratings yet
FDS - 1 SOLVED
17 pages
Python GTU Study Material Presentations Unit-2 24072020062038AM
No ratings yet
Python GTU Study Material Presentations Unit-2 24072020062038AM
18 pages
MCQ FDS (1)
No ratings yet
MCQ FDS (1)
5 pages
Unit 1
100% (1)
Unit 1
69 pages
Python Interview QA DataScience GenAI
No ratings yet
Python Interview QA DataScience GenAI
4 pages
fds_merged (3) (1)
No ratings yet
fds_merged (3) (1)
102 pages
DVW 203105491_5926_Question_Paper (1)
No ratings yet
DVW 203105491_5926_Question_Paper (1)
2 pages
End Semester Question Paper
No ratings yet
End Semester Question Paper
3 pages
Foundation of Data Science Solve Question Paper Aug 2022
No ratings yet
Foundation of Data Science Solve Question Paper Aug 2022
7 pages
Question Bank CIA 2
No ratings yet
Question Bank CIA 2
3 pages
Numpy Merged
No ratings yet
Numpy Merged
59 pages
PDSC_Few_Questions_Answers_2020
No ratings yet
PDSC_Few_Questions_Answers_2020
36 pages
22am901 Data Science Using Python Unit 2
No ratings yet
22am901 Data Science Using Python Unit 2
116 pages
Ocs353 Dsf Question Bank 25-26
No ratings yet
Ocs353 Dsf Question Bank 25-26
13 pages
Chapter - 2: Data Science & Python
No ratings yet
Chapter - 2: Data Science & Python
17 pages
Data Analytics
No ratings yet
Data Analytics
11 pages
PDS Question Bank
No ratings yet
PDS Question Bank
19 pages
100 Python Interview Questions
No ratings yet
100 Python Interview Questions
68 pages
DS 3-MARKS SEMESETER SUGGESTION (2)
No ratings yet
DS 3-MARKS SEMESETER SUGGESTION (2)
54 pages
data science unit 1
No ratings yet
data science unit 1
30 pages
Question Papers
No ratings yet
Question Papers
55 pages
Ocs353 Dsf Question Bank 25-26
No ratings yet
Ocs353 Dsf Question Bank 25-26
13 pages
6205solved Ip CL Xii 2020
No ratings yet
6205solved Ip CL Xii 2020
11 pages
Data Science Unit 1 Notes
No ratings yet
Data Science Unit 1 Notes
30 pages
Assignment DS EC11 3
No ratings yet
Assignment DS EC11 3
1 page
IDA_Sample Questions FA1
No ratings yet
IDA_Sample Questions FA1
2 pages
Data Analysis and Visualization LAB
No ratings yet
Data Analysis and Visualization LAB
2 pages
Data Science MCQs Sample Mid2xlsx 2024 11-29-23!19!54
No ratings yet
Data Science MCQs Sample Mid2xlsx 2024 11-29-23!19!54
8 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
100 Puzzles to Learn Data Warehousing
From Everand
100 Puzzles to Learn Data Warehousing
Cristian Scutaru
No ratings yet
21CSS303T DATA SCIENCE SYLLABUS
No ratings yet
21CSS303T DATA SCIENCE SYLLABUS
2 pages
UNIT-4
No ratings yet
UNIT-4
106 pages
21CSE356T-NLP-Unit 4.1
No ratings yet
21CSE356T-NLP-Unit 4.1
46 pages
21CSE356T-NLP- Unit 5
No ratings yet
21CSE356T-NLP- Unit 5
118 pages
NLP_Unit-2_QB_Updated
No ratings yet
NLP_Unit-2_QB_Updated
10 pages
DataScience_Project-new[1]
No ratings yet
DataScience_Project-new[1]
16 pages
Dsa Team 4 Project
No ratings yet
Dsa Team 4 Project
11 pages
05 - A Unified OLAP or OLTP Big Data Processing Framework in Telecom Industry
No ratings yet
05 - A Unified OLAP or OLTP Big Data Processing Framework in Telecom Industry
6 pages
Josepha Telma-Resume (1)
No ratings yet
Josepha Telma-Resume (1)
4 pages
Ôn Tập Integrate
No ratings yet
Ôn Tập Integrate
5 pages
SAP HANA System Replication Guide en
No ratings yet
SAP HANA System Replication Guide en
274 pages
Types of RDBMS
No ratings yet
Types of RDBMS
6 pages
Event-Driven Programming 240930 142125
No ratings yet
Event-Driven Programming 240930 142125
3 pages
6.03 - Threat - Diagnostic - StoredProcedure - V10x - Lab
No ratings yet
6.03 - Threat - Diagnostic - StoredProcedure - V10x - Lab
18 pages
Dahua AI Network Video Recorder User's Manual V1.0.7 PDF
No ratings yet
Dahua AI Network Video Recorder User's Manual V1.0.7 PDF
419 pages
Kotlin Android Sqlite Example Application
No ratings yet
Kotlin Android Sqlite Example Application
8 pages
3-Statistical Measures of Data PDF
No ratings yet
3-Statistical Measures of Data PDF
19 pages
CB3401-DBMSS
No ratings yet
CB3401-DBMSS
25 pages
100 SQL Tips
No ratings yet
100 SQL Tips
176 pages
DMS PDF
No ratings yet
DMS PDF
181 pages
LTE OMC Functions Introduction
No ratings yet
LTE OMC Functions Introduction
46 pages
Knowledge Discovery of Weighted RFM Sequential Patterns From Customer
No ratings yet
Knowledge Discovery of Weighted RFM Sequential Patterns From Customer
10 pages
Aggregate Function
No ratings yet
Aggregate Function
5 pages
Exercise 7 OUTER JOINS
No ratings yet
Exercise 7 OUTER JOINS
8 pages
Niis Project SIP
No ratings yet
Niis Project SIP
61 pages
58614
No ratings yet
58614
62 pages
EDB116 CG v1.0 SS
No ratings yet
EDB116 CG v1.0 SS
12 pages
SAP HANA On Power Level 1 Quiz - Attempt Review
No ratings yet
SAP HANA On Power Level 1 Quiz - Attempt Review
16 pages
Computer 12 Class Notes (Book)
No ratings yet
Computer 12 Class Notes (Book)
254 pages
Database Management System L6 MARCH MOCK
No ratings yet
Database Management System L6 MARCH MOCK
5 pages
Cricket Management System - TutorialsDuniya
No ratings yet
Cricket Management System - TutorialsDuniya
51 pages
Video Data Description
No ratings yet
Video Data Description
1 page
Scientific Publication And: Evaluation of Scientific Activity
No ratings yet
Scientific Publication And: Evaluation of Scientific Activity
23 pages
Lesson Plan BCA103 Database Management System
No ratings yet
Lesson Plan BCA103 Database Management System
2 pages
2021 AWS Xray Guide
No ratings yet
2021 AWS Xray Guide
355 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

DATASCIENCE(Unit-1) Question Bank

Uploaded by

DATASCIENCE(Unit-1) Question Bank

Uploaded by

SRM INSTITUTE OF SCIENCE AND TECHNOLOGY

UNIT-1 ANSWER KEY

PART-A(Multiple Choice Question)

1.Which of the following is NOT a characteristic of Big Data?

2.What is the purpose of data wrangling in the data science process?

Answer: b) To clean and structure data for analysis

3.Data Science is an interdisciplinary field that combines:

Answer: a) Statistics, Computer Science, and Domain Knowledge

4.What is the default data type of a NumPy array if not specified?

5.Which of the following creates a NumPy array of zeros?

6.What will be the output of np.arange(5, 15, 2)?

Answer: a) [5, 7, 9, 11, 13]

7.How can you reshape a NumPy array?

·Answer: a) array.reshape(rows, columns)

8.Which function sorts elements in a NumPy array?

9.What will np.eye(3) return?

· Answer: a) A 3x3 identity matrix

10.What is the correct way to create a Pandas Series?

Answer: a) pd.Series([1, 2, 3, 4])

11.What will df.head(3) return?

Answer: a) The first 3 rows of the DataFrame

13.What will df.sort_values(by='column_name') do?

Answer: a) Sort the DataFrame by the values in column_name

14.How do you check for missing values in a DataFrame?

Answer: d) Both a and b

15.What will df['column_name'].rank() do?

Answer: a) Assign ranks to the values in column_name

16.What is the purpose of Web Scraping?

Answer: a) To extract and collect data from websites

17.Which Python library is commonly used for web scraping?

Answer: d) All of the above

Answer: b) JSON or XML format

20.What is the purpose of Open Data sources?

Answer: a) To provide free access to data for research and analysis

1.Explain the four Vs of Big Data with examples.

A. Reshape the array

a) Display basic information and summary statistics

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.