0% found this document useful (0 votes)

63 views4 pages

Chapter 4 - Data Science

Uploaded by

vanshrajsingh.14082009

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views4 pages

Chapter 4 - Data Science

Uploaded by

vanshrajsingh.14082009

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Chapter 4: Data Science

Class X

Q1. What is data science?

Ans: Data science is a field that uses scientific methods, processes, algorithms, and systems to
extract knowledge and insights from many structural and unstructured data to apply in AI
applications.

Q2. What is targeted advertising?

Ans: Targeted advertising is a form of advertising, including online advertising, that is directed
towards an audience with certain traits, based on the product or person that advertiser is
promoting. It makes use of past data about the needs and choices of the user and fixes products
and time for advertising the product accordingly.

Q3. What is the recommended system?

Ans: A recommended system refers to a system that is capable of predicting the future
preference of a set of items for a user, and recommending the top item. Recommended system
helps the retailers/sellers and the users by suggesting items similar to the ones a person likes or
by suggesting items like by people who are similar to the user.

Q4. How has data science impacted the healthcare field?

Ans: Data science provides practical insights in the crucial decision making concerning
healthcare. Data driven decision making opens up new possibilities to boost healthcare quality.
Data science has improved the healthcare in various ways, such as

i. Improving diagnostic accuracy and efficiency

ii. Turning patient, Care into process medicine

iii. Advancing pharmaceutical research to find cure

iv. Reducing hospital re-admissions by suggesting preventive care and many more.

Q5. In what ways data science is helpful to the airline industry?

Ans: Data science really proved to be a boon to this industry as it helps to:

* Predict flight delay

* Decide which class of airplanes to buy

* Whether to directly land at the destination or take a hold in between

* Effectively drive, customer loyalty programs

Q6. Explain the term Outliers data. Give an example.

Ans: Outliers means the data that differs drastically from the rest of the data. The kind of unusual
data needs to be removed or replaced from the data set for accurate results. For example, value
zero, given in marks of a student who is absent instead of exemption. This will not give an
accurate class average.

Q7. What type of data can be used by pandas?

Ans: Pandas can be used for the following:

* Tabular data with heterogeneously typed columns, as in an SQL table or Excel spreadsheet

* Ordered and unordered time series data

* Arbitrary matrix data with row and column labels

* Any other form of observation/statistical data sets.

Q8. Why is KNN called a lazy learner algorithm?

Ans: KNN is also called a lazy learner algorithm because it does not learn from the training set
immediately. Instead, it stores the data set and at the time of classification, might perform an
action on the data set.

Q9. What are the important points to remember when data is collected?

Ans: While handling data online or off-line, the following points to be always remembered:

* The source of data should be authentic and reliable, as the random data source could provide
wrong or unusable data.

* For proper training of AI model, the authenticity of data is must.

* Privacy of data sources should always be kept in mind, as it is a fundamental right of

everyone.

* Consent of the owner of the data should be seeked, before using someone’s personal data set.

* Data present in the public domain should preferably be used, if available.

Q10. Explain the box plot graph.

Ans: The box plot graph represents the summary of the set of data values where a box is created
for each having properties like minimum, first quartile, median, third quartile and maximum. A
vertical line goes to the box at the median. Here, X axis denotes the data to be plotted while the
Y axis shows the frequency distribution.
Q11. Differentiate between arrays and lists in Python.

Ans: The following are the differences between a arrays and lists:

Array List

Array is a collection of homogenous values. List is a collection of heterogeneous values.

In arrays data of one type does not support List works perfectly by using data of one
data of another type. type by converting it into another data type.

Arrays can be accessed only through the List occupies more memory space and can
package – NumPy and occupies less be accessed directly in python without any
memory space. package support.

In arrays, the mathematical operators can In list the mathematical operators cannot be
be directly used. used directly on it instead need to be used
separately on individual elements.

Q12. What are pandas used for?

Ans: Panda is an open- source Python Library used for data manipulation and data analysis.

Q13. What is erroneous data? Explain its two types.

Ans: Erroneous data is test data that falls outside of what is acceptable and should be rejected by
the system.

The two types of erroneous data are:

Incorrect Values: The values in the dataset at random placers are not correct. Either the data is
mismatched or it is not relevant to that position.

Invalid or null values: It means value is either corrupted or has no meaning. These values when
occurring in a dataset need to be removed as they hold no value for data processing.

Q14. What are packages in Python?

Ans: Python Packages are a way to organize and structure our Python code into reusable
components. It is like a folder that contains related Python files (modules) that work together to
provide certain functionality. Packages help keep our code organized, make it easier to manage
and maintain, and allow us to share our code with others.
Q15. Explain the different formats in which the tabular dataset can be stored

Ans: The tabular data set can be stored in different formats. Some of the commonly used formats
are:

CSV: it stands for, separated values. It is a simple file format used to store tabular data. Each line
of this file is a data record and each record consists of one or more fields which are separated by
commas.

Spreadsheet: A spreadsheet is a piece of paper or a computer program which is used for

accounting and recording data using rows and columns into which information can be entered.

SQL: structured query language is a domain specific programming language used in

programming and is designed for managing data held in different kinds of DBMS. It is
particularly useful in handling structured data.

Extra Questions

QUARTER 1 MELC 2 Plate Boundaries
No ratings yet
QUARTER 1 MELC 2 Plate Boundaries
19 pages
Data science
No ratings yet
Data science
16 pages
Data Analytics Lab QA
No ratings yet
Data Analytics Lab QA
7 pages
ibm_ps.1_trayambak.
No ratings yet
ibm_ps.1_trayambak.
3 pages
0 - Bethune College - IDC Syllabus of All Department - 230810 - 194239
No ratings yet
0 - Bethune College - IDC Syllabus of All Department - 230810 - 194239
5 pages
DS Final 3 Marks
No ratings yet
DS Final 3 Marks
10 pages
Ch-04: Data and Analysis - Short Question and Answers | PDF
No ratings yet
Ch-04: Data and Analysis - Short Question and Answers | PDF
10 pages
Q_CELLS_Data_sheet_Q.PEAK_DUO_BLK_ML-G9_365-385_2021-05_Rev04_NA
No ratings yet
Q_CELLS_Data_sheet_Q.PEAK_DUO_BLK_ML-G9_365-385_2021-05_Rev04_NA
2 pages
FDS IMP DOCS
No ratings yet
FDS IMP DOCS
22 pages
Data Science selection Questions and their answers 2022
No ratings yet
Data Science selection Questions and their answers 2022
5 pages
Paper - II Linguistics
No ratings yet
Paper - II Linguistics
16 pages
ixs8h-l8mgc
No ratings yet
ixs8h-l8mgc
40 pages
Learning Activities: Activity 1. Directions: Answer The Following Questions Below: Write Your Answer Inside The Box
No ratings yet
Learning Activities: Activity 1. Directions: Answer The Following Questions Below: Write Your Answer Inside The Box
3 pages
Data Science Exam Material
No ratings yet
Data Science Exam Material
10 pages
class 8
No ratings yet
class 8
5 pages
12 2marks With Ans
No ratings yet
12 2marks With Ans
21 pages
MJD32C 73287
No ratings yet
MJD32C 73287
6 pages
Data Science
No ratings yet
Data Science
10 pages
Class 9 (Chap #4)
No ratings yet
Class 9 (Chap #4)
9 pages
17 SPEAKING TOPICS
No ratings yet
17 SPEAKING TOPICS
5 pages
Budget sheet format
No ratings yet
Budget sheet format
8 pages
Data Science_notes_X
No ratings yet
Data Science_notes_X
3 pages
Water Cooler Trainer
No ratings yet
Water Cooler Trainer
2 pages
Electra Complex
No ratings yet
Electra Complex
2 pages
UNIT 4 Data Science
No ratings yet
UNIT 4 Data Science
7 pages
Data Analytics Short and Focused Answers
No ratings yet
Data Analytics Short and Focused Answers
3 pages
Oscam Server Bak
No ratings yet
Oscam Server Bak
1 page
Chapter No.4 Exercise Solution (Computer)
No ratings yet
Chapter No.4 Exercise Solution (Computer)
8 pages
Crack_Data_Science_Interview_�_1731300339
No ratings yet
Crack_Data_Science_Interview_�_1731300339
132 pages
ds viva
No ratings yet
ds viva
9 pages
Chapter 6_Data science and k nearest neighbour model (PART B)
No ratings yet
Chapter 6_Data science and k nearest neighbour model (PART B)
5 pages
Ds Revision 1
No ratings yet
Ds Revision 1
5 pages
Work Positions Ranking - Methods and Techniques
No ratings yet
Work Positions Ranking - Methods and Techniques
8 pages
viva questions-2024-25
No ratings yet
viva questions-2024-25
3 pages
question bank with answers
No ratings yet
question bank with answers
103 pages
Data Science MCQs Sample Mid2xlsx 2024 11-29-23!19!54
No ratings yet
Data Science MCQs Sample Mid2xlsx 2024 11-29-23!19!54
8 pages
DS
No ratings yet
DS
7 pages
Cost Optimization Vs Cost Reduction
No ratings yet
Cost Optimization Vs Cost Reduction
3 pages
S1509
No ratings yet
S1509
16 pages
ADS_Viva
No ratings yet
ADS_Viva
55 pages
UNIT 1
No ratings yet
UNIT 1
34 pages
ML LAB VIVA
No ratings yet
ML LAB VIVA
6 pages
Introduction To Course & Organizational Behavior (OB) Definition & Concept of Organization Foundation of OB Levels in OB Challenges Faced by OB
No ratings yet
Introduction To Course & Organizational Behavior (OB) Definition & Concept of Organization Foundation of OB Levels in OB Challenges Faced by OB
10 pages
paper
No ratings yet
paper
4 pages
Data Science
No ratings yet
Data Science
14 pages
Ch.4.Data Science X-1
No ratings yet
Ch.4.Data Science X-1
3 pages
data science
No ratings yet
data science
28 pages
Exercise pdf
No ratings yet
Exercise pdf
9 pages
1080p Diy Module v2
No ratings yet
1080p Diy Module v2
4 pages
cls10datascience_24082024_113123
No ratings yet
cls10datascience_24082024_113123
4 pages
Data Sciences Class 10 Notes
100% (2)
Data Sciences Class 10 Notes
3 pages
Data Science Interview Questions
No ratings yet
Data Science Interview Questions
32 pages
Data Science
No ratings yet
Data Science
24 pages
12 2marks With Ans
No ratings yet
12 2marks With Ans
21 pages
Unit 4
No ratings yet
Unit 4
10 pages
Top Data Science Interview Questions and Answers in 2023 PDF
100% (1)
Top Data Science Interview Questions and Answers in 2023 PDF
14 pages
Unit-4
No ratings yet
Unit-4
6 pages
(Thesis) Neide Simões 2013 PDF
No ratings yet
(Thesis) Neide Simões 2013 PDF
164 pages
Data Science Notes
No ratings yet
Data Science Notes
44 pages
Data Science Class X Notes
No ratings yet
Data Science Class X Notes
3 pages
DS MCQ
No ratings yet
DS MCQ
29 pages
25 Important Data Science Interview Questions 1719736087
No ratings yet
25 Important Data Science Interview Questions 1719736087
15 pages
Quiz 4 5 6
No ratings yet
Quiz 4 5 6
11 pages
Model of Human Occupation Frame of Reference: Theoretical Base
No ratings yet
Model of Human Occupation Frame of Reference: Theoretical Base
14 pages
UNIT 4 Data Science Notes
No ratings yet
UNIT 4 Data Science Notes
4 pages
Data Science QnA
No ratings yet
Data Science QnA
15 pages
DA_1733591326
No ratings yet
DA_1733591326
132 pages
CAI Vibration Training
No ratings yet
CAI Vibration Training
119 pages
Data Analytics with Generative AI
From Everand
Data Analytics with Generative AI
Younish P
No ratings yet
Broschuere Solenergy 40s1-60s2 en A2 2014-09 Web Neu
No ratings yet
Broschuere Solenergy 40s1-60s2 en A2 2014-09 Web Neu
4 pages
Asco Long Life Valves Catalog
No ratings yet
Asco Long Life Valves Catalog
4 pages
04kuliah 4bpressure Enthalpy Diagram
No ratings yet
04kuliah 4bpressure Enthalpy Diagram
22 pages
AIL Quiz Loc
No ratings yet
AIL Quiz Loc
33 pages
Role Name Terms of Reference (Duties and Responsibilities)
No ratings yet
Role Name Terms of Reference (Duties and Responsibilities)
5 pages
ML_DS_interview_quetions
No ratings yet
ML_DS_interview_quetions
17 pages
Kenny-230718-Top 70 Microsoft Data Science Interview Questions
No ratings yet
Kenny-230718-Top 70 Microsoft Data Science Interview Questions
17 pages
119 - The Time Terror
No ratings yet
119 - The Time Terror
54 pages
2014 NCEES 8hr Exam Standards
No ratings yet
2014 NCEES 8hr Exam Standards
6 pages
km70 THN 1998 Tabel TTG Pengawakan KPL Niaga PDF
No ratings yet
km70 THN 1998 Tabel TTG Pengawakan KPL Niaga PDF
5 pages
Excelente - MATLAB - Design, Modeling and Evaluation of Protective Relays For Power Systems PDF
100% (2)
Excelente - MATLAB - Design, Modeling and Evaluation of Protective Relays For Power Systems PDF
316 pages
Honda GX 120 Tech Manual
No ratings yet
Honda GX 120 Tech Manual
21 pages
Data Science
100% (1)
Data Science
7 pages
ML Interview
No ratings yet
ML Interview
17 pages
FDS - Unit 1 Question Bank
No ratings yet
FDS - Unit 1 Question Bank
16 pages
Product Brochure: Manufacturers and Stockists of High Pressure Pipeline and Drilling Equipment
No ratings yet
Product Brochure: Manufacturers and Stockists of High Pressure Pipeline and Drilling Equipment
30 pages
Data Science Interview Questions
No ratings yet
Data Science Interview Questions
31 pages
Linder 316 IC Side Loader Forklift Service Manual
No ratings yet
Linder 316 IC Side Loader Forklift Service Manual
142 pages
Exploring the World of Data Science and Machine Learning
From Everand
Exploring the World of Data Science and Machine Learning
NIBEDITA Sahu
No ratings yet
500 Data Science Interview Questions and Answers - Vamsee Puligadda PDF
75% (8)
500 Data Science Interview Questions and Answers - Vamsee Puligadda PDF
141 pages
Data Structures & Algorithms Interview Questions You'll Most Likely Be Asked
From Everand
Data Structures & Algorithms Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
1/5 (1)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Chapter 4 - Data Science

Uploaded by

Chapter 4 - Data Science

Uploaded by

Chapter 4: Data Science

Q1. What is data science?

Q2. What is targeted advertising?

Q3. What is the recommended system?

Q4. How has data science impacted the healthcare field?

i. Improving diagnostic accuracy and efficiency

ii. Turning patient, Care into process medicine

iii. Advancing pharmaceutical research to find cure

Q5. In what ways data science is helpful to the airline industry?

* Predict flight delay

* Decide which class of airplanes to buy

* Whether to directly land at the destination or take a hold in between

Q6. Explain the term Outliers data. Give an example.

Q7. What type of data can be used by pandas?

Ans: Pandas can be used for the following:

* Ordered and unordered time series data

* Arbitrary matrix data with row and column labels

* Any other form of observation/statistical data sets.

Q8. Why is KNN called a lazy learner algorithm?

* For proper training of AI model, the authenticity of data is must.

* Privacy of data sources should always be kept in mind, as it is a fundamental right of

* Data present in the public domain should preferably be used, if available.

Q10. Explain the box plot graph.

Array is a collection of homogenous values. List is a collection of heterogeneous values.

Q12. What are pandas used for?

Q13. What is erroneous data? Explain its two types.

The two types of erroneous data are:

Q14. What are packages in Python?

Spreadsheet: A spreadsheet is a piece of paper or a computer program which is used for

SQL: structured query language is a domain specific programming language used in

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.