0% found this document useful (0 votes)

12 views3 pages

Tutorial 2 Questions

DORSCON is a color-coded framework used in Singapore to show the current disease situation and provide guidelines on prevention and reduction of infections. It has 4 statuses - Green, Yellow, Orange and Red - depending on severity and spread. DORSCON data would be considered [2] Ordinal data since the statuses have a natural ordering from least to most severe but the distances between statuses are not quantified. A boxplot displays a dataset using 5 statistics: minimum, maximum, median, first and third quartiles. The number of data points between the first and third quartiles amounts to [50] percent of the total number of data displayed.

Uploaded by

clement hung

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views3 pages

Tutorial 2 Questions

Uploaded by

clement hung

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

EE2211 Tutorial 2 (Python coding)

(Data Reading and Visualization, simple data structure)

Question 1:
A Comma Separated Values (CSV) file is a plain text file that contains a list of data. These files are often used for
exchanging data between different applications. Download the file “government-expenditure-on-education.csv”
from https://data.gov.sg/dataset/government-expenditure-on-education. Plot the educational expenditure over the
years. (Hint: you might need “import pandas as pd” and “import matplotlib.pyplot as plt”.)

(Data Reading and Visualization, slightly more complicated data structure)

Question 2:
Download the CSV file from https://data.gov.sg/dataset/annual-motor-vehicle-population-by-vehicle-type.
Extract and plot the number of Omnibuses, Excursion buses and Private buses over the years as shown below.
(Hint: you might need “import pandas as pd” and “import matplotlib.pyplot as plt”.)

(Data Reading and Visualization, distribution)

Question 3:
The “iris” flower data set consists of measurements such as the length, width of the petals, and the length, width of the
sepals, all measured in centimeters, associated with each iris flower. Get the data set “from sklearn.datasets import
load_iris” and do a scatter plot as shown below. (Hint: you might need “from pandas.plotting import
scatter_matrix”)
(Data Wrangling/Normalization)
Question 4:
You are given a set of data for supervised learning. A sample block of data looks like this:
“ 1.2234, 0.3302, 123.50, 0.0081, 30033.81, 1
1.3456, 0.3208, 113.24, 0.0067, 29283.18, -1
0.9988, 0.2326, 133.45, 0.0093, 36034.33, 1
1.1858, 0.4301, 128.55, 0.0077, 34037.35, 1
1.1533, 0.3853, 116.70, 0.0066, 22033.58, -13
1.2755, 0.3102, 118.30, 0.0098, 30183.65, 1
1.0045, 0.2901, 123.52, 0.0065, 31093.98, -1
1.1131, 0.3912, 113.15, 0.0088, 29033.23, -1 ”
Each row corresponds to a sample data measurement with 5 input features and 1 response.
(a) What kind of undesired effect can you anticipate if this set of raw data is used for learning?
(b) How can the data be preprocessed to handle this issue?

(Missing Data)
Question 5:
The Pima Indians Diabetes Dataset involves predicting the onset of diabetes within 5 years in Pima Indians
given medical details. Download the Pima-Indians-Diabetes data from
https://raw.githubusercontent.com/jbrownlee/Datasets/master/pima-indians-diabetes.data.csv.
It is a binary (2-class) classification problem. The number of observations for each class is not balanced. There are 768 observations with 8 input
variables and 1 output variable. The variable names are as follows:
0. Number of times pregnant.
1. Plasma glucose concentration a 2 hours in an oral glucose tolerance test.
2. Diastolic blood pressure (mm Hg).
3. Triceps skinfold thickness (mm).
4. 2-Hour serum insulin (mu U/ml).
5. Body mass index (weight in kg/(height in m)^2).
6. Diabetes pedigree function.
7. Age (years).
8. Class variable (0 or 1).
(a) Print the summary statistics of this data set.
(b) Count the number of “0” entries in columns [1,2,3,4,5].
(c) Replace these “0” values by “NaN”.
(Hint: you might need the “.describe()” and “.replace(0, numpy.NaN)” functions “from pandas
import read_csv”.)

(In Quiz and Exam format)

Question 6:

Disease Outbreak Response System Condition (DORSCON) in Singapore is a colour-coded framework that shows the
current disease situation. The framework provides us with general guidelines on what needs to be done to prevent and
reduce the impact of infections. There are 4 statuses – Green, Yellow, Orange and Red, depending on the severity and
spread of the disease. Which type of data does DORSCON belong to ?
(1) Categorical; (2) Ordinal; (3) Continuous; (4) Interval
(In Quiz and Exam format)
Question 7:

A boxplot is a standardized way of displaying the dataset based on a five-number summary: the minimum, the
maximum, _BLANK1_, and the first and third quartiles, where the number of data points that fall between the first and
third quartiles amounts to _BLANK2_ percent of the total number of data on display.

Ch-4 Plotting Data Using Matplotlib
No ratings yet
Ch-4 Plotting Data Using Matplotlib
32 pages
Question Bank Class XII IP 065 Long Question Answer
No ratings yet
Question Bank Class XII IP 065 Long Question Answer
35 pages
2023 Data Analysis and Visualization Using Python
100% (2)
2023 Data Analysis and Visualization Using Python
9 pages
Worksheet-1 (Python)
No ratings yet
Worksheet-1 (Python)
9 pages
Introduction To MChip Advance Card Application Specifications - Payment
No ratings yet
Introduction To MChip Advance Card Application Specifications - Payment
38 pages
CS3361 Set1
No ratings yet
CS3361 Set1
5 pages
Midterm Review CS 4372
No ratings yet
Midterm Review CS 4372
42 pages
MBA - MRCET - R18 Course Structure and Syllabus
100% (1)
MBA - MRCET - R18 Course Structure and Syllabus
63 pages
CS 3362 FDS
No ratings yet
CS 3362 FDS
53 pages
722 9 5 2011 Review
No ratings yet
722 9 5 2011 Review
101 pages
M.sc. Maths
100% (1)
M.sc. Maths
23 pages
CS3362 Data Science Laboratory Manual 2022-23
No ratings yet
CS3362 Data Science Laboratory Manual 2022-23
54 pages
Nptel Assignment Answers
No ratings yet
Nptel Assignment Answers
52 pages
cs3362 Foundations of Data Science Lab Manual
No ratings yet
cs3362 Foundations of Data Science Lab Manual
53 pages
Tutorial2 Q&A
No ratings yet
Tutorial2 Q&A
5 pages
cs3362 Foundations of Data Science Lab Manual
75% (8)
cs3362 Foundations of Data Science Lab Manual
53 pages
Data Science Laboratory
No ratings yet
Data Science Laboratory
40 pages
Algorithms
100% (1)
Algorithms
47 pages
Manishadav
No ratings yet
Manishadav
27 pages
GE Practical Sem 2
No ratings yet
GE Practical Sem 2
28 pages
21hcs4108 Davpracticals
No ratings yet
21hcs4108 Davpracticals
29 pages
Blessing Laptop 29 September 2022
No ratings yet
Blessing Laptop 29 September 2022
164 pages
23HCS4142 PDF
No ratings yet
23HCS4142 PDF
24 pages
Question 1
No ratings yet
Question 1
25 pages
Sem1 Module2 Unit1 Graphs
No ratings yet
Sem1 Module2 Unit1 Graphs
23 pages
Lab Manual
No ratings yet
Lab Manual
19 pages
IP Program File
No ratings yet
IP Program File
21 pages
DS Slips Solutions Sem 5
No ratings yet
DS Slips Solutions Sem 5
23 pages
DAV Practical File 234003
No ratings yet
DAV Practical File 234003
14 pages
Syadatajveez
No ratings yet
Syadatajveez
21 pages
CS3361 Set2
No ratings yet
CS3361 Set2
12 pages
Python 1
No ratings yet
Python 1
16 pages
CS3361 Set1
No ratings yet
CS3361 Set1
10 pages
FDS Aim Algorithm
No ratings yet
FDS Aim Algorithm
18 pages
CS3361 Set2
No ratings yet
CS3361 Set2
9 pages
Data Project
No ratings yet
Data Project
12 pages
CS3361 Set1
No ratings yet
CS3361 Set1
9 pages
Dav Obe 2021
No ratings yet
Dav Obe 2021
4 pages
Python Programs
No ratings yet
Python Programs
8 pages
CoSc3311 - Udated Slides - Design and Arch
No ratings yet
CoSc3311 - Udated Slides - Design and Arch
52 pages
IPSA Introduction NJ
No ratings yet
IPSA Introduction NJ
13 pages
Function of Management: Jahid Hasan Assistant Professor Department of IPE, SUST
No ratings yet
Function of Management: Jahid Hasan Assistant Professor Department of IPE, SUST
54 pages
Web Development Syllabus Explanation
No ratings yet
Web Development Syllabus Explanation
2 pages
Ip 123 Questions
No ratings yet
Ip 123 Questions
6 pages
Pracfile Program Index XII-C IP 2023-24
No ratings yet
Pracfile Program Index XII-C IP 2023-24
6 pages
"Consumer Buying Behaviour: A Shift From Offline To Online Retail
No ratings yet
"Consumer Buying Behaviour: A Shift From Offline To Online Retail
38 pages
KJSCE - TY EXTC Final Syllabus 3rdjan 2019 - BAS - 4 - Jan - 2018
No ratings yet
KJSCE - TY EXTC Final Syllabus 3rdjan 2019 - BAS - 4 - Jan - 2018
40 pages
Data Sci
No ratings yet
Data Sci
6 pages
Consumer Rbi Guidlines
No ratings yet
Consumer Rbi Guidlines
44 pages
DAV Practicle File
No ratings yet
DAV Practicle File
28 pages
IPU Computer Network File Sem-6
No ratings yet
IPU Computer Network File Sem-6
36 pages
Class 12 IP
No ratings yet
Class 12 IP
4 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Data Science Lab QP
No ratings yet
Data Science Lab QP
4 pages
Model Practical Examination 2024-25 Python Pandas QP
No ratings yet
Model Practical Examination 2024-25 Python Pandas QP
3 pages
Excel Exercise 3.4
0% (1)
Excel Exercise 3.4
2 pages
Practical List 2022-23
100% (1)
Practical List 2022-23
4 pages
Patch Installation Guide Huawei Hardware
No ratings yet
Patch Installation Guide Huawei Hardware
23 pages
Traffic Sign Detection in Weather Conditions: Agothi Vaibhav Anjani Kumar
No ratings yet
Traffic Sign Detection in Weather Conditions: Agothi Vaibhav Anjani Kumar
3 pages
Game Theory 4 5
No ratings yet
Game Theory 4 5
19 pages
Ashutosh Project
No ratings yet
Ashutosh Project
19 pages
Class Property
No ratings yet
Class Property
13 pages
Parallel Testing and Implementing New QC Material
No ratings yet
Parallel Testing and Implementing New QC Material
14 pages
Bartec Tablet Solution PDF
No ratings yet
Bartec Tablet Solution PDF
12 pages
Optical Communication
No ratings yet
Optical Communication
26 pages
M18 FHIWP12 M18 FHIWF12: Original Instructions
No ratings yet
M18 FHIWP12 M18 FHIWF12: Original Instructions
11 pages
CS3361 Set2
No ratings yet
CS3361 Set2
6 pages
End Sem PYQ
No ratings yet
End Sem PYQ
8 pages
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
No ratings yet
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
16 pages
Ip Worksheet 2 - Q'S
No ratings yet
Ip Worksheet 2 - Q'S
7 pages
Computer Engineering (4) :computer Networks: September 2015
No ratings yet
Computer Engineering (4) :computer Networks: September 2015
8 pages
Ip CLSS Xii 2024-25 Hy
No ratings yet
Ip CLSS Xii 2024-25 Hy
14 pages
CS3361 Set1
No ratings yet
CS3361 Set1
5 pages
Practical File Question 28.09.2022
No ratings yet
Practical File Question 28.09.2022
15 pages
Raspberry Pi 15.3W USB-C Power Supply: Published in June 2019 by Raspberry Pi Trading LTD
No ratings yet
Raspberry Pi 15.3W USB-C Power Supply: Published in June 2019 by Raspberry Pi Trading LTD
8 pages
CLS - Xii - Ip - Practical & Project - 2022-23
No ratings yet
CLS - Xii - Ip - Practical & Project - 2022-23
6 pages
Synopsis Online Shopping
No ratings yet
Synopsis Online Shopping
7 pages
Nursery Management System
No ratings yet
Nursery Management System
8 pages
Kendriya Vidyalaya No. 3, Nal, Bikaner SESSION: 2021-22 Unit Test - 1
No ratings yet
Kendriya Vidyalaya No. 3, Nal, Bikaner SESSION: 2021-22 Unit Test - 1
2 pages
FLY - Actuation Systems For Operating Tables - EN
No ratings yet
FLY - Actuation Systems For Operating Tables - EN
4 pages
Practical File (Ip Class Xii) 2024-25
No ratings yet
Practical File (Ip Class Xii) 2024-25
27 pages
NCERT Plotting CH - 4 Ex Solutons
No ratings yet
NCERT Plotting CH - 4 Ex Solutons
6 pages
Database Triggers
100% (4)
Database Triggers
11 pages
Ip Practical 2024
No ratings yet
Ip Practical 2024
12 pages
2024 Fods Ques
No ratings yet
2024 Fods Ques
4 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Ip Sample Paper 2
No ratings yet
Ip Sample Paper 2
6 pages
Officesuite Uc Quick Reference Guide - Mitel 6940 Ip Phone: Activating Your Phone Getting Started
No ratings yet
Officesuite Uc Quick Reference Guide - Mitel 6940 Ip Phone: Activating Your Phone Getting Started
1 page
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Python Machine Learning: Machine Learning Algorithms for Beginners - Data Management and Analytics for Approaching Deep Learning and Neural Networks from Scratch
From Everand
Python Machine Learning: Machine Learning Algorithms for Beginners - Data Management and Analytics for Approaching Deep Learning and Neural Networks from Scratch
Ahmed Ph. Abbasi
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Tutorial 2 Questions

Uploaded by

Tutorial 2 Questions

Uploaded by

EE2211 Tutorial 2 (Python coding)

(Data Reading and Visualization, simple data structure)

(Data Reading and Visualization, slightly more complicated data structure)

(Data Reading and Visualization, distribution)

(In Quiz and Exam format)

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.