Lab 2 Solved

Uploaded by

shahrukhmuhammad480

10/24/24, 8:31 AM Lab 2 Solved

2022F-BSE-014

Lab 2
DATASET PREPARATION WITH EXCEL SPREADSHEET AND DATASET PREPROCESSING AND SCALING TECHNIQUES

OBJECTIVE
Prepare a dataset in Excel by selecting all the relevant features for the given scenario. Load the
designated dataset into the working environment. Check the dataset for missing values and outliers.
Implement Normalization and Standardization techniques to scale the values.

Lab Tasks:

1. Write Python code to load an Excel spreadsheet containing two different sheets and print both of
them.

In [15]: import pandas as pd

df1 = pd.read_excel('Book1.xlsx',sheet_name = 'Sheet1')
print('Sheet 1 \n',df1)
df2 = pd.read_excel('Book1.xlsx',sheet_name = 'Sheet2')
print('Sheet 2 \n',df2)

Sheet 1
Name age
0 person1 20
1 person2 30
2 person3 40
Sheet 2
Name work
0 person3 software
1 person 4 tuitor
2 person5 student

2. Write Python code to generate a pandas DataFrame with 4 columns and 5 rows. Column 1 must
contain index values such as Ali, Amir, Kamran, etc., and Row 1 must contain the subject names.

In [19]: import pandas as pd

df = pd.DataFrame({
    'Math': [85, 90, 80, 70, 85],
    'Science': [75, 88, 85, 65, 90],
    'English': [95, 92, 88, 75, 92],
    'History': [60, 78, 70, 80, 88],
}, index=['Ali', 'Amir', 'Kamran', 'Sara', 'Zain'])
print(df)

Math Science English History

Ali 85 75 95 60
Amir 90 88 92 78
Kamran 80 85 88 70
Sara 70 65 75 80
Zain 85 90 92 88

3. Write Python code to read an Excel spreadsheet and print only the first two columns using a pandas
DataFrame.

In [21]: import pandas as pd

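# usecols takes 0-based column positions: [0, 3] selects the first and
# fourth columns (names and English). The first two columns would be [0, 1].
required_columns = [0, 3]
df = pd.read_excel('Student_Score.xlsx', usecols=required_columns)
print(df)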

Unnamed: 0 English
0 Ali 95
1 Amir 92
2 Kamran 88
3 Sara 75
4 Zain 92

4. Write Python code to skip the first two rows of an Excel spreadsheet and print the output using a
pandas DataFrame.

In [29]: import pandas as pd

df = pd.read_excel('Student_Score.xlsx',skiprows = 2)
print(df)

Amir 90 88 92 78
0 Kamran 80 85 88 70
1 Sara 70 65 75 80
2 Zain 85 90 92 88
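
The `usecols` and `skiprows` parameters used in tasks 3 and 4 can be tried without an Excel file. The sketch below is only an illustration: it swaps `read_excel` for `pd.read_csv`, which accepts the same two parameters, and reads a hypothetical inline copy of the score table (`CSV_TEXT` is an assumption, not the lab's `Student_Score.xlsx`). It shows that `usecols=[0, 1]` keeps the first two columns, and that passing a list to `skiprows` drops specific rows while keeping the header, whereas `skiprows=2` in the cell above also consumed the header line, so Amir's row became the column names.

```python
import io

import pandas as pd

# Hypothetical stand-in for Student_Score.xlsx, inlined as CSV text.
CSV_TEXT = """,Math,Science,English,History
Ali,85,75,95,60
Amir,90,88,92,78
Kamran,80,85,88,70
Sara,70,65,75,80
Zain,85,90,92,88
"""

# usecols=[0, 1] keeps the first two columns by position.
first_two = pd.read_csv(io.StringIO(CSV_TEXT), usecols=[0, 1])

# skiprows=[1, 2] skips file lines 1 and 2 (the first two data rows),
# while line 0 is still read as the header.
skipped = pd.read_csv(io.StringIO(CSV_TEXT), skiprows=[1, 2], index_col=0)

print(first_two)
print(skipped)
```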

5. Write Python code to fill all the null values in the Gender column of employees.csv with “No Gender”.
Print rows 10 to 30 of the DataFrame for visualization.

In [31]: import pandas as pd

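df = pd.read_csv('employee.csv')
# fillna() replaces missing (NaN) entries; assign the result back to keep it
df['Gender'] = df['Gender'].fillna('No Gender')
# prints every row; df.iloc[10:31] would limit the output to rows 10 to 30
print(df)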

Employee_ID Name Age Gender Department

0 1 John Smith 30 Male IT
1 2 Jane Doe 25 Female HR
2 3 Sam Jones 35 Male Marketing
3 4 Emily Ray 28 Female Sales
4 5 Michael 40 No Gender IT
5 6 Sarah Lee 22 Female HR
6 7 Tom Hanks 45 Male Marketing
7 8 Lisa Kim 32 No Gender Sales
8 9 David Lee 29 Male IT
9 10 Nina Patel 26 Female HR
10 11 Mark Fox 50 Male Marketing
11 12 Amy Adams 38 Female Sales
12 13 Ben Price 42 No Gender IT
13 14 Lily Chen 23 Female HR
14 15 Robert Z 36 Male Marketing
15 16 Anna Brown 31 No Gender Sales
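
The two steps the task asks for, filling nulls and printing rows 10 to 30, can be sketched on a frame built in memory. The 40-row frame below is hypothetical (it is not the lab's `employee.csv`); it exists only so that rows 10 to 30 are present.

```python
import numpy as np
import pandas as pd

# Hypothetical 40-row frame; every fourth Gender entry is missing.
genders = ["Male", "Female", "Male", np.nan] * 10
df = pd.DataFrame({
    "Employee_ID": range(1, 41),
    "Gender": genders,
})

# Replace missing values in the Gender column only.
df["Gender"] = df["Gender"].fillna("No Gender")

# .iloc slicing is end-exclusive, so 10:31 selects positions 10 through 30.
subset = df.iloc[10:31]
print(subset)
```

`fillna` is used here rather than `replace` because it targets missing values directly instead of matching a literal string.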

6. Write Python code to scale the values of the features (Age and Salary) using the Min-Max Normalization
technique. Verify your answers by applying the formula mentioned above.

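In [46]: import pandas as pd

import numpy as np
from sklearn import preprocessing
# MinMaxScaler scales each column, so samples must be rows and features
# columns: transpose to get one Age column and one Salary column.
x = np.array([[25.0, 32.0, 45.0, 29.0, 38.0],
              [50000.0, 70000.0, 120000.0, 65000.0, 80000.0]]).T
minmax = preprocessing.MinMaxScaler(feature_range=(0, 1))
print(minmax.fit_transform(x))

[[0.         0.        ]
 [0.35       0.28571429]
 [1.         1.        ]
 [0.2        0.21428571]
 [0.65       0.42857143]]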
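
The task also asks for a check against the formula. The formula itself is not reproduced in this document, so the sketch below assumes the standard min-max definition, x' = (x - min) / (max - min), applied to each feature column separately. It uses plain pandas on the Age and Salary values from the cell above.

```python
import pandas as pd

df = pd.DataFrame({
    "Age": [25.0, 32.0, 45.0, 29.0, 38.0],
    "Salary": [50000.0, 70000.0, 120000.0, 65000.0, 80000.0],
})

# x' = (x - min) / (max - min), computed column by column.
scaled = (df - df.min()) / (df.max() - df.min())
print(scaled.round(4))
```

For example, the Age value 32 maps to (32 - 25) / (45 - 25) = 0.35, and in each column the smallest value maps to 0 and the largest to 1.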

7. Write Python code to scale the values of the features (Age and Salary) using the Standardization technique.
Verify your answers by applying the formula mentioned above.

Age Salary
25 42000
36 50000
30 45000
27 43000
38 51000
42 62000
34 48000

In [50]: import pandas as pd

import numpy as np
from sklearn import preprocessing
df = pd.DataFrame({'Age': [25, 36, 30, 27, 38, 42, 34],
                   'Salary': [42000, 50000, 45000, 43000, 51000, 62000, 48000]})
x = np.array(df)
sd = preprocessing.StandardScaler()
print(sd.fit(x).transform(x))

[[-1.43672117 -1.07039567]
[ 0.50411269 0.20496938]
[-0.55452396 -0.59213377]
[-1.08384229 -0.91097504]
[ 0.85699158 0.36439001]
[ 1.56274934 2.11801696]
[ 0.15123381 -0.11387188]]
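
To verify these values by hand, the sketch below assumes the usual standardization formula, z = (x - mean) / std, since the formula itself is not reproduced in this document. `StandardScaler` uses the population standard deviation, so `ddof=0` is passed to `std()`; pandas defaults to `ddof=1`, which would give slightly different numbers.

```python
import pandas as pd

df = pd.DataFrame({
    "Age": [25, 36, 30, 27, 38, 42, 34],
    "Salary": [42000, 50000, 45000, 43000, 51000, 62000, 48000],
})

# z = (x - mean) / std, with the population standard deviation (ddof=0).
z = (df - df.mean()) / df.std(ddof=0)
print(z.round(8))
```

After scaling, each column has mean 0 and population standard deviation 1.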

8. Given this dictionary, create a DataFrame from it and interpolate the missing values using
backward interpolation. Hint: use interpolate().

dict = {'First Score': [100, 90, np.nan, 95],
        'Second Score': [30, 45, 56, np.nan],
        'Third Score': [np.nan, 40, 80, 98]}

In [6]: import pandas as pd

import numpy as np
# named scores rather than dict to avoid shadowing the built-in type
scores = {'First Score': [100, 90, np.nan, 95],
          'Second Score': [30, 45, 56, np.nan],
          'Third Score': [np.nan, 40, 80, 98]}
df = pd.DataFrame(scores)
# limit_direction='backward' fills gaps working from later rows to earlier ones
print(df.interpolate(method='linear', limit_direction='backward'))

First Score Second Score Third Score

0 100.0 30.0 40.0
1 90.0 45.0 40.0
2 92.5 56.0 80.0
3 95.0 NaN 98.0
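
The direction of interpolation is controlled by the `limit_direction` parameter of `DataFrame.interpolate`. As a small sketch on the same scores, the comparison below runs linear interpolation in both directions (the variable names are illustrative). Forward filling leaves a leading NaN untouched, while backward filling leaves a trailing NaN untouched.

```python
import numpy as np
import pandas as pd

scores = pd.DataFrame({
    "First Score": [100, 90, np.nan, 95],
    "Second Score": [30, 45, 56, np.nan],
    "Third Score": [np.nan, 40, 80, 98],
})

# Forward: gaps are filled from earlier rows; a leading NaN stays NaN.
forward = scores.interpolate(method="linear", limit_direction="forward")

# Backward: gaps are filled from later rows; a trailing NaN stays NaN.
backward = scores.interpolate(method="linear", limit_direction="backward")

print(forward)
print(backward)
```

Passing `limit_direction="both"` would fill the leading and trailing gaps as well.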
