0% found this document useful (0 votes)

5 views

Pandas Notes

This document provides notes on data handling and analysis using Pandas, covering key operations such as reading, cleaning, filtering, analyzing, grouping, merging, and exporting data. It includes code examples for each operation, demonstrating how to manipulate DataFrames effectively. The notes serve as a quick reference guide for performing common data tasks in Python with Pandas.

Uploaded by

mehul garje

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Pandas Notes

Uploaded by

mehul garje

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Pandas Notes - Data Handling & Analysis

Reading Data

- Use `pd.read_csv('file.csv')` to read CSV files.

- Use `pd.read_excel('file.xlsx')` to read Excel files.
- Use `df.head()` to view the first 5 rows.
- Use `df.tail()` to view the last 5 rows.
- Use `df.info()` to see data types and non-null counts.
Example:
df = pd.read_csv('data.csv')

Cleaning Data

- Use `df.isnull().sum()` to check missing values.

- Fill missing data: `df.fillna(value)`.
- Drop missing rows: `df.dropna()`.
- Rename columns: `df.rename(columns={'old':'new'})`.
Example:
df['age'].fillna(df['age'].mean(), inplace=True)

Filtering Data

- Single condition: `df[df['age'] > 30]`

- Multiple conditions: `df[(df['age'] > 25) & (df['marks'] > 80)]`
- Equality: `df[df['name'] == 'Bob']`
- `isin()`: `df[df['name'].isin(['Alice', 'David'])]`
- String match: `df[df['name'].str.startswith('A')]`
Example:
df.query('age > 30 and marks < 90')

Analyzing Data

- `df.describe()` gives statistical summary.

- Column stats: `mean()`, `max()`, `min()`, `mode()`.
- `value_counts()` for frequency count.
- `df.groupby('col')['val'].mean()` for grouped mean.
- `df.corr()` for correlation.
Example:
df.groupby('department')['marks'].agg(['min', 'max', 'mean'])

Grouping Data

- Use `groupby()` to group and aggregate.

- Average marks: `df.groupby('department')['marks'].mean()`
- Multiple stats: `agg(['min', 'max'])`
- Group by multiple: `df.groupby(['dept', 'name'])`
Pandas Notes - Data Handling & Analysis
- Reset index: `reset_index()` to flatten result.
Example:
df.groupby('department')['marks'].sum().reset_index()

Merging Data

- `pd.merge(df1, df2, on='id')` for inner join.

- `how='left'`, `'right'`, `'outer'` for other joins.
- Merge on multiple keys: `on=['id', 'name']`.
Example:
pd.merge(students, marks, on='id', how='left')

Exporting Data

- To CSV: `df.to_csv('file.csv', index=False)`

- To Excel: `df.to_excel('file.xlsx', index=False)`
- To JSON: `df.to_json('file.json')`
Example:
df.to_csv('cleaned_data.csv', index=False)

Let Us Python by Yashavant Kanetkar
92% (25)
Let Us Python by Yashavant Kanetkar
429 pages
Data Analytics Using Python
100% (1)
Data Analytics Using Python
982 pages
Python in Excel (2024)
100% (10)
Python in Excel (2024)
607 pages
The Python Bible
97% (31)
The Python Bible
506 pages
Data Visualization in Python Preview PDF
100% (8)
Data Visualization in Python Preview PDF
58 pages
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual PDF
100% (18)
Learning The Pandas Library Python Tools For Data Munging Analysis and Visual PDF
208 pages
The Python Manual
97% (31)
The Python Manual
196 pages
Data Analysis From Scratch With Python - Beginner Guide Using Python, Pandas, NumPy, Scikit-Learn, IPython, TensorFlow and
100% (10)
Data Analysis From Scratch With Python - Beginner Guide Using Python, Pandas, NumPy, Scikit-Learn, IPython, TensorFlow and
104 pages
Python Pandas Tutorial
96% (28)
Python Pandas Tutorial
178 pages
Coffee Break NumPy PDF
100% (5)
Coffee Break NumPy PDF
211 pages
Python Data Science
92% (12)
Python Data Science
65 pages
Object Oriented Python Tutorial
100% (20)
Object Oriented Python Tutorial
111 pages
Universal Data Analytics Algorithm
No ratings yet
Universal Data Analytics Algorithm
51 pages
Exploratory Data Analysis (Eda) With Pandas: (Cheatsheet)
No ratings yet
Exploratory Data Analysis (Eda) With Pandas: (Cheatsheet)
7 pages
exp3 python (1)
No ratings yet
exp3 python (1)
15 pages
Pandas Fuction Notes
No ratings yet
Pandas Fuction Notes
3 pages
1745516832930-Pandas-Handbook
No ratings yet
1745516832930-Pandas-Handbook
33 pages
Pandas_Dataframe_All_Operations_1735471870
No ratings yet
Pandas_Dataframe_All_Operations_1735471870
4 pages
Pandas
No ratings yet
Pandas
12 pages
Data Exploration Preparation
No ratings yet
Data Exploration Preparation
12 pages
Chapter Notes - Data Handling Using Pandas DataFrame
No ratings yet
Chapter Notes - Data Handling Using Pandas DataFrame
16 pages
Pandas PDF(2)
No ratings yet
Pandas PDF(2)
25 pages
Python Data Science 101
100% (1)
Python Data Science 101
41 pages
Important Pandas Operations 1697910759
No ratings yet
Important Pandas Operations 1697910759
6 pages
Pandas
No ratings yet
Pandas
4 pages
Week 2_Data Exploration
No ratings yet
Week 2_Data Exploration
8 pages
Pandas 1
No ratings yet
Pandas 1
2 pages
What is pandas
No ratings yet
What is pandas
9 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
20 pages
Introduction To Pandas
No ratings yet
Introduction To Pandas
27 pages
a5
No ratings yet
a5
28 pages
Dataframe in Pandas - Cheatsheet
No ratings yet
Dataframe in Pandas - Cheatsheet
8 pages
Course_ Introduction to Data Science (SD211105)
No ratings yet
Course_ Introduction to Data Science (SD211105)
10 pages
Pandas
No ratings yet
Pandas
94 pages
DevOps Session 3 Pandas.pptx
No ratings yet
DevOps Session 3 Pandas.pptx
33 pages
Experiment 678910
No ratings yet
Experiment 678910
12 pages
Python & MySQL for Data Analysis
No ratings yet
Python & MySQL for Data Analysis
45 pages
BasicAnalysis Using PYTHON
No ratings yet
BasicAnalysis Using PYTHON
6 pages
Introduction to Pandas Programming 1
No ratings yet
Introduction to Pandas Programming 1
2 pages
EDA with Pandas
No ratings yet
EDA with Pandas
8 pages
Pandas_Presentation
No ratings yet
Pandas_Presentation
10 pages
Code explanation for date types
No ratings yet
Code explanation for date types
8 pages
Pandas 1705297450
No ratings yet
Pandas 1705297450
21 pages
Python Notes by Prof T
No ratings yet
Python Notes by Prof T
10 pages
data analysis
No ratings yet
data analysis
42 pages
Informatics Practices Practical File
No ratings yet
Informatics Practices Practical File
8 pages
L32, 33 Pandas
No ratings yet
L32, 33 Pandas
7 pages
ICT2103 Full Book-Part-3
No ratings yet
ICT2103 Full Book-Part-3
14 pages
Data Analysis Cheat Sheet
No ratings yet
Data Analysis Cheat Sheet
1 page
EDA-Unit2
No ratings yet
EDA-Unit2
99 pages
Data_Analysis_Python
No ratings yet
Data_Analysis_Python
3 pages
CSE445 NSU Week_3
No ratings yet
CSE445 NSU Week_3
48 pages
Pandas
No ratings yet
Pandas
26 pages
Data Analysis CheatSheet
No ratings yet
Data Analysis CheatSheet
2 pages
Comprehensive Guide Data Exploration Sas Using Python Numpy Scipy Matplotlib Pandas
100% (1)
Comprehensive Guide Data Exploration Sas Using Python Numpy Scipy Matplotlib Pandas
12 pages
Data Frame in Panda 01
No ratings yet
Data Frame in Panda 01
9 pages
Unit 4 Pandas
No ratings yet
Unit 4 Pandas
8 pages
EDA With Pandas CheatSheet
No ratings yet
EDA With Pandas CheatSheet
3 pages
Practical
No ratings yet
Practical
12 pages
Pandas_Tutorial
No ratings yet
Pandas_Tutorial
9 pages
IP 12th Chapter 3
No ratings yet
IP 12th Chapter 3
9 pages
Pandas
No ratings yet
Pandas
13 pages
Unit-2 Bda
No ratings yet
Unit-2 Bda
11 pages
FDS Module 2 Notes
No ratings yet
FDS Module 2 Notes
24 pages
Pyspark Basics
No ratings yet
Pyspark Basics
16 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
60 pages
lecture-week2
No ratings yet
lecture-week2
72 pages
Usage of NumPy for Numerical Data in Detail
No ratings yet
Usage of NumPy for Numerical Data in Detail
52 pages
Python For Statistics
No ratings yet
Python For Statistics
40 pages
Pandas Notes
No ratings yet
Pandas Notes
8 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Python Cheat Sheets Compilation
100% (4)
Python Cheat Sheets Compilation
14 pages
Pandas Python For Data Science
100% (1)
Pandas Python For Data Science
1 page
Data Analysis With PANDAS: Cheat Sheet
83% (6)
Data Analysis With PANDAS: Cheat Sheet
4 pages
Data Visualization With Python PDF
93% (14)
Data Visualization With Python PDF
662 pages
Python For Data Science - Cheat Sheets
100% (4)
Python For Data Science - Cheat Sheets
10 pages
Matplotlib Cheat Sheet
100% (6)
Matplotlib Cheat Sheet
8 pages
Mastering Python Data Visualization - Sample Chapter
100% (9)
Mastering Python Data Visualization - Sample Chapter
63 pages
EBOOK - Python Crash Course For Data Analysis
100% (12)
EBOOK - Python Crash Course For Data Analysis
168 pages
Python For Data Science PDF
100% (3)
Python For Data Science PDF
15 pages
Pandas
No ratings yet
Pandas
9 pages
Python For Data Analysis
67% (3)
Python For Data Analysis
39 pages
Python Pandas Tutorial
No ratings yet
Python Pandas Tutorial
45 pages
Python 3 Cheat Sheet v3
100% (4)
Python 3 Cheat Sheet v3
13 pages
Pandas
No ratings yet
Pandas
41 pages
Data Science With Python
100% (3)
Data Science With Python
725 pages
Python Quick Reference Card
94% (17)
Python Quick Reference Card
17 pages
PythonForDataScience Cheatsheet PDF
100% (5)
PythonForDataScience Cheatsheet PDF
21 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Pandas Notes

Uploaded by

Pandas Notes

Uploaded by

Pandas Notes - Data Handling & Analysis

- Use `pd.read_csv('file.csv')` to read CSV files.

- Use `df.isnull().sum()` to check missing values.

- Single condition: `df[df['age'] > 30]`

- `df.describe()` gives statistical summary.

- Use `groupby()` to group and aggregate.

- `pd.merge(df1, df2, on='id')` for inner join.

- To CSV: `df.to_csv('file.csv', index=False)`

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.