0% found this document useful (0 votes)

2 views10 pages

data handling module

The document is an introduction to data handling using Pandas and NumPy, covering basic operations, data structures, data exploration, manipulation, cleaning, aggregation, and visualization techniques. It includes practical usage examples for creating Series and DataFrames, handling missing data, sorting, filtering, and applying functions. Additionally, it addresses NumPy functionalities such as mathematical operations, statistical functions, and saving/loading data.

Uploaded by

debasish0dutta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views10 pages

data handling module

Uploaded by

debasish0dutta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Introduction to Data Handling with Pandas and

NumPy
Debasish Dutta
July 4, 2024

Contents
1 Pandas 3
1.1 Basic Operations and Data Structures . . . . . . . . . . . . . . . . . . 3
1.1.1 Creating Series and DataFrames . . . . . . . . . . . . . . . . . . 3
1.1.2 Basic Indexing and Slicing . . . . . . . . . . . . . . . . . . . . . 3
1.2 Data Exploration and Manipulation . . . . . . . . . . . . . . . . . . . . 3
1.2.1 Data Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2.2 Handling Missing Data . . . . . . . . . . . . . . . . . . . . . . . 4
1.2.3 Sorting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2.4 Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2.5 Applying Functions . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 Data Cleaning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3.1 Removing Duplicates . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3.2 String Manipulation . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3.3 Changing Data Types . . . . . . . . . . . . . . . . . . . . . . . 5
1.4 Data Aggregation and Grouping . . . . . . . . . . . . . . . . . . . . . . 5
1.4.1 Grouping Data . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.4.2 Aggregation Functions . . . . . . . . . . . . . . . . . . . . . . . 6
1.4.3 Pivot Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.5 Combining DataFrames . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.5.1 Concatenation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.5.2 Merging and Joining . . . . . . . . . . . . . . . . . . . . . . . . 6
1.6 Time Series Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.6.1 Resampling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.6.2 Date/Time Indexing . . . . . . . . . . . . . . . . . . . . . . . . 7
1.7 Visualization with Pandas . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.7.1 Plotting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.8 Exporting Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.8.1 Saving to CSV . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2 NumPy 7
2.1 Basic Operations and Data Structures . . . . . . . . . . . . . . . . . . 7
2.1.1 Creating Arrays . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.1.2 Basic Indexing and Slicing . . . . . . . . . . . . . . . . . . . . . 8
2.2 Mathematical Operations . . . . . . . . . . . . . . . . . . . . . . . . . . 8

1
2.2.1 Element-wise Operations . . . . . . . . . . . . . . . . . . . . . . 8
2.2.2 Matrix Operations . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.3 Statistical Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.3.1 Descriptive Statistics . . . . . . . . . . . . . . . . . . . . . . . . 8
2.4 Linear Algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.4.1 Eigenvalues and Eigenvectors . . . . . . . . . . . . . . . . . . . 9
2.4.2 Solving Linear Equations . . . . . . . . . . . . . . . . . . . . . . 9
2.5 Random Sampling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.5.1 Generating Random Numbers . . . . . . . . . . . . . . . . . . . 9
2.6 Saving and Loading Data . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.6.1 Saving Arrays . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

2
1 Pandas
1.1 Basic Operations and Data Structures
1.1.1 Creating Series and DataFrames
Explanation: A Series is a one-dimensional labeled array, capable of holding any
data type. It can be created from a list, dictionary, or scalar value. DataFrame is a
two-dimensional labeled data structure with columns of potentially different types. It
can be created from a dictionary of lists, a list of dictionaries, or other data structures.
Usage:
import pandas as pd

# Series
s = pd . Series ([1 , 3 , 5 , 7 , 9] , index =[ ’a ’ , ’b ’ , ’c ’ , ’d ’ , ’e ’ ])
print ( s )

# DataFrame from dictionary

data = { ’A ’: [1 , 2 , 3] , ’B ’: [4 , 5 , 6]}
df = pd . DataFrame ( data )
print ( df )

# DataFrame from list of dictionaries

data = [{ ’A ’: 1 , ’B ’: 4} , { ’A ’: 2 , ’B ’: 5} , { ’A ’: 3 , ’B ’: 6}]
df = pd . DataFrame ( data )
print ( df )

1.1.2 Basic Indexing and Slicing

Explanation: Indexing and slicing are used to access specific elements or subsets of
data from Series or DataFrames. This can be done using labels or integer positions.
Usage:
# Series indexing by label
print ( s [ ’a ’ ])

# Series indexing by integer position

print ( s [0])

# DataFrame slicing by column label

print ( df [ ’A ’ ])

# DataFrame slicing by integer position

print ( df . iloc [0 , 1]) # Row 0 , Column 1

# DataFrame slicing by label

print ( df . loc [0 , ’B ’ ]) # Row 0 , Column ’B ’

1.2 Data Exploration and Manipulation

1.2.1 Data Overview
Explanation: Pandas provides methods to get a quick overview of the dataset, such
as the first few rows, summary of the DataFrame, and descriptive statistics.

3
Usage:
# First few rows
print ( df . head () )

# Summary of the DataFrame

print ( df . info () )

# Descriptive statistics
print ( df . describe () )

1.2.2 Handling Missing Data

Explanation: Handling missing data is crucial in data analysis. Pandas provides
methods to detect, remove, or fill missing values.
Usage:
# Detect missing values
print ( df . isna () )

# Drop rows with missing values

df . dropna ( inplace = True )

# Fill missing values

df . fillna (0 , inplace = True )

1.2.3 Sorting
Explanation: Sorting data helps in organizing the data and making it easier to ana-
lyze. Pandas allows sorting by values or index.
Usage:
# Sort by values in column ’A ’
d f_ s o r te d _ by _ v al u e s = df . sort_values ( by = ’A ’)

# Sort by index
df _s or te d_ by _i nd ex = df . sort_index ()

1.2.4 Filtering
Explanation: Filtering data based on conditions allows extracting subsets of data
that meet specific criteria.
Usage:
# Filter rows where column ’A ’ is greater than 1
filtered_df = df [ df [ ’A ’] > 1]

1.2.5 Applying Functions

Explanation: Applying functions to data allows transforming data or performing
operations on it. Pandas provides methods such as apply, applymap, and map.
Usage:

4
# Apply a function to each column
df_applied = df . apply ( lambda x : x * 2)

# Apply a function element - wise

df [ ’A ’] = df [ ’A ’ ]. map ( lambda x : x * 2)

1.3 Data Cleaning

1.3.1 Removing Duplicates
Explanation: Removing duplicate rows from the DataFrame ensures data integrity
and consistency.
Usage:
# Remove duplicate rows
df_no_duplicates = df . drop_duplicates ()

1.3.2 String Manipulation

Explanation: Using string methods to manipulate text data is essential for cleaning
and transforming textual data.
Usage:
# Convert strings to lowercase
df [ ’A ’] = df [ ’A ’ ]. str . lower ()

# Replace substring
df [ ’A ’] = df [ ’A ’ ]. str . replace ( ’ old ’ , ’ new ’)

1.3.3 Changing Data Types

Explanation: Converting data types of DataFrame columns is necessary for ensuring
data types are appropriate for analysis.
Usage:
# Convert column ’A ’ to float
df [ ’A ’] = df [ ’A ’ ]. astype ( ’ float ’)

1.4 Data Aggregation and Grouping

1.4.1 Grouping Data
Explanation: Grouping data by one or more columns and applying aggregation func-
tions helps in summarizing and analyzing data.
Usage:
# Group by column ’A ’ and calculate the sum
grouped_df = df . groupby ( ’A ’) . sum ()

5
1.4.2 Aggregation Functions
Explanation: Aggregation functions can be applied to grouped data to calculate
summary statistics.
Usage:
# Group by column ’A ’ and calculate sum and mean
aggregated_df = df . groupby ( ’A ’) . agg ([ ’ sum ’ , ’ mean ’ ])

1.4.3 Pivot Tables

Explanation: Creating pivot tables allows summarizing data in a matrix format,
which is useful for data analysis and reporting.
Usage:
# Create a pivot table
pivot_table = df . pivot_table ( values = ’B ’ , index = ’A ’ , aggfunc = ’ mean ’)

1.5 Combining DataFrames

1.5.1 Concatenation
Explanation: Concatenating DataFrames along rows or columns combines multiple
DataFrames into one.
Usage:
# Concatenate along columns
concatenated_df = pd . concat ([ df , df ] , axis =1)

# Concatenate along rows

concatenated_df = pd . concat ([ df , df ] , axis =0)

1.5.2 Merging and Joining

Explanation: Merging DataFrames using a key column allows combining data based
on common columns.
Usage:
# Merge DataFrames on column ’A ’
merged_df = df . merge ( df , on = ’A ’)

1.6 Time Series Data

1.6.1 Resampling
Explanation: Resampling time series data involves changing the frequency of the
time series, such as converting daily data to monthly data.
Usage:
# Resample data to monthly frequency and calculate the mean
resampled_df = df . resample ( ’M ’) . mean ()

6
1.6.2 Date/Time Indexing
Explanation: Indexing data by date/time allows performing time series analysis.
Usage:
# Set column ’ date ’ as index
df . set_index ( ’ date ’ , inplace = True )

# Select data for a specific date range

selected_data = df [ ’ 2023 -01 -01 ’: ’ 2023 -12 -31 ’]

1.7 Visualization with Pandas

1.7.1 Plotting
Explanation: Pandas provides built-in plotting methods for quick data visualization.
Usage:
# Line plot
df . plot ( kind = ’ line ’ , x = ’A ’ , y = ’B ’)

# Scatter plot
df . plot ( kind = ’ scatter ’ , x = ’A ’ , y = ’B ’)

# Histogram
df [ ’A ’ ]. plot ( kind = ’ hist ’)

1.8 Exporting Data

1.8.1 Saving to CSV
Explanation: Saving DataFrame to CSV format allows exporting data for use in other
applications.
Usage:
# Save DataFrame to CSV file
df . to_csv ( ’ data . csv ’ , index = False )

2 NumPy
2.1 Basic Operations and Data Structures
2.1.1 Creating Arrays
Explanation: NumPy arrays are used for storing and manipulating data efficiently.
Usage:
import numpy as np

# Create a NumPy array from a list

arr = np . array ([1 , 2 , 3 , 4 , 5])

# Create a NumPy array of zeros

zeros_arr = np . zeros ((3 , 3) )

7
# Create a NumPy array of ones
ones_arr = np . ones ((2 , 2) )

2.1.2 Basic Indexing and Slicing

Explanation: Indexing and slicing NumPy arrays allows accessing specific elements
or subsets of data.
Usage:
# Indexing
print ( arr [0]) # First element

# Slicing
print ( arr [1:4]) # Elements from index 1 to 3

2.2 Mathematical Operations

2.2.1 Element-wise Operations
Explanation: NumPy arrays support element-wise operations, such as addition, sub-
traction, multiplication, and division.
Usage:
# Element - wise addition
result = arr1 + arr2

# Element - wise multiplication

result = arr1 * arr2

2.2.2 Matrix Operations

Explanation: NumPy supports matrix operations, such as matrix multiplication and
dot product.
Usage:
# Matrix multiplication
result = np . matmul ( matrix1 , matrix2 )

# Dot product
result = np . dot ( vector1 , vector2 )

2.3 Statistical Functions

2.3.1 Descriptive Statistics
Explanation: NumPy provides functions for calculating descriptive statistics, such as
mean, median, standard deviation, and variance.
Usage:
# Mean
mean_value = np . mean ( arr )

8
# Median
median_value = np . median ( arr )

# Standard deviation
std_deviation = np . std ( arr )

# Variance
variance = np . var ( arr )

2.4 Linear Algebra

2.4.1 Eigenvalues and Eigenvectors
Explanation: NumPy allows computing eigenvalues and eigenvectors of a square
matrix.
Usage:
# Compute eigenvalues and eigenvectors
eigenvalues , eigenvectors = np . linalg . eig ( matrix )

2.4.2 Solving Linear Equations

Explanation: NumPy provides functions for solving systems of linear equations.
Usage:
# Solve linear equations
solution = np . linalg . solve ( coeff_matrix , const_vector )

2.5 Random Sampling

2.5.1 Generating Random Numbers
Explanation: NumPy allows generating arrays of random numbers from various prob-
ability distributions.
Usage:
# Generate random numbers from uniform distribution
random_numbers = np . random . rand (5)

# Generate random integers

random_integers = np . random . randint (1 , 100 , size =5)

2.6 Saving and Loading Data

2.6.1 Saving Arrays
Explanation: NumPy arrays can be saved to and loaded from binary files.
Usage:
# Save array to binary file
np . save ( ’ array . npy ’ , arr )

# Load array from binary file

loaded_arr = np . load ( ’ array . npy ’)

9
References
[1] Pandas Documentation: Comprehensive official documentation covering instal-
lation, user guide, API reference, and more. Available at https://pandas.pydata.
org/docs/.

[2] NumPy Documentation: Official documentation providing details on instal-

lation, quickstart tutorial, and API reference. Available at https://numpy.org/
doc/.

[3] Jake VanderPlas. Python Data Science Handbook. O’Reilly Media, 2016.

[4] DataCamp Pandas Tutorial: Interactive Pandas tutorial covering es-

sential topics. Access at https://www.datacamp.com/community/tutorials/
pandas-tutorial-dataframe-python.

[5] DataCamp NumPy Tutorial: Interactive NumPy tutorial with examples

and exercises. Access at https://www.datacamp.com/community/tutorials/
python-numpy-tutorial.

[6] GitHub Repositories: Explore GitHub repositories for code examples and
projects using Pandas and NumPy. Example: https://github.com/pandas-dev/
pandas.

[7] Matplotlib: https://matplotlib.org/

[8] Seaborn: https://seaborn.pydata.org/

NumPy and Pandas Tutorial
No ratings yet
NumPy and Pandas Tutorial
8 pages
Python Pandas Tutorial For Beginners
No ratings yet
Python Pandas Tutorial For Beginners
203 pages
Content Pandas Cheat Sheet
No ratings yet
Content Pandas Cheat Sheet
9 pages
Christian Mayer, Lukas Rieger, Kyrylo Kravets - Coffee Break Pandas - 74 Pandas Puzzles To Build Your Pandas Data Science Superpower-Finxter - Com (2020)
No ratings yet
Christian Mayer, Lukas Rieger, Kyrylo Kravets - Coffee Break Pandas - 74 Pandas Puzzles To Build Your Pandas Data Science Superpower-Finxter - Com (2020)
156 pages
Pandas_Notes
No ratings yet
Pandas_Notes
6 pages
04-Data Manipulation With Pandas
No ratings yet
04-Data Manipulation With Pandas
28 pages
04 Getting Started with pandas
No ratings yet
04 Getting Started with pandas
85 pages
python interviews
No ratings yet
python interviews
154 pages
Introduction To Pandas in Data Analytics
No ratings yet
Introduction To Pandas in Data Analytics
12 pages
DAP_3_module
No ratings yet
DAP_3_module
62 pages
Chapter-2 Python Pandas
100% (2)
Chapter-2 Python Pandas
33 pages
Phan1_Pandas_Numpy_Matplotlib
No ratings yet
Phan1_Pandas_Numpy_Matplotlib
158 pages
Panda Python
100% (1)
Panda Python
398 pages
Course_ Introduction to Data Science (SD211105)
No ratings yet
Course_ Introduction to Data Science (SD211105)
10 pages
Pandasguide
No ratings yet
Pandasguide
65 pages
Pandas Powerful
No ratings yet
Pandas Powerful
100 pages
Pandas Notes
No ratings yet
Pandas Notes
3 pages
Python Notes by Prof T
No ratings yet
Python Notes by Prof T
10 pages
Usage of NumPy for Numerical Data in Detail
No ratings yet
Usage of NumPy for Numerical Data in Detail
52 pages
Data Analysis Python Read The Docs Io en Latest
No ratings yet
Data Analysis Python Read The Docs Io en Latest
79 pages
Python Unit IV
No ratings yet
Python Unit IV
12 pages
FDS RECORD-1-4
No ratings yet
FDS RECORD-1-4
18 pages
99c949c0-5910-425f-9ac5-155882800fa5
No ratings yet
99c949c0-5910-425f-9ac5-155882800fa5
36 pages
Pandasguide Readthedocs Io en Latest PDF
No ratings yet
Pandasguide Readthedocs Io en Latest PDF
65 pages
Learneverythingai 1661068200
No ratings yet
Learneverythingai 1661068200
66 pages
Statistics Machine Learning Python Draft
No ratings yet
Statistics Machine Learning Python Draft
173 pages
Pandas: Powerful Python Data Analysis Toolkit: Release 0.10.0
No ratings yet
Pandas: Powerful Python Data Analysis Toolkit: Release 0.10.0
432 pages
unit-3(FODS)
No ratings yet
unit-3(FODS)
34 pages
03 Komatsu GD825 Machine Maintenance PDF
100% (3)
03 Komatsu GD825 Machine Maintenance PDF
50 pages
I.p file
No ratings yet
I.p file
20 pages
Pandas Guide
No ratings yet
Pandas Guide
65 pages
Python CSBS Bhavya Lab Manual
No ratings yet
Python CSBS Bhavya Lab Manual
14 pages
NumPy and Pandas (1)
No ratings yet
NumPy and Pandas (1)
12 pages
Pandas
No ratings yet
Pandas
29 pages
StatisticsMachineLearningPythonDraft PDF
100% (1)
StatisticsMachineLearningPythonDraft PDF
223 pages
Pandas Guide
No ratings yet
Pandas Guide
64 pages
jenisha INTERNSHIP REPORT-2.docx (1)
No ratings yet
jenisha INTERNSHIP REPORT-2.docx (1)
19 pages
Report
No ratings yet
Report
18 pages
FDS Notes Unit-4
No ratings yet
FDS Notes Unit-4
30 pages
1364 English Teacher Interview Questions Answers Guide
No ratings yet
1364 English Teacher Interview Questions Answers Guide
9 pages
Bank of India 2
No ratings yet
Bank of India 2
4 pages
Pandas_Tutorial
No ratings yet
Pandas_Tutorial
9 pages
FDS Module 2 Notes
No ratings yet
FDS Module 2 Notes
24 pages
Learninng Plan
No ratings yet
Learninng Plan
6 pages
Pandas Training Plan
No ratings yet
Pandas Training Plan
5 pages
What is pandas
No ratings yet
What is pandas
9 pages
Lind 18e Chap001 PPT
No ratings yet
Lind 18e Chap001 PPT
20 pages
Dataframe in Pandas - Cheatsheet
No ratings yet
Dataframe in Pandas - Cheatsheet
8 pages
Cheat Sheet: Python For Data Science
No ratings yet
Cheat Sheet: Python For Data Science
4 pages
Time Table IMO Model Course 1.08
100% (1)
Time Table IMO Model Course 1.08
2 pages
0 - Bethune College - IDC Syllabus of All Department - 230810 - 194239
No ratings yet
0 - Bethune College - IDC Syllabus of All Department - 230810 - 194239
5 pages
Pandas
No ratings yet
Pandas
5 pages
Learningthepandaslibrary PDF
100% (1)
Learningthepandaslibrary PDF
233 pages
Commands SQL, Python (BASICS)
No ratings yet
Commands SQL, Python (BASICS)
7 pages
Pandas: Powerful Python Data Analysis Toolkit: Release 0.7.1
No ratings yet
Pandas: Powerful Python Data Analysis Toolkit: Release 0.7.1
283 pages
Data Wrangling With Python and Pandas
No ratings yet
Data Wrangling With Python and Pandas
7 pages
Political Thoughts of Aristotl1
No ratings yet
Political Thoughts of Aristotl1
7 pages
Maxsurf Manual Spainish
No ratings yet
Maxsurf Manual Spainish
144 pages
The Resilience Framework - Organizing For Sustained Viability (PDFDrive)
No ratings yet
The Resilience Framework - Organizing For Sustained Viability (PDFDrive)
273 pages
Paper - II Linguistics
No ratings yet
Paper - II Linguistics
16 pages
Cheat Sheet: Python For Data Science
No ratings yet
Cheat Sheet: Python For Data Science
4 pages
Mercer 3000psi-Test-Stand-IOM-Manual
No ratings yet
Mercer 3000psi-Test-Stand-IOM-Manual
30 pages
Section 6: Ict Applications: Communication Media
No ratings yet
Section 6: Ict Applications: Communication Media
38 pages
Data Analysis Midterms Exam
100% (1)
Data Analysis Midterms Exam
6 pages
Final Exam
No ratings yet
Final Exam
10 pages
The Mathematics of Decisions, Elections, and Games
No ratings yet
The Mathematics of Decisions, Elections, and Games
242 pages
The Real You by DR - Sudipta Rath
100% (2)
The Real You by DR - Sudipta Rath
84 pages
MCM301-FinalTerm-By Rana Abubakar Khan
No ratings yet
MCM301-FinalTerm-By Rana Abubakar Khan
5 pages
Sketching User Experiences
No ratings yet
Sketching User Experiences
2 pages
Training Manual
No ratings yet
Training Manual
49 pages
SOP-04 Preventive Maintenance of DG Sets
100% (1)
SOP-04 Preventive Maintenance of DG Sets
11 pages
Lecture 2. Measuring Tools-Rules and Calipers
No ratings yet
Lecture 2. Measuring Tools-Rules and Calipers
45 pages
FP100 SCHEMATIC Rev.8
No ratings yet
FP100 SCHEMATIC Rev.8
1 page
Pre - DT Report ZBGR - 4331 - TDD
No ratings yet
Pre - DT Report ZBGR - 4331 - TDD
4 pages
5.1 Productivity Engineering and Management Part 1 - BAGULBAGUL
No ratings yet
5.1 Productivity Engineering and Management Part 1 - BAGULBAGUL
28 pages
601PH Install Manual
100% (1)
601PH Install Manual
2 pages
Role Name Terms of Reference (Duties and Responsibilities)
No ratings yet
Role Name Terms of Reference (Duties and Responsibilities)
5 pages
Spectrum Management - A Regulator View & QOS: ELF Extremely Low Frequency 30 HZ - 300 HZ
No ratings yet
Spectrum Management - A Regulator View & QOS: ELF Extremely Low Frequency 30 HZ - 300 HZ
4 pages
Broschuere Solenergy 40s1-60s2 en A2 2014-09 Web Neu
No ratings yet
Broschuere Solenergy 40s1-60s2 en A2 2014-09 Web Neu
4 pages
Case 1
No ratings yet
Case 1
2 pages
Engine Data
No ratings yet
Engine Data
2 pages
Data Empowerment: Harnessing Advanced Mathematical and Statistical Methods for Data Science and Machine Learning
From Everand
Data Empowerment: Harnessing Advanced Mathematical and Statistical Methods for Data Science and Machine Learning
NAGARAJU CHEVURU
No ratings yet
The Linux Terminal for Advanced Users - The Command Line Made Easy: First Edition
From Everand
The Linux Terminal for Advanced Users - The Command Line Made Easy: First Edition
Michael Basler
No ratings yet
Unlocking Statistics for the Social Sciences
From Everand
Unlocking Statistics for the Social Sciences
Norma Sinclair
No ratings yet
Mastering Python Advanced Concepts and Practical Applications
From Everand
Mastering Python Advanced Concepts and Practical Applications
Aissa Younes
No ratings yet
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
From Everand
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
Vladimir Kiselev
No ratings yet
Gray Hat Hacking the Ethical Hacker's
From Everand
Gray Hat Hacking the Ethical Hacker's
Çağatay Şanlı
5/5 (1)
Intrusion Detection Honeypots
From Everand
Intrusion Detection Honeypots
Chris Sanders
3/5 (2)
ChatGPT for Business: Strategies for Success
From Everand
ChatGPT for Business: Strategies for Success
Matthew C. Smith
No ratings yet
Securing ChatGPT: Best Practices for Protecting Sensitive Data in AI Language Models
From Everand
Securing ChatGPT: Best Practices for Protecting Sensitive Data in AI Language Models
Matthew C. Smith
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

data handling module

Uploaded by

data handling module

Uploaded by

Introduction to Data Handling with Pandas and

# DataFrame from dictionary

# DataFrame from list of dictionaries

1.1.2 Basic Indexing and Slicing

# Series indexing by integer position

# DataFrame slicing by column label

# DataFrame slicing by integer position

# DataFrame slicing by label

1.2 Data Exploration and Manipulation

# Summary of the DataFrame

1.2.2 Handling Missing Data

# Drop rows with missing values

# Fill missing values

1.2.5 Applying Functions

# Apply a function element - wise

1.3 Data Cleaning

1.3.2 String Manipulation

1.3.3 Changing Data Types

1.4 Data Aggregation and Grouping

1.4.3 Pivot Tables

1.5 Combining DataFrames

# Concatenate along rows

1.5.2 Merging and Joining

1.6 Time Series Data

# Select data for a specific date range

1.7 Visualization with Pandas

1.8 Exporting Data

# Create a NumPy array from a list

# Create a NumPy array of zeros

2.1.2 Basic Indexing and Slicing

2.2 Mathematical Operations

# Element - wise multiplication

2.2.2 Matrix Operations

2.3 Statistical Functions

2.4 Linear Algebra

2.4.2 Solving Linear Equations

2.5 Random Sampling

# Generate random integers

2.6 Saving and Loading Data

# Load array from binary file

[2] NumPy Documentation: Official documentation providing details on instal-

[4] DataCamp Pandas Tutorial: Interactive Pandas tutorial covering es-

[5] DataCamp NumPy Tutorial: Interactive NumPy tutorial with examples

[7] Matplotlib: https://matplotlib.org/

[8] Seaborn: https://seaborn.pydata.org/

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.