Pandasmohali

Pandas is an open-source Python library for data analysis and manipulation, featuring data structures like Series and DataFrame. A DataFrame is a 2D labeled structure that can store various data types, and users can create it from dictionaries, lists, or CSV files. The document also covers advanced functionalities such as multi-indexing, groupby, merging DataFrames, and handling missing data.

Uploaded by

singhshivam10071996


Beginner Level (Easy)

1. What is Pandas?

Pandas is an open-source data analysis and manipulation library in Python. It provides data
structures like Series (1D) and DataFrame (2D) to handle structured data efficiently.

2. What is a DataFrame in Pandas?

A DataFrame is a 2D labeled data structure, similar to a table in a database or an Excel
spreadsheet, with rows and columns. It can store data of different types.

3. How do you create a Pandas DataFrame?

You can create a DataFrame from a dictionary, a list, or a NumPy array:

python
import pandas as pd
data = {'Name': ['Alice', 'Bob'], 'Age': [25, 30]}
df = pd.DataFrame(data)
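The example above uses a dictionary. As a minimal sketch of the other two inputs mentioned, the following builds a DataFrame from a list of rows and from a NumPy array. The names rows, arr, df_from_list and df_from_array, and the sample values, are illustrative and not part of the original.

```python
import numpy as np
import pandas as pd

# A list of rows: each inner list is one row, so column names are passed separately.
rows = [['Alice', 25], ['Bob', 30]]
df_from_list = pd.DataFrame(rows, columns=['Name', 'Age'])

# A 2D NumPy array: every column starts with the array's single dtype.
arr = np.array([[1, 2], [3, 4]])
df_from_array = pd.DataFrame(arr, columns=['a', 'b'])

print(df_from_list)
print(df_from_array.dtypes)
```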

4. What is the difference between a Series and a DataFrame?

● Series: A one-dimensional array with labeled indices.

● DataFrame: A two-dimensional table with rows and columns, where each column can
have a different data type.
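A short sketch of the distinction, using made-up sample data: a Series has one dimension, a DataFrame has two, and selecting a single column of a DataFrame gives back a Series.

```python
import pandas as pd

# One-dimensional: values plus an index of labels.
s = pd.Series([25, 30], index=['Alice', 'Bob'], name='Age')

# Two-dimensional: columns may have different dtypes.
df = pd.DataFrame({'Name': ['Alice', 'Bob'], 'Age': [25, 30]})

print(s.ndim, df.ndim)      # 1 2
print(type(df['Age']))      # a single column is a Series
print(df.dtypes)            # one dtype per column
```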

5. How do you read data from a CSV file into a DataFrame?

You can use the read_csv() function:

python
df = pd.read_csv('file.csv')

🟡 Intermediate Level

6. How do you select a column from a DataFrame?

You can select a column by its name:

python
df['column_name']

7. What is the purpose of iloc and loc?

● iloc[]: Used to select rows and columns by integer position.

● loc[]: Used to select rows and columns by label.
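The two accessors can be compared side by side. This is a sketch with a hypothetical three-row frame. One difference worth knowing is that an iloc slice excludes its end position, while a loc slice includes its end label.

```python
import pandas as pd

df = pd.DataFrame(
    {'Age': [25, 30, 35]},
    index=['alice', 'bob', 'carol'],
)

by_position = df.iloc[0:2]              # positions 0 and 1; end position excluded
by_label = df.loc['alice':'bob']        # labels alice through bob; end label included
single_value = df.loc['carol', 'Age']   # row label, then column label
same_value = df.iloc[2, 0]              # row position, then column position
```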

8. How do you handle missing data in Pandas?

You can handle missing data using:

● df.isna() to check for NaN values.

● df.fillna(value) to fill missing values.

● df.dropna() to remove rows with NaN values.
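The three methods listed above might be combined as follows. The frame and the fill values are made up for illustration.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'a': [1.0, np.nan, 3.0], 'b': ['x', 'y', None]})

missing_per_column = df.isna().sum()             # count of missing values in each column
filled = df.fillna({'a': 0.0, 'b': 'unknown'})   # a different fill value per column
dropped = df.dropna()                            # keeps only rows with no missing values
```

None of these calls modifies df; each returns a new object.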

9. How do you filter rows in a DataFrame based on a condition?

You can filter rows using boolean indexing:

python
filtered_df = df[df['column_name'] > 10]

10. What is the difference between apply() and map()?

● apply(): Used to apply a function along a DataFrame axis (rows or columns). On a Series, it applies the function to each value.

● map(): Used to map a function or dictionary to individual elements in a Series.
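A minimal sketch of both methods on a small, hypothetical frame:

```python
import pandas as pd

df = pd.DataFrame({'a': [1, 2, 3], 'b': [10, 20, 30]})

# apply() on a DataFrame: the function receives a whole column (axis=0) or row (axis=1).
col_ranges = df.apply(lambda col: col.max() - col.min())
row_sums = df.apply(lambda row: row['a'] + row['b'], axis=1)

# map() on a Series: element-wise lookup or function call.
labels = df['a'].map({1: 'one', 2: 'two'})   # values missing from the dict become NaN
doubled = df['a'].map(lambda x: x * 2)
```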

11. How do you sort a DataFrame by a column?

You can use the sort_values() method:

python
df.sort_values(by='column_name', ascending=False)

12. How do you add a new column to an existing DataFrame?

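You can add a new column by assigning to a new column name. A list must contain exactly one value per row:

python
# value1, value2, value3 are placeholders; the list length must equal len(df)
df['new_column'] = [value1, value2, value3]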

🔵 Advanced Level

13. What are multi-indexes in Pandas, and why are they used?

Multi-indexes allow you to work with higher-dimensional data in a 2D DataFrame, making it
easier to handle hierarchical data. You can create a multi-index using set_index() or
pd.MultiIndex.
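As a sketch of the set_index() route, the example below turns two columns into a two-level row index. The city, year and sales values are invented for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    'city': ['Delhi', 'Delhi', 'Mohali', 'Mohali'],
    'year': [2023, 2024, 2023, 2024],
    'sales': [100, 120, 80, 90],
})

# Passing a list of columns builds a hierarchical (multi-level) index.
indexed = df.set_index(['city', 'year'])

delhi = indexed.loc['Delhi']                         # every row under the outer label
one_cell = indexed.loc[('Mohali', 2024), 'sales']    # a full key is a tuple of labels
```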

14. What is the purpose of groupby() in Pandas?

The groupby() function is used to group data based on one or more columns and then apply
aggregation or transformation functions to each group.
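The difference between aggregating and transforming can be sketched like this, with hypothetical team and score columns:

```python
import pandas as pd

df = pd.DataFrame({
    'team': ['A', 'A', 'B', 'B'],
    'score': [10, 20, 30, 50],
})

# Aggregation: the result has one row per group.
totals = df.groupby('team')['score'].sum()
summary = df.groupby('team')['score'].agg(['mean', 'max'])

# Transformation: the result has the same length as the original frame.
df['team_mean'] = df.groupby('team')['score'].transform('mean')
```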

15. How do you merge/join DataFrames in Pandas?

You can merge DataFrames using the merge() function:

python
merged_df = pd.merge(df1, df2, on='common_column', how='inner')

Common join types are inner, outer, left, and right.
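To see how the join type changes the result, here is a sketch with two small, made-up frames that share a key column:

```python
import pandas as pd

df1 = pd.DataFrame({'key': [1, 2, 3], 'left_val': ['a', 'b', 'c']})
df2 = pd.DataFrame({'key': [2, 3, 4], 'right_val': ['x', 'y', 'z']})

inner = pd.merge(df1, df2, on='key', how='inner')   # keys present in both frames
left = pd.merge(df1, df2, on='key', how='left')     # every key from df1
outer = pd.merge(df1, df2, on='key', how='outer')   # keys from either frame
```

Rows without a match on the other side get NaN in the columns that come from that side.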

16. What is vectorized computation in Pandas?

Vectorized computation refers to performing operations on entire columns or rows without
explicit loops. Pandas uses this approach for efficient computation, e.g.,
df['column_name'] + 10.
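The contrast with an explicit loop can be sketched as follows. The price and qty columns are hypothetical. Both approaches give the same numbers, but the vectorized form is one expression over whole columns.

```python
import pandas as pd

df = pd.DataFrame({'price': [10.0, 20.0, 30.0], 'qty': [1, 2, 3]})

# Explicit loop: Python-level iteration over each row.
looped = [row.price * row.qty for row in df.itertuples()]

# Vectorized: the multiplication is applied to entire columns at once.
df['total'] = df['price'] * df['qty']
```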

17. What is the difference between concat() and append()?

● concat(): Used to concatenate DataFrames along a particular axis (rows or columns).

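● append(): Added rows to a DataFrame, but was less efficient than concat(). DataFrame.append() was deprecated in pandas 1.4 and removed in pandas 2.0, so use concat() in current code.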
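A minimal sketch of concat() along both axes, using small illustrative frames:

```python
import pandas as pd

df1 = pd.DataFrame({'a': [1, 2]})
df2 = pd.DataFrame({'a': [3, 4]})
df3 = pd.DataFrame({'b': [5, 6]})

# axis=0 (default): rows are stacked; ignore_index renumbers the result from 0.
stacked = pd.concat([df1, df2], ignore_index=True)

# axis=1: frames are placed next to each other, aligned on the row index.
side_by_side = pd.concat([df1, df3], axis=1)
```

When adding many pieces, collecting them in a list and calling concat() once avoids copying the data on every step.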

18. How do you pivot a DataFrame?

You can pivot a DataFrame using the pivot() function:

python
df_pivot = df.pivot(index='col1', columns='col2', values='col3')
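A sketch of the call above on made-up long-format data. Note that pivot() raises a ValueError when an index/column pair occurs more than once; in that case pivot_table(), which aggregates duplicates, is the usual alternative.

```python
import pandas as pd

# Long format: one row per (col1, col2) pair.
df = pd.DataFrame({
    'col1': ['r1', 'r1', 'r2', 'r2'],
    'col2': ['x', 'y', 'x', 'y'],
    'col3': [1, 2, 3, 4],
})

# Wide format: col1 values become the index, col2 values become the columns.
df_pivot = df.pivot(index='col1', columns='col2', values='col3')
print(df_pivot)
```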

19. What is the purpose of crosstab() in Pandas?

crosstab() computes a cross-tabulation (contingency table) of two or more variables:

python
pd.crosstab(df['column1'], df['column2'])
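With the illustrative data below, each cell of the result counts how often a pair of values occurs together:

```python
import pandas as pd

df = pd.DataFrame({
    'column1': ['a', 'a', 'b', 'b', 'b'],
    'column2': ['x', 'y', 'x', 'x', 'y'],
})

# Rows come from column1, columns from column2, cells hold pair counts.
table = pd.crosstab(df['column1'], df['column2'])
print(table)
```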

20. How do you optimize memory usage in Pandas?

● Use category dtype for categorical data.

● Downcast numeric columns using pd.to_numeric() with the downcast argument.

● Load only relevant columns with usecols during file reading.
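The three techniques above can be sketched together. The CSV text is an in-memory stand-in for a real file, and its column names are invented for the example.

```python
import io
import pandas as pd

# Hypothetical CSV content; in practice this would be a file path.
csv_text = "id,city,unused\n1,Delhi,foo\n2,Mohali,bar\n3,Delhi,baz\n"

# usecols: the 'unused' column is never loaded.
df = pd.read_csv(io.StringIO(csv_text), usecols=['id', 'city'])

# category: repeated strings are stored once and referenced by integer codes.
df['city'] = df['city'].astype('category')

# downcast: pick the smallest integer type that can hold the values.
df['id'] = pd.to_numeric(df['id'], downcast='integer')

print(df.dtypes)
print(df.memory_usage(deep=True))
```

df.memory_usage(deep=True) is a convenient way to check whether a change actually reduced the footprint.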

21. How do you perform time series analysis in Pandas?

You can use pd.to_datetime() to convert a column to datetime type, and use time-based
indexing and resampling:

python
df['date'] = pd.to_datetime(df['date'])
df.set_index('date', inplace=True)
df.resample('D').sum() # Resample data by day
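A self-contained version of the same steps, with made-up timestamps, shows both resampling and time-based selection:

```python
import pandas as pd

df = pd.DataFrame({
    'date': ['2024-01-01 09:00', '2024-01-01 15:00', '2024-01-02 10:00'],
    'value': [1, 2, 3],
})
df['date'] = pd.to_datetime(df['date'])
df = df.set_index('date')

daily = df.resample('D').sum()     # one row per calendar day
jan_first = df.loc['2024-01-01']   # a date string selects every row on that day
```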

22. What is query() in Pandas?

The query() function allows you to filter data using a string expression:

python
df.query('column_name > 10')
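For comparison, the sketch below writes the same filter once with query() and once with a boolean mask. The second column, other, is hypothetical.

```python
import pandas as pd

df = pd.DataFrame({'column_name': [5, 15, 25], 'other': [1, 2, 3]})

# String expression: column names are used directly, conditions joined with and/or.
via_query = df.query('column_name > 10 and other < 3')

# Equivalent boolean indexing: each condition needs parentheses and & / |.
via_mask = df[(df['column_name'] > 10) & (df['other'] < 3)]
```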

23. How do you calculate moving averages in Pandas?

You can use the rolling() function to calculate moving averages:

python
df['moving_avg'] = df['column_name'].rolling(window=3).mean()
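On a small illustrative series, the first window - 1 results are NaN because the window is not yet full. Passing min_periods relaxes that requirement.

```python
import pandas as pd

df = pd.DataFrame({'column_name': [1, 2, 3, 4, 5]})

# Full windows only: the first two values are NaN.
df['moving_avg'] = df['column_name'].rolling(window=3).mean()

# min_periods=1: averages whatever is available at the start of the series.
df['moving_avg_partial'] = df['column_name'].rolling(window=3, min_periods=1).mean()
```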

24. How do you handle duplicate rows in a DataFrame?

You can remove duplicates using drop_duplicates():

python
df.drop_duplicates(inplace=True)
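It often helps to inspect duplicates before dropping them. The sketch below uses made-up data and shows duplicated() alongside the subset and keep parameters of drop_duplicates().

```python
import pandas as pd

df = pd.DataFrame({'id': [1, 1, 2, 2], 'val': ['a', 'a', 'b', 'c']})

dup_mask = df.duplicated()                             # True where a full row repeats an earlier one
deduped = df.drop_duplicates()                         # removes fully identical rows
by_id = df.drop_duplicates(subset='id', keep='last')   # one row per id, keeping the last
```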
