Project Report
DATA ANALYTICS REPORT
Pankaj Kumar
Roll Number : 8632

1. INTRODUCTION

The main objective of this project is to analyze and visualize data
using different Python libraries.
The dataset used in this project was collected
from “Cardekho.com”.
The following details of each car are included in the dataset:
1) Car name
2) Year
3) Selling Price
4) Kms Driven
5) Fuel
6) Seller Type
7) Transmission
8) Owner
2. Libraries used :

1) Numpy:
NumPy is a Python library used for working with arrays. It also
has functions for working in the domain of linear algebra and
matrices.
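
As a small illustration of both uses, the sketch below does element-wise array arithmetic and solves a 2x2 linear system. All values are made up for the example and are not taken from the dataset.

```python
import numpy as np

# Hypothetical selling prices and kilometres driven for three cars.
prices = np.array([250000, 400000, 150000])
kms = np.array([50000, 30000, 90000])

# Array arithmetic is applied element by element.
price_per_km = prices / kms

# Linear algebra: solve the system A @ x = b for x.
A = np.array([[2.0, 1.0], [1.0, 3.0]])
b = np.array([3.0, 5.0])
x = np.linalg.solve(A, b)

print(price_per_km)
print(x)
```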

2) Pandas:
Pandas is used for data cleaning and analysis. It is well
suited to handling messy real-world data.

3) Matplotlib:
This library is used for visualizing the analysed data in
the form of graphs and charts.

4) Seaborn :
With the seaborn library, we can easily represent our data on
a plot. We do not need to take care of the internal details;
we just pass our data set to a plotting function such as
relplot(), and it computes and places the values accordingly.
Code :
3. Implementation :
3.1 Exploratory Data Analysis (EDA) –
Exploratory Data Analysis (EDA) is an approach to
analyze the data using visual techniques. It is used to
discover errors and patterns.

- We use many functions for EDA, such as:

df.shape # shows the number of rows and columns
df.info() # shows the range index, number of columns, column labels, column data types and memory usage
df.describe() # provides summary statistics of the numerical columns
df.isnull().sum() # counts the null values in each column
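
A minimal sketch of these calls on a tiny hand-made DataFrame. The rows and the column names are assumptions made for illustration; they are not taken from the Cardekho dataset. Note that shape is an attribute, while info() and describe() are methods and need parentheses.

```python
import pandas as pd

# Hypothetical sample rows; column names are assumed, not copied from the report.
df = pd.DataFrame({
    "name": ["Maruti Swift Dzire VDI", "Hyundai i20 Sportz", "Honda City V MT"],
    "year": [2014, 2017, 2012],
    "selling_price": [450000, 550000, None],  # one missing value on purpose
    "km_driven": [145500, 40000, 120000],
    "fuel": ["Diesel", "Petrol", "Petrol"],
})

rows, cols = df.shape            # attribute: (number of rows, number of columns)
df.info()                        # prints column labels, dtypes, non-null counts, memory usage
summary = df.describe()          # summary statistics for the numerical columns
null_counts = df.isnull().sum()  # number of null values per column

print(rows, cols)
print(summary)
print(null_counts)
```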

Dataset:

Table 1
3.2 Splitting the Columns –

We extract the Brand and Model names from the
“Name” column using a lambda function.

Code :
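
One possible way to do this split is sketched below. It assumes that the first word of “Name” is the brand and the rest is the model; the sample names and the new column names "Brand" and "Model" are illustrative and may differ from the report's own code.

```python
import pandas as pd

# Hypothetical sample of the "Name" column.
df = pd.DataFrame({
    "Name": ["Maruti Swift Dzire VDI", "Hyundai i20 Sportz", "Honda City V MT"],
})

# Assumption: the first word is the brand, everything after it is the model.
df["Brand"] = df["Name"].apply(lambda s: s.split(" ", 1)[0])
df["Model"] = df["Name"].apply(lambda s: s.split(" ", 1)[1] if " " in s else "")

print(df)
```

Brand names made of two words would be cut after the first word under this assumption, so the result should be checked against the actual data.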

Dataset after splitting the columns :

Table 2
3.3 Extracting useful Insights from Data after EDA

1) Top 10 Most selling brands :-
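
A minimal sketch of how such a ranking could be computed, assuming that "most selling" means the brands with the most listings and that a "Brand" column already exists. The sample values are made up.

```python
import pandas as pd

# Hypothetical "Brand" column; the real one would come from the split step.
df = pd.DataFrame({
    "Brand": ["Maruti", "Hyundai", "Maruti", "Honda", "Maruti", "Hyundai", "Tata"],
})

# value_counts() sorts by frequency in descending order; head(10) keeps at most ten brands.
top_brands = df["Brand"].value_counts().head(10)

print(top_brands)
```

The resulting Series can then be passed to a bar chart, for example with top_brands.plot(kind="bar") when Matplotlib is installed.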


Graphical Representation :

2) Least selling brands :-
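
The same counting idea works in the other direction. The sketch below assumes a "Brand" column and uses made-up values; the cut-off of three brands is arbitrary.

```python
import pandas as pd

# Hypothetical "Brand" column.
df = pd.DataFrame({
    "Brand": ["Maruti", "Hyundai", "Maruti", "Honda", "Maruti", "Hyundai", "Tata"],
})

# ascending=True puts the rarest brands first; head(3) keeps the three least frequent.
least_brands = df["Brand"].value_counts(ascending=True).head(3)

print(least_brands)
```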

Graphical Representation :-
3) Graph of Owner :-

- This graph shows the number of cars (on the y-axis)
that have had a first, second or third owner (on the x-axis).
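
The counts behind such a graph could be obtained as sketched here. The column name "Owner" and the category labels are assumptions for illustration and may not match the labels in the dataset.

```python
import pandas as pd

# Hypothetical "Owner" column.
df = pd.DataFrame({
    "Owner": ["First Owner", "Second Owner", "First Owner", "Third Owner", "First Owner"],
})

# Fix the x-axis order explicitly instead of relying on frequency order.
order = ["First Owner", "Second Owner", "Third Owner"]
owner_counts = df["Owner"].value_counts().reindex(order, fill_value=0)

print(owner_counts)
```

Here the index of owner_counts supplies the x-axis categories and the values supply the bar heights on the y-axis.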

4) Graph of the Seller Type :-
