
TASK 1 Data - Quality - Analysis

Uploaded by

Wasim Shaikh

Hello,

This is MOHAMMAD WASEEM SHAIKH from the KPMG Data Analytics (Virtual Internship) team. We have reviewed the data sets provided by your company and, during the data quality analysis, found some errors in them.

Data quality analysis is a core phase of the work. Because of the errors in the data sets, we suggest the following mitigations to improve data quality, which will ultimately help us deliver better analytics and results for your company.
• Use the mode (most frequent) birth year for customer records with a missing DOB.
• Assign a uniform placeholder last name to customers whose last name is missing.
• Replace gender ‘U’ by referring to the customer name, and make the gender values consistent.
• For tenure, assign the mean of the available values to the missing fields to keep the data consistent.
• Eliminate blank orders, treating them as fake orders.
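
The mitigations above could be sketched as follows. This is a minimal illustration in plain Python, not the procedure we actually ran. It assumes that rows are dictionaries, that DOB is a 'YYYY-MM-DD' string, and that a "blank order" means a blank online_order value. The lowercase keys, the "Unknown" placeholder, the `transaction_id` field and the sample rows are all hypothetical.

```python
from statistics import mean, mode

# Spellings seen in the gender field, mapped to one consistent label.
# 'U' is left unchanged: resolving it from the customer name would need
# a name reference list, which is outside this sketch.
GENDER_LABELS = {"M": "Male", "Male": "Male", "F": "Female",
                 "Female": "Female", "Femal": "Female", "U": "U"}


def is_blank(value):
    """True for None or an empty / whitespace-only string."""
    return value is None or str(value).strip() == ""


def clean_customers(rows, placeholder_last_name="Unknown"):
    """Return cleaned copies of the customer rows."""
    # Assumption: DOB is stored as a 'YYYY-MM-DD' string.
    birth_years = [int(r["DOB"][:4]) for r in rows if not is_blank(r["DOB"])]
    mode_year = mode(birth_years)
    mean_tenure = mean([r["tenure"] for r in rows if not is_blank(r["tenure"])])

    cleaned = []
    for row in rows:
        row = dict(row)  # copy so the input rows are not modified
        # Missing DOB: fall back to the most frequent birth year.
        row["birth_year"] = mode_year if is_blank(row["DOB"]) else int(row["DOB"][:4])
        if is_blank(row["last_name"]):
            row["last_name"] = placeholder_last_name
        if is_blank(row["tenure"]):
            row["tenure"] = mean_tenure
        row["gender"] = GENDER_LABELS.get(row["gender"], row["gender"])
        cleaned.append(row)
    return cleaned


def drop_blank_orders(transactions):
    """Drop transactions whose online_order value is blank (assumed meaning)."""
    return [t for t in transactions if not is_blank(t["online_order"])]


# Hypothetical sample rows, for illustration only.
customers = [
    {"last_name": "Smith", "gender": "F", "DOB": "1978-03-12", "tenure": 10},
    {"last_name": "", "gender": "Femal", "DOB": "1978-11-02", "tenure": 4},
    {"last_name": "Jones", "gender": "M", "DOB": "", "tenure": None},
    {"last_name": "Lee", "gender": "U", "DOB": "1990-07-30", "tenure": 7},
]
transactions = [
    {"transaction_id": 1, "online_order": True},
    {"transaction_id": 2, "online_order": None},
]

cleaned = clean_customers(customers)
kept = drop_blank_orders(transactions)
print(cleaned[2]["birth_year"], cleaned[2]["tenure"], len(kept))
```

The imputed birth year is written to a separate `birth_year` field so that the original DOB value stays visible for review.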

The following are the details of the errors encountered in the data sets.

Customer Demographic (Total records: 4000)

FIELD NAME      ERRORS
DOB             1 record with value 1843
                87 records blank
last_name       125 records blank
Gender          88 records with gender ‘U’
                Values are inconsistent: M, Male, F, Female, Femal, U
job_title       506 records blank
job_industry    656 records with value ‘N/A’
Default         3317 records with special characters, including null and blank values
Tenure          87 records blank
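
Counts like those in the table above can be produced with a simple per-field profile. The sketch below is one possible way to do it in plain Python, assuming rows are dictionaries. The function names and sample rows are hypothetical, and the sample is not taken from your data.

```python
from collections import Counter


def is_blank(value):
    """True for None or an empty / whitespace-only string."""
    return value is None or str(value).strip() == ""


def profile_field(rows, field):
    """Return (blank_count, Counter of non-blank values) for one field."""
    blanks = sum(1 for r in rows if is_blank(r.get(field)))
    values = Counter(r[field] for r in rows if not is_blank(r.get(field)))
    return blanks, values


# Hypothetical sample rows, for illustration only.
sample = [
    {"gender": "F", "job_title": "Engineer"},
    {"gender": "Femal", "job_title": ""},
    {"gender": "U", "job_title": None},
    {"gender": "Female", "job_title": "Analyst"},
    {"gender": "F", "job_title": "Analyst"},
]

blank_titles, _ = profile_field(sample, "job_title")
_, gender_values = profile_field(sample, "gender")
print(blank_titles)
print(sorted(gender_values.items()))
```

Listing the distinct values of a field, as done here for gender, is what exposes inconsistent spellings such as "Femal" next to "Female".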

Transactions (Total records: 20000, past 3 months)

FIELD NAME                 ERRORS
Online_order               94 records blank
brand                      48 records blank
product_line               48 records blank
product_class              48 records blank
product_size               48 records blank
standard_cost              48 records blank
product_first_sold_date    48 records blank
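
Six product fields show the same blank count, which suggests, but does not prove, that the same transactions are affected in each. A check along the following lines could confirm it. This is a sketch in plain Python, assuming rows are dictionaries keyed by the field names above. The helper name and sample rows are hypothetical.

```python
PRODUCT_FIELDS = ["brand", "product_line", "product_class",
                  "product_size", "standard_cost", "product_first_sold_date"]


def is_blank(value):
    """True for None or an empty / whitespace-only string."""
    return value is None or str(value).strip() == ""


def blank_overlap(rows, fields):
    """Return (rows with any listed field blank, rows with all of them blank)."""
    any_blank = sum(1 for r in rows if any(is_blank(r.get(f)) for f in fields))
    all_blank = sum(1 for r in rows if all(is_blank(r.get(f)) for f in fields))
    return any_blank, all_blank


# Hypothetical sample rows, for illustration only.
complete = {"brand": "A", "product_line": "Road", "product_class": "high",
            "product_size": "large", "standard_cost": 100.0,
            "product_first_sold_date": "2010-01-01"}
all_missing = {f: None for f in PRODUCT_FIELDS}
brand_missing = dict(complete, brand="")

overlap = blank_overlap([complete, all_missing, brand_missing], PRODUCT_FIELDS)
print(overlap)
```

If both numbers are equal on the real data, the blanks all sit in the same transactions and those rows can be handled together.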

Regards,
KPMG (Data Analytics Team)
