Customer Data Analysis

This document summarizes a presentation on using customer data to improve demand forecasting. It describes the retail transaction dataset used, which contains over 500,000 records. Key attributes include invoices, products, prices, customers and countries. The solution segments customers based on purchase history and uses machine learning to predict which customers will buy certain products. Decision trees achieved the best accuracy on new test data.

Uploaded by

Debashish Mukherjee

Customer data analysis

Presentation for Hackerearth Sigma-Thon 1.0

Demand Forecasting -> FMCG

Problem statement
The goal is to come up with an analytical solution for better demand
forecasting by mining insights from the marketplace, consumer, and
competitor data.
Dataset Used

• The dataset is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 (day/month/year) for a UK-based and registered non-store online retailer. The company mainly sells unique all-occasion gifts. Many customers of the company are wholesalers.
• Data Set Characteristics:
    • Multivariate, Sequential, Time-Series
    • Number of Instances: 541909
    • Area: Business
    • Attribute Characteristics: Integer, Real
    • Number of Attributes: 8
    • Date Donated: 2015-11-06
Dataset information

• InvoiceNo: Invoice number. Nominal, a 6-digit integral number uniquely assigned to each transaction. If this code starts with the letter 'c', it indicates a cancellation.
• StockCode: Product (item) code. Nominal, a 5-digit integral number uniquely assigned to each distinct product.
• Description: Product (item) name. Nominal.
• Quantity: The quantity of each product (item) per transaction. Numeric.
• InvoiceDate: Invoice date and time. Numeric, the day and time when each transaction was generated.
• UnitPrice: Unit price. Numeric, product price per unit in sterling.
• CustomerID: Customer number. Nominal, a 5-digit integral number uniquely assigned to each customer.
• Country: Country name. Nominal, the name of the country where each customer resides.
Our Solution

• We have come up with a model that identifies types of customers based on their purchase history and segments them.
• We then use machine learning to predict each customer's type and suggest products accordingly.
Additional information

• What is Customer Segmentation?

Customer segmentation is a process in which we divide the consumer base of the company into subgroups. We generate the subgroups using specific characteristics so that the company sells more products with less marketing expenditure. Before moving forward, we need to understand the basics. What do we mean by customer base? What do we mean by segment? How do we generate the consumer subgroups? Which characteristics do we consider while segmenting the consumers? Let's answer these questions one by one.
The consumer base of any company consists of two types of consumers:
    • Existing consumers
    • Potential consumers
Generally, we need to categorize our consumer base into subgroups. These subgroups are called segments. We create the groups in such a way that the customers in each subgroup share some characteristics.

• What is STP?

STP stands for Segmentation-Targeting-Positioning. This approach has three stages. The points handled in each stage are as follows:
    • Segmentation: In this stage, we create segments of our customer base using their profile characteristics, also considering the features provided in the preceding figure. Once the segmentation is firm, we move on to the next stage.
    • Targeting: In this stage, marketing teams evaluate the segments and try to understand which kind of product suits which segment(s). The team performs this exercise for each segment and then designs customized products that will attract the customers of one or more segments. The team also selects which product should be offered to which segment.
    • Positioning: This is the last stage of the STP process. In this stage, companies study the market opportunity and what their product offers the customer. The marketing team should come up with a unique selling proposition. The team also tries to understand how a particular segment perceives the products, brand, or service, which helps the company determine how best to position its offering. The marketing and product teams create a value proposition that clearly explains how their offering is better than that of any competitor. Lastly, the company starts its campaign, presenting this value proposition in such a way that the consumer base will be happy with what it is getting.
Data Analysis

The UK accounts for the most transactions (19857).

The fewest transactions were made by countries such as Brazil and RSA (only 1).
After removing duplicate entries and all the cancelled orders, the order amounts are distributed as follows:
[Figure: distribution of order amounts]
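The cleaning step described above can be sketched in plain Python. This is only an illustration, not the code used for the presentation: the sample rows and the function name `order_amounts` are hypothetical, and it assumes that a cancelled order shows up as an invoice number starting with 'c' (in either case) and that an order amount is the sum of Quantity × UnitPrice over an invoice.

```python
from collections import defaultdict

# Hypothetical sample rows: (InvoiceNo, StockCode, Quantity, UnitPrice)
rows = [
    ("536365", "85123A", 6, 2.55),
    ("536365", "85123A", 6, 2.55),   # exact duplicate entry
    ("536365", "71053", 6, 3.39),
    ("C536379", "D", -1, 27.50),     # cancellation (invoice starts with 'C')
    ("536366", "22633", 6, 1.85),
]

def order_amounts(rows):
    """Drop duplicate rows and cancelled invoices, then total each invoice."""
    seen = set()
    totals = defaultdict(float)
    for row in rows:
        invoice, _stock_code, quantity, unit_price = row
        if row in seen:
            continue  # skip exact duplicates
        seen.add(row)
        if invoice.upper().startswith("C"):
            continue  # skip cancellations
        totals[invoice] += quantity * unit_price
    # Round to pence for display
    return {invoice: round(total, 2) for invoice, total in totals.items()}

amounts = order_amounts(rows)
```

The resulting dictionary maps each remaining invoice to its total, which is the quantity whose distribution would then be plotted.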
Next, we grouped the products by the important words in their descriptions and clustered them. We then plotted the silhouette score for each cluster.
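For readers unfamiliar with the silhouette score, the following is a minimal sketch of how it is computed, assuming Euclidean distance and cluster labels that have already been assigned. The function name and the toy points are hypothetical; the presentation does not state which library or distance measure was used.

```python
import math

def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (Euclidean distance)."""
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    scores = []
    for point, label in zip(points, labels):
        own = clusters[label]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster
        # (the point's distance to itself is 0, so divide by len - 1)
        a = sum(math.dist(point, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(math.dist(point, q) for q in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Toy data: two well-separated pairs of points
points = [(0, 0), (0, 1), (10, 0), (10, 1)]
good = silhouette_score(points, [0, 0, 1, 1])  # sensible grouping
bad = silhouette_score(points, [0, 1, 0, 1])   # mixed-up grouping
```

Values close to 1 indicate compact, well-separated clusters, while values near or below 0 suggest that points sit between clusters or have been assigned to the wrong one.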
We also used word clouds to analyse which words are most frequent in each cluster.

Words like 'box' and 'pot' are common to all clusters.
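The word counts behind such word clouds can be sketched as follows. The descriptions and cluster labels are made up for illustration, and the helper names `top_words` and `words_in_all_clusters` are hypothetical.

```python
from collections import Counter

# Hypothetical product descriptions paired with hypothetical cluster labels
descriptions = [
    ("WHITE METAL LANTERN", 0),
    ("RED METAL BOX", 0),
    ("LUNCH BOX CUTLERY", 1),
    ("JAM POT WITH LID", 1),
    ("HERB POT BOX", 1),
]

def word_counts(descriptions):
    """Count how often each lower-cased word occurs in each cluster."""
    counts = {}
    for text, cluster in descriptions:
        counts.setdefault(cluster, Counter()).update(text.lower().split())
    return counts

def top_words(descriptions, n=2):
    """The n most frequent words per cluster, as (word, count) pairs."""
    return {c: counter.most_common(n) for c, counter in word_counts(descriptions).items()}

def words_in_all_clusters(descriptions):
    """Words that occur at least once in every cluster."""
    vocabularies = [set(counter) for counter in word_counts(descriptions).values()]
    return set.intersection(*vocabularies) if vocabularies else set()

frequent = top_words(descriptions)
shared = words_in_all_clusters(descriptions)
```

A word that appears in `shared` does little to distinguish one cluster from another, which is the situation observed above for 'box' and 'pot'.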
Using PCA, we reduced the dimensionality of the dataset. The plot of the amount of variance explained is:
[Figure: variance explained by the principal components]

More than 100 principal components are needed to explain more than 90% of the variance.
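Reading the number of components off such a plot amounts to a cumulative sum. A minimal sketch, assuming the per-component variances are already available (the values below are invented, and `components_needed` is a hypothetical helper name):

```python
def components_needed(variances, threshold=0.90):
    """Smallest number of leading components whose cumulative share
    of the total variance exceeds the threshold."""
    total = sum(variances)
    cumulative = 0.0
    # Principal components are ordered by decreasing variance
    for k, variance in enumerate(sorted(variances, reverse=True), start=1):
        cumulative += variance
        if cumulative / total > threshold:
            return k
    return len(variances)

# Invented per-component variances for illustration
example = [5.0, 3.0, 1.5, 0.5]
needed = components_needed(example)  # cumulative shares: 0.50, 0.80, 0.95, 1.00
```

With the invented values, the first three components are the first to pass the 90% mark.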
We then created the customer segments and checked the variance explained again.
After the segmentation of customers and some hyperparameter tuning, the customers are classified and grouped. The data of the selected customers, who are then labelled, is retained.
This led us to build a machine learning model that can predict which kinds of customers are accustomed to buying, or are most likely to buy, certain goods.
Of all the classifiers, the Decision Tree Classifier gave the best accuracy.

The accuracy obtained by the model when tested on some relatively new data was … which is pretty good.
