
ADHIYAMAAN COLLEGE OF ENGINEERING
(Autonomous)
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

818CIT01 – BIG DATA ANALYTICS

ASSIGNMENT - I

Submitted by:
ILANTHENRAL V
6176AC21UCS046
IV – CSE – A
1. Retail Sales Analysis using Data Warehousing
Retail businesses generate an enormous amount of transactional data every day. This data
is crucial for analyzing sales trends, managing inventory, and understanding customer
behavior. A data warehouse provides an efficient way to store, retrieve, and analyze this
data for better decision-making.
i. Data Warehouse Schema Design
A star schema is the most widely used design for retail sales analysis because of its
simplicity and query efficiency. It consists of a central fact table that contains sales
transaction data, surrounded by dimension tables that provide additional details about
products, customers, stores, and time. A SQL sketch of this schema is given after the
dimension tables below.
Fact Table: Sales_Fact

Column Name      Data Type   Description
sale_id          INT (PK)    Unique identifier for each sale
date_id          INT (FK)    Foreign key to Date dimension
product_id       INT (FK)    Foreign key to Product dimension
store_id         INT (FK)    Foreign key to Store dimension
customer_id      INT (FK)    Foreign key to Customer dimension
quantity_sold    INT         Number of units sold
total_sales      DECIMAL     Total revenue from the sale
discount         DECIMAL     Discount applied
Dimension Tables:
1. Date Dimension (Date_Dim): Contains fields such as date_id, date, month, year,
quarter, day_of_week.
2. Product Dimension (Product_Dim): Holds product-related details such as
product_id, product_name, category, brand, price, and supplier.
3. Store Dimension (Store_Dim): Includes store_id, store_name, location, region,
store_type, and manager.
4. Customer Dimension (Customer_Dim): Consists of customer_id, customer_name,
age, gender, loyalty_status, purchase_frequency.
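
The star schema above can be expressed directly as SQL DDL. The following is a minimal
sketch, assuming an embedded SQLite database named retail_dw.db; only two dimension
tables are shown, and Store_Dim and Customer_Dim would follow the same pattern.

import sqlite3

# Illustrative sketch: create part of the star schema in an embedded SQLite database.
# Table and column names follow the schema described above.
conn = sqlite3.connect("retail_dw.db")
cur = conn.cursor()

cur.executescript("""
CREATE TABLE IF NOT EXISTS Date_Dim (
    date_id      INTEGER PRIMARY KEY,
    date         TEXT,
    month        INTEGER,
    year         INTEGER,
    quarter      INTEGER,
    day_of_week  TEXT
);

CREATE TABLE IF NOT EXISTS Product_Dim (
    product_id   INTEGER PRIMARY KEY,
    product_name TEXT,
    category     TEXT,
    brand        TEXT,
    price        REAL,
    supplier     TEXT
);

CREATE TABLE IF NOT EXISTS Sales_Fact (
    sale_id       INTEGER PRIMARY KEY,
    date_id       INTEGER REFERENCES Date_Dim(date_id),
    product_id    INTEGER REFERENCES Product_Dim(product_id),
    store_id      INTEGER,   -- FK to Store_Dim (not shown here)
    customer_id   INTEGER,   -- FK to Customer_Dim (not shown here)
    quantity_sold INTEGER,
    total_sales   REAL,
    discount      REAL
);
""")
conn.commit()
conn.close()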
ii. Data Preprocessing and Transformation
Before data is stored in the warehouse, it needs to be cleaned and transformed for
accurate reporting and analysis. The ETL (Extract, Transform, Load) process ensures that
data is collected from various sources, transformed into a consistent format, and loaded
into the warehouse.
Steps in Data Preprocessing (a pandas sketch of these steps follows the list):
1. Data Extraction:
o Collects sales transactions, customer information, and product details from
multiple sources such as POS systems, online stores, and CRM systems.
2. Data Cleaning:
o Handles missing values, removes duplicate records, and corrects
inconsistencies in product names, dates, and prices.
3. Data Normalization:
o Converts different currency formats, standardizes units (e.g., kilograms to
grams), and encodes categorical data (e.g., converting gender as Male=1,
Female=0).
4. Data Aggregation:
o Summarizes data to create new features like total revenue per store,
average basket size, and seasonal sales trends.
5. Data Loading:
o Stores the cleaned and structured data into the data warehouse for future
queries and analysis.
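
The following is a minimal pandas sketch of these ETL steps. The file name
pos_transactions.csv and the column names are assumptions used for illustration only.

import sqlite3

import pandas as pd

# Extract: read raw POS transactions exported as CSV (illustrative file name).
raw = pd.read_csv("pos_transactions.csv", parse_dates=["sale_date"])

# Clean: drop duplicate transactions and rows missing key fields,
# and standardise product names.
clean = (
    raw.drop_duplicates(subset="sale_id")
       .dropna(subset=["product_id", "total_sales"])
       .assign(product_name=lambda df: df["product_name"].str.strip().str.title())
)

# Normalise: encode gender as Male=1 / Female=0, as described above.
clean["gender_code"] = clean["gender"].map({"Male": 1, "Female": 0})

# Aggregate: total revenue per store per month, one example of a derived feature.
revenue_per_store = (
    clean.assign(month=clean["sale_date"].dt.to_period("M"))
         .groupby(["store_id", "month"], as_index=False)["total_sales"].sum()
)

# Load: write the cleaned records into the warehouse (SQLite used as a stand-in).
with sqlite3.connect("retail_dw.db") as conn:
    clean.to_sql("Sales_Fact_staging", conn, if_exists="replace", index=False)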
iii. Sales Analysis Metrics
The data warehouse enables advanced analysis to improve sales performance and
business strategies; a sample aggregation query is shown after the list of metrics below.
Key Metrics for Sales Analysis:
1. Total Revenue per Store, Product, and Region
o Helps in identifying high-performing stores and products.
2. Customer Segmentation Based on Purchase Behavior
o Groups customers into categories such as frequent buyers, occasional
buyers, and inactive customers.
3. Trend Analysis Over Time
o Analyzes how seasonality affects sales (e.g., higher sales in December due
to holiday shopping).
4. Inventory Optimization
o Helps businesses prevent overstocking or stockouts by predicting demand
trends.
5. Profit Margin Analysis
o Identifies products with the highest and lowest profit margins.
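
Most of these metrics reduce to aggregate queries over the star schema. A minimal example
for total revenue and units sold per store and region, assuming the SQLite warehouse
sketched earlier:

import sqlite3

# Illustrative query against the star schema defined above.
query = """
SELECT s.store_name,
       s.region,
       SUM(f.total_sales)   AS total_revenue,
       SUM(f.quantity_sold) AS units_sold
FROM   Sales_Fact f
JOIN   Store_Dim s ON s.store_id = f.store_id
GROUP  BY s.store_name, s.region
ORDER  BY total_revenue DESC;
"""

with sqlite3.connect("retail_dw.db") as conn:
    for row in conn.execute(query):
        print(row)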
iv. Business Benefits of Using Data Warehousing in Retail
1. Improved Decision-Making
• Store managers can use real-time sales reports to optimize staffing and inventory.
2. Personalized Marketing and Promotions
• Based on customer purchase history, businesses can send targeted offers and
promotions to increase sales.
3. Fraud Detection and Prevention
• Abnormal sales patterns (e.g., sudden high refunds or unusual discounts) can be
flagged to prevent fraud.
4. Supply Chain Optimization
• Helps in determining optimal reorder levels and minimizing supply chain
disruptions.

Diagram: Retail Data Warehouse Workflow

Sales Transactions, Customer Profiles, Store Performance
                        |
                        v
        Data Cleaning & Transformation (ETL)
                        |
                        v
                 Data Warehouse
                        |
                        v
Aggregated Reports for Analysis, Inventory & Demand Forecasting
                        |
                        v
      Business Intelligence & Decision-Making

v. Case Study: Walmart’s Use of Data Warehousing


One of the best examples of data warehousing in retail is Walmart.
• Walmart collects over 2.5 petabytes of data per hour from millions of transactions
across its stores.
• The company uses data warehouses and big data analytics to optimize pricing
strategies, predict demand, and manage supply chains efficiently.
• By analyzing weather patterns, Walmart discovered that before hurricanes,
customers buy more flashlights and Pop-Tarts—this helped them stock stores
appropriately and maximize sales.

2. Customer Loyalty Classification using Machine Learning


Customer loyalty is a critical factor for business success. Loyal customers contribute to
repeat sales, brand advocacy, and higher customer lifetime value (CLV). Machine learning
(ML) can help businesses predict whether a customer is loyal or not based on their
purchase behavior, demographics, and engagement history.
i. Data Collection for Customer Loyalty Classification
To build a machine learning model for customer loyalty classification, we need various
data sources.
1. Transactional Data
• Purchase history: Number of purchases, total spending, average spending per
order.
• Discount usage: Whether customers frequently use discount coupons or loyalty
points.
• Purchase frequency: How often the customer makes purchases (e.g., weekly,
monthly, yearly).
2. Customer Demographics
• Age, gender, location, income level.
• Customer type: New, returning, VIP.
3. Behavioral Data
• Time since last purchase (Recency).
• Product categories purchased (e.g., electronics vs. groceries).
• Website behavior: Time spent browsing, abandoned carts.
• Engagement metrics: Email open rates, responses to promotions.
ii. Feature Engineering
Feature engineering involves creating new variables that improve model performance; a pandas sketch of these features follows this subsection.
1. Important Features for Classification:
• purchase_frequency = total_orders / months_active
• avg_spent_per_order = total_spent / total_orders
• recency = days_since_last_purchase
• discount_usage_rate = total_discount_used / total_orders
• loyalty_score = weighted_sum(purchase_frequency, avg_spent_per_order, recency,
discount_usage_rate)
2. Feature Scaling and Encoding:
• Numerical features (e.g., total_spent, avg_spent_per_order) are normalized using
Min-Max Scaling.
• Categorical features (e.g., customer type, location) are one-hot encoded.
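
A minimal pandas/scikit-learn sketch of the feature engineering above. The file name
customers.csv, the raw column names, and the loyalty-score weights are illustrative
assumptions.

import pandas as pd
from sklearn.preprocessing import MinMaxScaler

customers = pd.read_csv("customers.csv")

# Derived features, as defined above.
customers["purchase_frequency"] = customers["total_orders"] / customers["months_active"]
customers["avg_spent_per_order"] = customers["total_spent"] / customers["total_orders"]
customers["recency"] = customers["days_since_last_purchase"]
customers["discount_usage_rate"] = customers["total_discount_used"] / customers["total_orders"]

# Min-Max scaling of numerical features.
num_cols = ["total_spent", "avg_spent_per_order", "purchase_frequency",
            "recency", "discount_usage_rate"]
customers[num_cols] = MinMaxScaler().fit_transform(customers[num_cols])

# One-hot encoding of categorical features.
customers = pd.get_dummies(customers, columns=["customer_type", "location"])

# A simple weighted loyalty score; the weights below are arbitrary placeholders.
customers["loyalty_score"] = (
    0.4 * customers["purchase_frequency"]
    + 0.3 * customers["avg_spent_per_order"]
    + 0.2 * (1 - customers["recency"])          # more recent purchases -> higher score
    + 0.1 * customers["discount_usage_rate"]
)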
iii. Machine Learning Model Selection
Different machine learning algorithms can be used to classify customers as loyal or not
loyal.
1. Logistic Regression (Baseline Model)
• Predicts the probability of a customer being loyal based on their transaction
history.
• Works well for interpretable models but may not capture complex relationships.
2. Decision Trees & Random Forest
• Decision Trees split customers into different categories based on spending and
behavior.
• Random Forest improves accuracy by combining multiple trees to reduce
overfitting.
3. XGBoost (Extreme Gradient Boosting)
• A powerful model that handles large datasets efficiently.
• Often used in real-world classification problems due to high accuracy.
4. Neural Networks (Deep Learning)
• Useful when there are complex customer interactions to analyze.
• Requires large amounts of data for training.
iv. Model Training and Evaluation
1. Train-Test Split
• Dataset is divided into 80% training and 20% testing for model evaluation.
2. Performance Metrics
• Accuracy: Percentage of correctly classified customers.
• Precision: How many predicted loyal customers were actually loyal.
• Recall: How many actual loyal customers were correctly identified.
• F1-Score: Balances precision and recall.
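
A minimal training and evaluation sketch with scikit-learn. It assumes the engineered
features have been saved to customer_features.csv with a 0/1 label column is_loyal (both
illustrative names); a Random Forest is used here as one of the candidate models listed
above.

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

data = pd.read_csv("customer_features.csv")
X = data.drop(columns=["is_loyal"])
y = data["is_loyal"]

# 80% training / 20% testing split, as described above.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

model = RandomForestClassifier(n_estimators=200, random_state=42)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

print("Accuracy :", accuracy_score(y_test, y_pred))
print("Precision:", precision_score(y_test, y_pred))
print("Recall   :", recall_score(y_test, y_pred))
print("F1-score :", f1_score(y_test, y_pred))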

v. Business Impact of Customer Loyalty Prediction


1. Personalized Marketing Campaigns
• Predict which customers are at risk of churning and offer targeted discounts.
2. Customer Retention Strategies
• Identify customers who frequently shop but may leave soon and send personalized
offers.
3. Increase Revenue through Segmentation
• Loyal customers can be offered premium products, while non-loyal customers can
be given incentives to stay.
4. Reduce Customer Acquisition Costs
• Retaining existing customers is 5X cheaper than acquiring new ones. ML models
help optimize customer engagement strategies.

vi. Case Study: Amazon's Customer Loyalty Prediction


Amazon uses AI-driven customer segmentation to classify shoppers based on their buying
patterns, browsing history, and engagement levels.
• Customers with high engagement but low purchases receive personalized
discounts.
• Customers with high-value purchases get exclusive offers.
• Inactive customers are targeted with email campaigns and limited-time deals.

3. Predicting Patient Outcomes using Machine Learning
Predicting patient outcomes is one of the most impactful applications of machine learning
in healthcare. Hospitals and healthcare organizations use predictive models to assess
recovery chances, readmission risks, mortality rates, and disease progression. By analyzing
historical patient data, ML models can assist doctors in early diagnosis, treatment
planning, and improving patient care.

i. Data Collection for Patient Outcome Prediction


To develop an accurate predictive model, we need diverse patient data sources.
1. Demographic Data
• Age, Gender, Ethnicity (Certain conditions affect different populations differently).
• Socioeconomic Factors (Income level, access to healthcare, diet, lifestyle).
2. Clinical Data
• Vital signs (Heart rate, blood pressure, oxygen levels).
• Medical history (Chronic diseases, past surgeries, allergies).
• Lab test results (Blood sugar, cholesterol, WBC count).
• Imaging data (X-rays, MRIs, CT scans).
3. Treatment Data
• Medication prescribed (Dosage, frequency, effectiveness).
• Surgical procedures performed (Post-surgery recovery rates).
• Length of hospital stay (Long stays indicate severe cases).
4. Outcome Labels
• Recovery (Did the patient recover fully or partially?).
• Readmission Risk (Did the patient return to the hospital within 30 days?).
• Mortality (Whether the patient survived or not).

ii. Data Preprocessing


Healthcare data is often incomplete and noisy, so preprocessing is essential before training
an ML model. A scikit-learn sketch of these steps follows the list below.
1. Handling Missing Data
• Imputation (Fill missing values using median, mean, or predictive methods).
• Removing incomplete records if too many fields are missing.
2. Normalization of Numerical Features
• Vital signs and lab results are standardized to ensure consistent scaling.
3. Encoding Categorical Variables
• Example: Converting disease types into numerical values using One-Hot Encoding.
4. Feature Selection
• Eliminating irrelevant attributes (e.g., patient names).
• Keeping only the most important predictors of patient outcomes.
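
A minimal scikit-learn sketch of the preprocessing steps above, assuming an illustrative
patient_records.csv file and column names. Median imputation, standard scaling, and
one-hot encoding are combined in a single ColumnTransformer.

import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

patients = pd.read_csv("patient_records.csv")

numeric_cols = ["age", "heart_rate", "blood_pressure", "blood_sugar", "cholesterol"]
categorical_cols = ["gender", "disease_type"]

preprocess = ColumnTransformer([
    # Impute missing vitals/lab values with the median, then standardise them.
    ("num", Pipeline([
        ("impute", SimpleImputer(strategy="median")),
        ("scale", StandardScaler()),
    ]), numeric_cols),
    # One-hot encode categorical variables such as disease type.
    ("cat", OneHotEncoder(handle_unknown="ignore"), categorical_cols),
])

# Irrelevant identifiers such as patient names are dropped (feature selection).
X = preprocess.fit_transform(patients.drop(columns=["patient_name"]))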

iii. Machine Learning Model Selection


Different ML models can predict patient outcomes based on historical data.
1. Logistic Regression (For Binary Predictions: Recovery or Not)
• Used for classifying whether a patient will recover or not based on input
parameters.
• Works well for interpretable predictions (e.g., effect of age on recovery).
2. Decision Trees & Random Forest
• Decision Trees analyze patient records and classify outcomes.
• Random Forest improves accuracy by combining multiple trees to reduce
overfitting.
3. Support Vector Machines (SVM)
• Works well for smaller datasets.
• Identifies clear separations between patient classes.
4. XGBoost (Best for Large Medical Datasets)
• Used in predicting readmission risk, mortality rates, and recovery chances.
• Highly accurate and efficient for structured healthcare data.
5. Neural Networks (Deep Learning)
• Used for analyzing complex medical data (e.g., MRI scans, X-ray images).
• Requires a large amount of data but provides high accuracy.
iv. Model Training and Evaluation
1. Splitting the Dataset
• Training Set (80%) – Used to train the ML model.
• Test Set (20%) – Used to evaluate model performance.
2. Performance Metrics
• Accuracy – Measures overall prediction correctness.
• Precision & Recall – Important for imbalanced datasets (e.g., rare diseases).
• ROC-AUC Score – Measures how well the model distinguishes between recovery
and non-recovery cases.
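
A minimal sketch of training and evaluating an outcome model. It assumes a preprocessed
feature file patient_features.csv with a 0/1 label column readmitted_within_30_days (both
illustrative names); XGBoost is used, as discussed above, and requires the xgboost package.

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.metrics import precision_score, recall_score, roc_auc_score
from xgboost import XGBClassifier

patients = pd.read_csv("patient_features.csv")
X = patients.drop(columns=["readmitted_within_30_days"])
y = patients["readmitted_within_30_days"]

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

model = XGBClassifier(n_estimators=300, max_depth=4, learning_rate=0.1)
model.fit(X_train, y_train)

y_prob = model.predict_proba(X_test)[:, 1]
y_pred = (y_prob >= 0.5).astype(int)

# ROC-AUC is emphasised because outcome labels such as readmission are usually imbalanced.
print("ROC-AUC  :", roc_auc_score(y_test, y_prob))
print("Precision:", precision_score(y_test, y_pred))
print("Recall   :", recall_score(y_test, y_pred))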

v. Building a Data Pipeline for Patient Data Processing


A data pipeline automates the collection, transformation, and analysis of patient data; a minimal Python sketch follows the step-by-step design below.
Step-by-Step Pipeline Design:
1. Data Ingestion
o Collect patient data from Electronic Health Records (EHR), wearable devices,
and hospital systems.
o Store it in a secure data warehouse (e.g., AWS S3, Google Cloud Storage).
2. Data Cleaning & Preprocessing
o Remove duplicate entries.
o Standardize medical records.
3. Feature Engineering
o Extract key medical indicators from raw data.
o Normalize and encode categorical variables.
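
A minimal Python sketch of the pipeline steps above. The file name ehr_export.csv stands
in for an EHR feed, and the column names are illustrative assumptions; each function
corresponds to one step in the design.

import pandas as pd

def ingest(path: str) -> pd.DataFrame:
    """Data ingestion: load exported patient records (CSV stand-in for an EHR feed)."""
    return pd.read_csv(path)

def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Data cleaning: remove duplicate entries and standardise column names."""
    df = df.drop_duplicates(subset="patient_id")
    df.columns = [c.strip().lower().replace(" ", "_") for c in df.columns]
    return df

def engineer_features(df: pd.DataFrame) -> pd.DataFrame:
    """Feature engineering: derive a simple indicator and encode a categorical column."""
    df["long_stay"] = (df["length_of_stay_days"] > 7).astype(int)
    return pd.get_dummies(df, columns=["gender"])

def run_pipeline(path: str) -> pd.DataFrame:
    return engineer_features(clean(ingest(path)))

if __name__ == "__main__":
    features = run_pipeline("ehr_export.csv")
    print(features.head())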

vi. Business & Clinical Benefits of Patient Outcome Prediction


1. Early Disease Detection
• Predict high-risk conditions like heart attacks, diabetes, and cancer before they
worsen.
2. Optimized Hospital Resource Management
• Hospitals can allocate ICU beds, staff, and equipment based on predictions.

vii. Case Study: Predicting Patient Readmission at a U.S. Hospital


A major U.S. hospital used XGBoost-based ML models to predict patient readmissions after
discharge.
• 20,000+ patient records were analyzed.
• The model identified key risk factors such as age, pre-existing conditions, and past
hospitalization history.
• Results:
o 15% reduction in unnecessary hospital readmissions.
o $2 million in healthcare cost savings.

Conclusion
Big Data Analytics and Machine Learning revolutionize industries like retail and healthcare
by providing actionable insights. Data warehousing enables large-scale sales analysis,
while machine learning enhances customer retention strategies and patient care
predictions. Implementing these technologies leads to better business decisions, improved
patient outcomes, and a more efficient data-driven future.
