SMDM Report

From the given document: - The data consists of annual spending amounts of 440 large retailers on 6 product categories across 3 regions and 2 sales channels in Portugal. - Exploratory data analysis found the data to have no null values, right skewed distributions, and many outliers. Fresh items showed the most consistent behavior while Delicatessen the least. - Based on the analysis, the recommendations are to focus on high demand areas like Fresh items in hotels and Grocery in retail, and to increase stock of in-demand Fresh items in other regions since it has the highest expenditures overall.

Uploaded by

Ruhee's Kitchen

Study -1 Project

A wholesale distributor operating in different regions of Portugal has information on the annual spending on several items in
its stores across different regions and channels. The data consists of 440 large retailers’ annual spending on 6
different varieties of products in 3 different regions (Lisbon, Oporto, Other) and across 2 sales channels (Hotel,
Retail).

Q.1.1: Exploratory data analysis:

Buyer/Spender  Channel  Region  Fresh  Milk  Grocery  Frozen  Detergents_Paper  Delicatessen
1              Retail   Other   12669  9656  7561     214     2674              1338
2              Retail   Other   7057   9810  9568     1762    3293              1776
3              Retail   Other   6353   8808  7684     2405    3516              7844
4              Hotel    Other   13265  1196  4221     6404    507               1788
5              Retail   Other   22615  5410  7198     3915    1777              5185

The table above shows that the data set covers 6 different types of items. Channel and Region are categorical
data and Buyer/Spender is a numeric identifier for each buyer, whereas the remaining six columns are integer data measuring the amount spent on each type of item.

count unique top freq mean std min 25% 50% 75% max
Buyer/Spender 440 NaN NaN NaN 220.5 127.16 1 110.8 221 330.3 440
Channel 440 2 Hotel 298 NaN NaN NaN NaN NaN NaN NaN
Region 440 3 Other 316 NaN NaN NaN NaN NaN NaN NaN
Fresh 440 NaN NaN NaN 12000 12647 3 3128 8504 16934 112151
Milk 440 NaN NaN NaN 5796 7380 55 1533 3627 7190 73498
Grocery 440 NaN NaN NaN 7951 9503 3 2153 4756 10656 92780
Frozen 440 NaN NaN NaN 3072 4855 25 742 1526 3554 60869
Detergents_Paper 440 NaN NaN NaN 2881 4768 3 257 817 3922 40827
Delicatessen 440 NaN NaN NaN 1525 2820 3 408 966 1820 47943

1) The Region column has 3 unique values, of which Other has the most entries (316). The Channel column has 2
unique values, of which Hotel has more entries (298).
2) For all 6 items, the standard deviation is greater than the mean.
3) All 6 items have a significant number of outliers, as indicated by the large gap between the maximum and the 75%
quartile value. The distributions are also right skewed, since for all six items the mean is greater
than the median.
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 440 entries, 0 to 439
Data columns (total 9 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Buyer/Spender 440 non-null bool
1 Channel 440 non-null bool
2 Region 440 non-null bool
3 Fresh 440 non-null bool
4 Milk 440 non-null bool
5 Grocery 440 non-null bool
6 Frozen 440 non-null bool
7 Detergents_Paper 440 non-null bool
8 Delicatessen 440 non-null bool
dtypes: bool(9)
memory usage: 4.0 KB
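From the above output, it can be said that no null values are present in the provided data set. Note that every column is reported as bool, which suggests the summary was produced from a boolean null mask (for example df.isnull()) rather than from the raw data, so its non-null counts are not by themselves proof. The count of 440 for every column in the descriptive statistics above confirms that no values are missing.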
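
A more direct way to check for missing values is to count them per column. The following is a minimal sketch, assuming pandas is available; it uses only the five rows shown in the sample table above, since the full file is not reproduced in this report, and the variable names are illustrative.

```python
import pandas as pd

# The five rows shown in the sample table of Q.1.1 (not the full 440-row file).
df = pd.DataFrame({
    "Buyer/Spender": [1, 2, 3, 4, 5],
    "Channel": ["Retail", "Retail", "Retail", "Hotel", "Retail"],
    "Region": ["Other"] * 5,
    "Fresh": [12669, 7057, 6353, 13265, 22615],
    "Milk": [9656, 9810, 8808, 1196, 5410],
    "Grocery": [7561, 9568, 7684, 4221, 7198],
    "Frozen": [214, 1762, 2405, 6404, 3915],
    "Detergents_Paper": [2674, 3293, 3516, 507, 1777],
    "Delicatessen": [1338, 1776, 7844, 1788, 5185],
})

# isnull() returns a boolean mask; sum() counts the True values per column.
null_counts = df.isnull().sum()
total_nulls = int(null_counts.sum())

print(null_counts)
print("total missing values:", total_nulls)
```

On the full data set the same two calls would report the number of missing values in each column directly.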

Channel  Region  Fresh    Milk     Grocery  Frozen  Detergents_Paper  Delicatessen  Total Spend
Hotel    Lisbon  761233   228342   237542   184512  56081             70632         1538342
Hotel    Oporto  326215   64519    123074   160861  13516             30965         719150
Hotel    Other   2928269  735753   820101   771606  165990            320358        5742077
Retail   Lisbon  93600    194112   332495   46514   148055            33695         848471
Retail   Oporto  138506   174625   310200   29271   159795            23541         835938
Retail   Other   1032308  1153006  1675150  158886  724420            191752        4935522

Based on the above table, the Hotel channel has the higher total expenditure and the Retail channel the lower.
By region, Oporto has the minimum expenditure and Other has the maximum. The same can
be seen in the figure below. The combination of Hotel and Other has the maximum expenditure, and the combination of Hotel
and Oporto has the minimum.
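
The channel and region totals behind these statements can be recomputed from the Total Spend column. This is a small sketch using only the six totals listed in the table above; the variable names are illustrative.

```python
# Total Spend per (channel, region), copied from the table above.
total_spend = {
    ("Hotel", "Lisbon"): 1538342,
    ("Hotel", "Oporto"): 719150,
    ("Hotel", "Other"): 5742077,
    ("Retail", "Lisbon"): 848471,
    ("Retail", "Oporto"): 835938,
    ("Retail", "Other"): 4935522,
}

by_channel = {}
by_region = {}
for (channel, region), amount in total_spend.items():
    by_channel[channel] = by_channel.get(channel, 0) + amount
    by_region[region] = by_region.get(region, 0) + amount

top_channel = max(by_channel, key=by_channel.get)
low_region = min(by_region, key=by_region.get)
top_pair = max(total_spend, key=total_spend.get)
low_pair = min(total_spend, key=total_spend.get)

print(by_channel)   # Hotel: 7999569, Retail: 6619931
print(by_region)    # Lisbon: 2386813, Oporto: 1555088, Other: 10677599
print(top_channel, low_region, top_pair, low_pair)
```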

Q.1.2 There are 6 different varieties of items are considered. Do all varieties show similar behaviour across Region
and Channel?

From the above table it can be inferred that expenditure on ‘Fresh’ items is visibly higher in the Hotel channel
than in the Retail channel, and this holds in every region. In the Hotel channel, Fresh is also the largest category in every region.
Grocery items are the major contributors to expenditure in the Retail channel in every region. ‘Frozen’
and ‘Delicatessen’ items are the smallest contributors to expenditure in the Retail channel. In both channels,
the Other region accounts for the majority of the expenditure.

The figures show that the 6 items do not follow a similar trend across the two
channels.

Q.1.3 On the basis of descriptive measure of variability, which item shows the most inconsistent behavior? Which items
show the least inconsistent behavior?

index             count  mean   std    min  25%   50%   75%    max     CV

Fresh             440    12000  12647  3    3128  8504  16934  112151  1.05
Milk              440    5796   7380   55   1533  3627  7190   73498   1.27
Grocery           440    7951   9503   3    2153  4756  10656  92780   1.20
Frozen            440    3072   4855   25   742   1526  3554   60869   1.58
Detergents_Paper  440    2881   4768   3    257   817   3922   40827   1.65
Delicatessen      440    1525   2820   3    408   966   1820   47943   1.85

For this question, the coefficient of variation (CV = std / mean) has been calculated, since it measures
dispersion relative to the mean and so allows items with unequal means to be compared for consistency. The CV values are computed from the rounded mean and std shown in the table. Fresh
has the lowest coefficient of variation (about 1.05) and hence is the most consistent of all; Delicatessen has the highest (about 1.85)
and hence is the least consistent of all.
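
The coefficient of variation can be recomputed from the summary statistics. The sketch below uses the rounded mean and std values from the descriptive statistics in Q.1.1, so the results are approximate; the dictionary and variable names are illustrative.

```python
# (mean, std) per item, rounded values from the descriptive statistics table.
summary = {
    "Fresh": (12000, 12647),
    "Milk": (5796, 7380),
    "Grocery": (7951, 9503),
    "Frozen": (3072, 4855),
    "Detergents_Paper": (2881, 4768),
    "Delicatessen": (1525, 2820),
}

# Coefficient of variation = standard deviation / mean.
cv = {item: std / mean for item, (mean, std) in summary.items()}

most_consistent = min(cv, key=cv.get)
least_consistent = max(cv, key=cv.get)

for item, value in sorted(cv.items(), key=lambda kv: kv[1]):
    print(f"{item:18s} {value:.2f}")
print("most consistent:", most_consistent)
print("least consistent:", least_consistent)
```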

Q.1.4 Are there any outliers in the data?

The figure above shows that a significant number of outliers are present in the data for every item. This is consistent with
the first question, where the descriptive statistics showed maximum values far above the 75% quartile.
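
One common way to make this concrete is the 1.5 × IQR rule, which flags values above Q3 + 1.5 × (Q3 - Q1). The report does not state which rule its box plots use, so the rule here is an assumption. The sketch uses the rounded quartiles and maxima from the descriptive statistics and only checks whether each item's maximum lies beyond the upper fence.

```python
# (25% quartile, 75% quartile, max) per item, from the descriptive statistics.
quartiles = {
    "Fresh": (3128, 16934, 112151),
    "Milk": (1533, 7190, 73498),
    "Grocery": (2153, 10656, 92780),
    "Frozen": (742, 3554, 60869),
    "Detergents_Paper": (257, 3922, 40827),
    "Delicatessen": (408, 1820, 47943),
}

upper_fence = {}
has_high_outlier = {}
for item, (q1, q3, maximum) in quartiles.items():
    iqr = q3 - q1
    upper_fence[item] = q3 + 1.5 * iqr          # assumed 1.5 * IQR rule
    has_high_outlier[item] = maximum > upper_fence[item]

for item in quartiles:
    print(f"{item:18s} fence={upper_fence[item]:>9.1f} outlier above fence: {has_high_outlier[item]}")
```

Counting how many observations exceed each fence would require the full data set, which is not reproduced here.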

Q.1.5 On the basis of this report, what are the recommendations?

Conclusion:
Based on the given sample data, the wholesale distributor should focus on the observations below:
• Fresh items are more in demand in the Hotel channel.
• Grocery items are more in demand in the Retail channel.
• The Other region has great demand for Fresh items; hence the distributor should increase the stock of Fresh items there.
• Delicatessen items appear to be less in demand in all the regions.

Study -2
The Student News Service at Clear Mountain State University (CMSU) has decided to gather data about the undergraduate
students that attend CMSU. CMSU creates and distributes a survey of 14 questions and receives responses from 62
undergraduates (stored in the Survey data set).

EDA:

Observations:
The dataset has 14 variables, including:
1. 6 categorical variables - Gender, Class, Major, Grad Intention, Employment and Computer.
2. 5 integer variables - Age, Social Networking, Satisfaction, Spending and Text Messages.
3. 2 float variables - GPA and Salary.
2.1. For this data, construct the following contingency tables (Keep Gender as row variable)

2.1.1. Gender and Major

2.1.2. Gender and Grad Intention

2.1.3. Gender and Employment

2.1.4. Gender and Computer
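
Contingency tables of this kind can be built with pandas. The following is a minimal sketch, assuming pandas is available; the six rows are hypothetical and are not taken from the Survey data set, and only the column names Gender and Major follow the report.

```python
import pandas as pd

# Hypothetical responses for illustration only; not rows from the Survey data set.
survey = pd.DataFrame({
    "Gender": ["Male", "Female", "Female", "Male", "Female", "Male"],
    "Major": ["Accounting", "CIS", "Management", "Management", "CIS", "Other"],
})

# Gender as the row variable; margins=True adds row and column totals ("All").
table = pd.crosstab(survey["Gender"], survey["Major"], margins=True)
print(table)

# Row-normalised version: each row gives P(Major | Gender).
conditional = pd.crosstab(survey["Gender"], survey["Major"], normalize="index")
print(conditional)
```

Replacing Major with the Grad Intention, Employment or Computer column would give the other three tables in the same way.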

2.2. Assume that the sample is representative of the population of CMSU. Based on the data, answer the following
question:

2.2.1. What is the probability that a randomly selected CMSU student will be male?

Total number of students = 62

Number of male students = 29
Probability that a randomly selected CMSU student will be male = 29/62
P(Male) = 0.468
2.2.2. What is the probability that a randomly selected CMSU student will be female?

Total number of students = 62

Number of Female students = 33
Probability that a randomly selected CMSU student will be female = 33/62
P(Female) = 0.532

2.3. Assume that the sample is representative of the population of CMSU. Based on the data, answer the following
question:

Total number of students = 62

Number of males = 29
Number of females = 33

2.3.1. Find the conditional probability of different majors among the male students in CMSU.

• P(Accounting | Male) = 13.8%
• P(CIS | Male) = 3.4%
• P(Economics/Finance | Male) = 13.8%
• P(International Business | Male) = 6.9%
• P(Management | Male) = 20.7%
• P(Other | Male) = 13.8%
• P(Retailing/Marketing | Male) = 17.2%
• P(Undecided | Male) = 10.3%
2.3.2 Find the conditional probability of different majors among the female students of CMSU.

• P(Accounting | Female) = 9.1%
• P(CIS | Female) = 9.1%
• P(Economics/Finance | Female) = 21.2%
• P(International Business | Female) = 12.1%
• P(Management | Female) = 12.1%
• P(Other | Female) = 9.1%
• P(Retailing/Marketing | Female) = 27.3%
• P(Undecided | Female) = 0.0%

2.4. Assume that the sample is a representative of the population of CMSU. Based on the data, answer the following
question:

2.4.1. Find the probability that a randomly chosen student is a male and intends to graduate.

Probability of male and intends to graduate is 27.4%

2.4.2 Find the probability that a randomly selected student is a female and does NOT have a laptop.

Probability of female with no laptop is 6.5%

2.5. Assume that the sample is representative of the population of CMSU. Based on the data, answer the following
question:

2.5.1. Find the probability that a randomly chosen student is either a male or has full-time employment?

Probability of either male or fully employed is 74.2%
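
An "either A or B" probability follows the addition rule P(A or B) = P(A) + P(B) - P(A and B), where the overlap is subtracted so that students who are both male and full-time employed are not counted twice. The sketch below illustrates the rule with hypothetical counts; they are not taken from the Survey data set, and the function name is illustrative.

```python
from fractions import Fraction


def prob_union(n_a, n_b, n_both, n_total):
    """P(A or B) = P(A) + P(B) - P(A and B), computed from counts."""
    return Fraction(n_a + n_b - n_both, n_total)


# Hypothetical counts for illustration; not taken from the Survey data set.
p = prob_union(n_a=40, n_b=30, n_both=10, n_total=100)
print(p, float(p))
```

With the survey data, the three counts would come from the Gender and Employment contingency table.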

2.5.2. Find the conditional probability that given a female student is randomly chosen, she is majoring in international
business or management.

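A student has only one major, so the two events are mutually exclusive and their conditional probabilities add. Using the values from 2.3.2:
P(International Business or Management | Female) = 12.1% + 12.1% = 24.2%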

2.6. Construct a contingency table of Gender and Intent to Graduate at 2 levels (Yes/No). The Undecided students are
not considered now, and the table is a 2x2 table. Do you think the graduate intention and being female are independent
events?
Total number of students = 40

Number of females = 20
Number of students with Graduation Intent Yes = 28
Number of females with Graduation Intent Yes = 11

Let A = Graduation Intent Yes and B = Female.
P(B) = 20/40 = 0.5
P(A) = 28/40 = 0.7
P(A)*P(B) = 28/80 = 0.35
P(A|B) = 11/20 = 0.55
P(A and B) = P(A|B)*P(B) = 11/40 = 0.275

If A and B were independent, P(A and B) would equal P(A)*P(B). Since 0.275 ≠ 0.35, the events are dependent.
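
The independence check can be written out with exact fractions. This sketch uses only the four counts stated above; the variable names are illustrative.

```python
from fractions import Fraction

# Counts from the 2x2 table (Undecided students excluded).
total = 40
females = 20
intent_yes = 28
female_and_yes = 11

p_female = Fraction(females, total)          # P(B)
p_yes = Fraction(intent_yes, total)          # P(A)
p_joint = Fraction(female_and_yes, total)    # P(A and B)
p_product = p_female * p_yes                 # P(A) * P(B)

# Independence requires P(A and B) == P(A) * P(B).
independent = p_joint == p_product

print("P(A and B) =", float(p_joint))
print("P(A)*P(B)  =", float(p_product))
print("independent:", independent)
```

Exact equality is rare in sample data, so a chi-square test of independence is the usual formal check when a small gap might be due to sampling variation.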
2.7. Note that there are four numerical (continuous) variables in the data set, GPA, Salary, Spending, and Text Messages.

Answer the following questions based on the data

2.7.1. If a student is chosen randomly, what is the probability that his/her GPA is less than 3?

Total number of students = 62

Number of students’ GPA less than 3 = 17
Probability student’s GPA less than 3 = 17/62 = 0.27

2.7.2. Find the conditional probability that a randomly selected male earns 50 or more. Find the conditional probability
that a randomly selected female earns 50 or more.

P(Salary of 50 or more | Male) = 14/29 = 48.3%

P(Salary of 50 or more | Female) = 18/33 = 54.5%

2.8. Note that there are four numerical (continuous) variables in the data set, GPA, Salary, Spending, and Text Messages.
For each of them comment whether they follow a normal distribution. Write a note summarizing your conclusions.

The box plots of all four numerical variables appear to be almost normally distributed, as shown below. Also, the
mean and median of each variable are almost equal, which is further evidence that they are approximately normally
distributed. In addition, all the variables satisfy the empirical rule: about 99.7% of the values lie within (µ-3σ, µ+3σ).
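
The empirical rule check can be sketched as follows. The data here is synthetic, generated from a normal distribution with an arbitrary mean and standard deviation, because the Survey values are not reproduced in this report; the function name is illustrative.

```python
import random
import statistics


def empirical_rule_coverage(values):
    """Fraction of values within 1, 2 and 3 sample standard deviations of the mean."""
    mu = statistics.mean(values)
    sigma = statistics.stdev(values)
    n = len(values)
    return {
        k: sum(1 for v in values if abs(v - mu) <= k * sigma) / n
        for k in (1, 2, 3)
    }


# Synthetic normal data for illustration; not the Survey data set.
rng = random.Random(0)
sample = [rng.gauss(3.0, 0.4) for _ in range(2000)]

coverage = empirical_rule_coverage(sample)
for k, share in coverage.items():
    print(f"within {k} sigma: {share:.3f}")
```

For normally distributed data the three shares are close to 68%, 95% and 99.7%. Checking only the 3σ band is a weak test, since many non-normal distributions also keep almost all values within it.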
Study -3
An important quality characteristic used by the manufacturers of ABC asphalt shingles is the amount of moisture the
shingles contain when they are packaged. Customers may feel that they have purchased a product lacking in quality if they
find moisture and wet shingles inside the packaging. In some cases, excessive moisture can cause the granules attached
to the shingles for texture and coloring purposes to fall off the shingles resulting in appearance problems. To monitor the
amount of moisture present, the company conducts moisture tests. A shingle is weighed and then dried. The shingle is
then reweighed and based on the amount of moisture taken out of the product, the pounds of moisture per 100 square
feet are calculated. The company would like to show that the mean moisture content is less than 0.35 pound per 100
square feet.

The file (A & B shingles.csv) includes 36 measurements (in pounds per 100 square feet) for A shingles and 31 for B shingles.

EDA:

The data set contains float type data in two variables, A and B.

The descriptive statistics of the data are shown below. The standard deviations of the two samples
are in the same range; however, the means are different. Column A has 36 observations and column B has 31.
No null values are observed among these measurements.

A B
count 36 31
mean 0.316667 0.273548
std 0.135731 0.137296
min 0.13 0.1
25% 0.2075 0.16
50% 0.29 0.23
75% 0.3925 0.4
max 0.72 0.58
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 36 entries, 0 to 35
Data columns (total 2 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 A 36 non-null float64
1 B 31 non-null float64
dtypes: float64(2)
memory usage: 704.0 bytes
Q.3.1 Do you think there is evidence that mean moisture contents in both types of shingles are within the permissible
limits? State your conclusions clearly showing all steps.

Let 𝜇A be the population mean moisture content of shingles A

Let 𝜇B be the population mean moisture content of shingles B

Sample A

Step 1: Defining null and alternative hypotheses

The population standard deviation is unknown. Although both sample sizes are more than 30, they are close to that
threshold, so a t-test is used rather than a z-test.

For Sample A
𝐻0: 𝜇A >= 0.35
𝐻𝐴: 𝜇A < 0.35
The company would like to show that the mean moisture content is less than 0.35, so that claim is the alternative hypothesis and the test is left-tailed. This matches the negative t-statistic and the one-sided p-value reported in Step 3.

Step 2: Deciding the significance level

Since 𝛼 is not given, we select 𝛼 (significance level) = 0.05.

The sample size for sample A is 36.
Degrees of freedom for sample A: 35

Step 3: Calculate the p-value and test statistic

One-sample, one-sided t-test results:
t-statistic: -1.4735046253382782 p-value: 0.07477633144907513

Step 4: Decide whether to reject or fail to reject the null hypothesis

The p-value is 0.07477633144907513, which is greater than the 5% level of significance.

So, the statistical decision is to fail to reject the null hypothesis at the 5% level of significance.

Hence, at the 0.05 significance level there is not enough evidence to conclude that the mean moisture content of shingles A is less than 0.35 pound per 100
square feet.

Sample B

Step 1: Defining null and alternative hypotheses

The population standard deviation is unknown. Although both sample sizes are more than 30, they are close to that
threshold, so a t-test is used rather than a z-test.

For Sample B
𝐻0: 𝜇B >= 0.35
𝐻𝐴: 𝜇B < 0.35
The company would like to show that the mean moisture content is less than 0.35, so that claim is the alternative hypothesis and the test is left-tailed. This matches the negative t-statistic and the one-sided p-value reported in Step 3.

Step 2: Deciding the significance level

Since 𝛼 is not given, we select 𝛼 (significance level) = 0.05.

The sample size for sample B is 31.
Degrees of freedom for sample B: 30

Step 3: Calculate the p-value and test statistic

One-sample, one-sided t-test results:
t-statistic: -3.1003313069986995 p-value: 0.0020904774003191826

Step 4: Decide whether to reject or fail to reject the null hypothesis

The p-value is 0.0020904774003191826, which is less than the 5% level of significance.

So, the statistical decision is to reject the null hypothesis at the 5% level of significance.

Hence, at the 0.05 significance level there is sufficient evidence to conclude that the mean moisture content of shingles B is less than 0.35 pound per 100
square feet.
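
The two t-statistics reported for samples A and B can be reproduced from the summary statistics in the EDA table, using t = (sample mean - 0.35) / (std / sqrt(n)). The sketch below uses only the standard library and the rounded mean and std values shown earlier; the function name is illustrative.

```python
import math


def one_sample_t(mean, std, n, mu0):
    """One-sample t-statistic from summary statistics (std is the sample std)."""
    return (mean - mu0) / (std / math.sqrt(n))


# Summary statistics from the EDA table; 0.35 is the permissible limit.
t_a = one_sample_t(mean=0.316667, std=0.135731, n=36, mu0=0.35)
t_b = one_sample_t(mean=0.273548, std=0.137296, n=31, mu0=0.35)

print(f"t for A: {t_a:.4f} (df = 35)")
print(f"t for B: {t_b:.4f} (df = 30)")
```

The one-sided p-value is the area of Student's t distribution with n - 1 degrees of freedom to the left of the statistic. If SciPy is available, scipy.stats.t.cdf(t, df) returns that area.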

Q.3.2 Do you think that the population mean for shingles A and B are equal? Form the hypothesis and conduct the test
of the hypothesis. What assumption do you need to check before the test for equality of means is performed?

To perform a test for comparison of means, the samples should satisfy certain assumptions, as stated below:

a) The variances of the two samples should be similar. This is satisfied, as shown in the EDA (std of about 0.136 for A and 0.137 for B).
b) The samples should be random and both populations should be normally distributed. Population data is not
available, but since each sample size is more than 30, the distribution of the sample mean is approximately normal by the central limit theorem. This
has also been checked using empirical rule analysis, as shown in the Jupyter notebook.
c) Outliers in the data should be minimal. The figure below shows that outliers are minimal.

Based on the above assumptions, a two-sample independent t-test can be performed on the data samples.

Step 1: Define null and alternative hypotheses

In testing whether the population means for shingles A and B are equal, the null hypothesis states that the population
means of shingles A and shingles B are the same (𝜇A = 𝜇B). The alternative hypothesis states that the population means of
shingles A and shingles B are different (𝜇A ≠ 𝜇B). We use a two-tailed t-test.

H0: 𝜇A - 𝜇B = 0 i.e. 𝜇A = 𝜇B
HA: 𝜇A - 𝜇B ≠ 0 i.e. 𝜇A ≠ 𝜇B

Step 2: Defining the significance level

Since 𝛼 is not given, we select 𝛼 (significance level) = 0.05.

The sample sizes of the two samples are not the same.
Degrees of freedom of the test: 36+31-2 = 65

Step 3: Calculating the p - value and test statistic

Two sample t test

t-statistic: 1.2896282719661123 p-value: 0.2017496571835306
Step 4: Decision to reject or accept null hypothesis

The level of significance is 0.05 and the two-sample t-test p-value is 0.2017. Since the p-value is greater than 0.05,
we fail to reject the null hypothesis.
The results indicate that there is no statistically significant difference between the population means.

At the 0.05 significance level, there is not sufficient evidence to conclude that the mean moisture
content of A differs from that of B. This is a failure to reject the null hypothesis of equal means, not proof that the means are equal.
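
The two-sample t-statistic can be reproduced from the summary statistics with the pooled-variance formula, which matches the 65 degrees of freedom stated in Step 2. This is a sketch using only the standard library and the rounded values from the EDA table; the function name is illustrative.

```python
import math


def pooled_t(mean1, std1, n1, mean2, std2, n2):
    """Equal-variance (pooled) two-sample t-statistic and its degrees of freedom."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * std1 ** 2 + (n2 - 1) * std2 ** 2) / df
    std_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / std_error, df


# Summary statistics for A and B from the EDA table.
t_stat, df = pooled_t(0.316667, 0.135731, 36, 0.273548, 0.137296, 31)

print(f"t = {t_stat:.4f}, df = {df}")
```

The two-sided p-value is twice the upper-tail area of Student's t distribution beyond |t| with 65 degrees of freedom.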

