5.3 Quartiles (Skewness)
Ex #1: 14, 17, 19, 23, 27, 32, 40, 49, 54, 59, 71, 80
Find all the quartiles and the quartile deviation.
Sol:
2|Page RFI
n=12
For Q1: n/4 = 12/4 = 3 is an integer, so Q1 is the average of the 3rd and 4th values: Q1 = (19 + 23)/2 = 21
Ex #2: 14, 17, 19, 23, 27, 32, 40, 49, 54, 59, 71, 80, 81
Sol:
n=13
Q1 = 23; Q2 = 40; Q3 = 59
Ex #3: 14, 17, 19, 23, 27, 32, 40, 49, 54, 59, 71, 80, 81, 81
Sol:
n=14
Q1 = 23; Q2 = 44.5; Q3 = 71
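The answers in Ex #2 and Ex #3 are consistent with a simple position rule: compute i*n/4 for the i-th quartile; if it is an integer k, average the k-th and (k+1)-th values, otherwise round up and take that value. The rule is inferred from the worked values rather than stated in these notes, so the sketch below should be read as an assumption. The function name `quartile` is hypothetical.

```python
def quartile(sorted_values, i):
    """Return the i-th quartile (i = 1, 2, 3) of a list sorted in ascending order."""
    n = len(sorted_values)
    # Position i*n/4, split into its integer part and remainder.
    whole, remainder = divmod(i * n, 4)
    if remainder == 0:
        # Integer position k: average the k-th and (k+1)-th values (1-indexed).
        return (sorted_values[whole - 1] + sorted_values[whole]) / 2
    # Non-integer position: round up, i.e. take the (whole+1)-th value.
    return sorted_values[whole]


ex1 = [14, 17, 19, 23, 27, 32, 40, 49, 54, 59, 71, 80]
ex2 = ex1 + [81]
ex3 = ex2 + [81]

for name, data in (("Ex #1", ex1), ("Ex #2", ex2), ("Ex #3", ex3)):
    print(name, [quartile(data, i) for i in (1, 2, 3)])
```

Other textbooks and software packages use different quartile conventions, so results from a spreadsheet function may differ slightly from these.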
Inter-quartile Range
IQR = Q3 - Q1 = 71 - 23 = 48
Quartile deviation = (Q3 - Q1)/2 = (71 - 23)/2 = 24
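As a quick check of the arithmetic, the two spread measures can be computed directly from the Ex #3 quartiles (Q1 = 23, Q3 = 71). The function names are hypothetical helpers for illustration only.

```python
def interquartile_range(q1, q3):
    """Width of the middle half of the data."""
    return q3 - q1


def quartile_deviation(q1, q3):
    """Half of the inter-quartile range (also called the semi-interquartile range)."""
    return (q3 - q1) / 2


q1, q3 = 23, 71  # quartiles from Ex #3
iqr = interquartile_range(q1, q3)
qd = quartile_deviation(q1, q3)
print(iqr, qd)
```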
Skewness
[Figures: three plots against Age, each marked with its mean, median and mode; one is annotated 50% / 50%.]
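Measuring skewness
#1. Pearson's coefficient of skewness
SKP = (mean - mode) / standard deviation
Statistical law: Mode = 3*median - 2*mean
or
SKP = 3*(mean - median) / standard deviation
If SKP = 0; it is symmetric
This method is preferable for ungrouped data.
#2. Bowley's coefficient of skewness
SKB = (Q3 + Q1 - 2*Q2) / (Q3 - Q1)
If SKB = 0; it is symmetric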
Ex #4: 14, 17, 19, 23, 27, 32, 40, 49, 54, 59, 71, 80, 81, 81
Sol:
n=14
Q1 = 23; Q2 = 44.5; Q3 = 71
SKB = (Q3 + Q1 - 2*Q2) / (Q3 - Q1) = (71 + 23 - 2*44.5) / (71 - 23) = 5/48 = 0.10 (approx.)
It is positively skewed
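Both coefficients can be checked on this dataset with a short sketch. It assumes the median-based form of Pearson's coefficient and the population standard deviation (`statistics.pstdev`); the notes do not say which standard deviation is intended, so that choice is an assumption. The function names are hypothetical.

```python
import statistics


def pearson_median_skewness(values):
    """Pearson's coefficient in its median form: 3*(mean - median) / sd."""
    mean = statistics.mean(values)
    median = statistics.median(values)
    sd = statistics.pstdev(values)  # assumption: population standard deviation
    return 3 * (mean - median) / sd


def bowley_skewness(q1, q2, q3):
    """Bowley's coefficient: (Q3 + Q1 - 2*Q2) / (Q3 - Q1)."""
    return (q3 + q1 - 2 * q2) / (q3 - q1)


data = [14, 17, 19, 23, 27, 32, 40, 49, 54, 59, 71, 80, 81, 81]
skb = bowley_skewness(23, 44.5, 71)  # quartiles of this dataset: 5/48
skp = pearson_median_skewness(data)
print(round(skb, 3), skp > 0)
```

Both measures come out positive for this dataset, so they agree on the direction of the skew here.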
** It is a popular (mis)conception that the sign or direction of the skew for a
numerical dataset dictates the location of the mean with respect to the
median. The idea is that the mean hangs out in the tail region.
Ex #5: 14, 17, 19, 23, 27, 32, 40, 49, 54, 59, 71, 80, 81, 81
Sol:
n=14
Q1 = 23; Q2 = 44.5; Q3 = 71
[Figure: box plot of the data on an axis running from 10 to 90.]
** The dotted lines to the left and to the right of the box are called whiskers. What these
whiskers extend to is a matter of choice. There are some situations in statistics where we
must make a call and stand by it. This is one of them. An American mathematician called
Tukey came up with this version of a boxplot. The whiskers stop either at the extreme
values, or at a fixed distance of 1.5 IQRs (inter-quartile ranges) from the box, whichever
comes first. Points that lie beyond the 1.5 IQR mark are called outliers; this is one way
to define them.
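The whisker rule can be sketched as follows for the Ex #5 data, using Q1 = 23 and Q3 = 71. The helper name `tukey_fences` and the variable names are hypothetical; the 1.5 multiplier is the one described in the note.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return the (lower, upper) limits at k IQRs beyond the box."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


data = [14, 17, 19, 23, 27, 32, 40, 49, 54, 59, 71, 80, 81, 81]
lower, upper = tukey_fences(23, 71)

# Whiskers end at the most extreme data points that are still inside the fences.
inside = [x for x in data if lower <= x <= upper]
whisker_low, whisker_high = min(inside), max(inside)

# Anything outside the fences is flagged as an outlier.
outliers = [x for x in data if x < lower or x > upper]
print(lower, upper, whisker_low, whisker_high, outliers)
```

For this dataset the fences fall well outside the data, so the whiskers stop at the extreme values and no point is flagged.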
Exercise
The 2015 batch of an Executive MBA program has 100 students. Their GPAs
after the first quarter are captured by the box plot below. You are also
supplied the dataset on the GPAs, which we ask you to download.
OUTLIERS
(1 point possible)
How many outliers exist beyond the left whisker?
Options: 2, 5, 4, 6
FINAL
Ramya was feeling very energetic as she drove to her office that late
November morning. Her Diwali festival break had expanded into a whole
week, thanks to a few days of casual leave. As she stepped into the building
that housed her office, she was greeted by the concierge: “Hey Ramya,
looks like the holidays have rejuvenated you.” Ramya replied with a broad
smile, “Thanks Roshan, did you burst any crackers?” “Yup! But no noisy
ones,” he replied.
As she walked into her office, Ramya realised that something else was
making her feel pumped up. She held an executive position at
Souravi Fashions (SF), a company that manufactured and sold niche
garments to women. The company owned 13 stores in 5 metros across
India. Being a small store chain, their investment in IT infrastructure was
bare-bones, consisting of a desktop machine at each location, which was
connected to the Internet. Each store had a well-trained cashier, who
doubled up as a data entry specialist.
Business at SF had been quite good for the last couple of years. However,
competition from online retailers was fast becoming a threat. In a long and
arduous meeting held just before the extended weekend, the company had
decided to open more stores, thereby increasing access to its loyal customer
base. Ramya was entrusted with the task of finding potentially profitable
localities to open these stores.
“This meeting could not have been timed better”, she thought. “Diwali is the
one season when retail business is booming all across India.” Later that day,
she sat in her office and sent out emails to all the stores. Being tech-savvy,
Ramya created a spreadsheet on the cloud, and instructed the store
managers to enter the details of every transaction that had taken place
during the festival week. Specifically, she instructed them to meticulously
record all transactions for the period between 16 November and 22
November. Diwali had fallen on 22nd November that year.
Settling into her office after the holidays, Ramya wondered what additional
information might be helpful besides the transactions. The company had
carefully archived some data when it began its operations across the various
cities. In these archives, there was a dataset with median incomes at all
localities where the existing stores operated. Another dataset contained the
list of declared household incomes (DHI) of customers registered in SF's
loyalty program.
STORE PERFORMANCE
(3 points possible)
You may use a pivot table to aggregate store-wise sales. Obtain the number
of garments sold, total sales and average price per garment sold in each
store. Keep in mind that the dataset is only for a brief festival period.
The higher the volume of sales, the higher is the average price per garment
The higher the volume of sales, the lower is the average price per garment
Stores selling high priced garments are more likely to have a larger volume of sales
Stores selling high priced garments are less likely to have a larger volume of sales
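Outside a spreadsheet, the same store-wise aggregation can be sketched in a few lines. The rows below are hypothetical and are not taken from the SF dataset; the sketch also assumes that each row records the sale of a single garment, and the field and function names are made up for illustration.

```python
from collections import defaultdict

# Hypothetical rows: (store_id, price of one garment sold).
transactions = [("S01", 500.0), ("S01", 700.0), ("S02", 1200.0)]


def store_summary(rows):
    """Aggregate units sold, total sales and average price per garment by store."""
    units = defaultdict(int)
    revenue = defaultdict(float)
    for store_id, price in rows:
        units[store_id] += 1
        revenue[store_id] += price
    return {
        store: {
            "units": units[store],
            "total_sales": revenue[store],
            "avg_price": revenue[store] / units[store],
        }
        for store in units
    }


summary = store_summary(transactions)
for store, stats in summary.items():
    print(store, stats)
```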
CORRELATION
(1 point possible)
Which of the following statements are correct?
BANGALORE
(4 points possible)
Construct a two-dimensional pivot table, with Store IDs along the rows and
Garment Types along the columns. Fill in the table with total revenues from
the sale of a specific type of garment from a specific store. You are ready!
Within Bangalore, the revenue from sales of sportswear is the highest for
which of these stores?
Within Bangalore, the revenue from sales of formal garments is the lowest
for which of these stores?
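A two-dimensional pivot of this kind can be sketched with a nested dictionary: stores along one axis, garment types along the other, summed revenue in each cell. The rows, store IDs and garment type labels below are hypothetical and only illustrate the shape of the table.

```python
from collections import defaultdict

# Hypothetical rows: (store_id, garment_type, revenue of the transaction).
rows = [
    ("S01", "Sportswear", 900.0),
    ("S01", "Formal", 400.0),
    ("S02", "Sportswear", 1500.0),
    ("S02", "Sportswear", 300.0),
]


def revenue_pivot(rows):
    """Build table[store_id][garment_type] = total revenue."""
    table = defaultdict(lambda: defaultdict(float))
    for store_id, garment_type, revenue in rows:
        table[store_id][garment_type] += revenue
    return table


pivot = revenue_pivot(rows)

# Store with the highest revenue for one garment type; missing cells count as 0.
best_sportswear = max(pivot, key=lambda s: pivot[s].get("Sportswear", 0.0))
print(best_sportswear, pivot[best_sportswear]["Sportswear"])
```

To answer a city-level question, the rows would first be filtered to the stores in that city before building the table.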
COUNTS
(2 points possible)
The sales of how many units of garments are recorded within this dataset?
(5 points possible)
For the following list of problems, assume that people prefer to shop at the
store nearest to them. In other words, everyone buys garments from the
store in their locality.
Answer the questions that follow, all of which concern correlations. (Be
careful to select the complete columns when entering the correlation
formula in your spreadsheet.)
What is the correlation of the average household income in the locality of the
store, with the minimum price of garment sold in the corresponding store?
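For reference, the correlation coefficient that a spreadsheet computes can be sketched by hand as below. The two lists are hypothetical stand-ins for a pair of columns (one value per store), not values from the SF data, and the function name is made up. The length check mirrors the caution about complete columns: both columns must cover exactly the same stores.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric columns."""
    if len(xs) != len(ys):
        raise ValueError("columns must have the same number of entries")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical columns, one entry per store.
income = [1.0, 2.0, 3.0, 4.0]
min_price = [2.0, 4.0, 6.0, 8.0]
r = pearson_correlation(income, min_price)
print(r)
```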
TRENDS
(1 point possible)
Using the correlations in this exercise, which of these conclusions can be
made?