Module 01 Introduction To Business Statistics
Module 01 Introduction To Business Statistics
Statistical Data
Statistical data refers to the collection of quantitative information or facts
that have been systematically gathered, organized, and analyzed. These types of
data can be collected from various methods, such as surveys, experiments,
observations, or even from existing sources. Statistical data can be classified into
several types based on the nature of the data and the way it is collected and
analyzed. The main types of statistical data are Qualitative Data, Quantitative
Data, Univariate Data, Bivariate Data, Multivariate Data, Time Series Data, and
Cross-Sectional Data.
I Qualitative Data
1. Nominal Data: Nominal data is the categorical data where the categories or
labels have no inherent order or ranking. For example, in a questionnaire, a
group of people is asked to fill in their marital status opting for Married, Never
Married, Widowed, Divorced, or Don’t Want to Reveal.
2. Ordinal Data: Ordinal data is the categorical data where the categories have a
meaningful order or ranking. The ranking has a meaning and can use alphabetic
or numeric values. For example, Credit rating agencies give ratings as AAA, AA,
A, A+, AB, ….., etc.
Examples of Qualitative Data
1. Measurable: This type of data can be easily measured and quantified. This
means that we can perform arithmetic operations like addition, subtraction,
multiplication, and division on these types of data values.
2. Numerical Values: This data is represented by numbers. These numbers can
be discrete (whole numbers) or continuous (real numbers with infinite decimal
places).
3. Visual Representation: Quantitative data can be effectively represented using
various graphical tools, such as histograms, bar charts, scatter plots, box plots,
and line graphs.
4. Descriptive Statistics: Descriptive statistics is used to summarize and describe
quantitative data.
Categorization of Quantitative Data
1. Discrete Data: Discrete data consists of distinct values, separate values that
cannot be broken down further. These values are typically whole numbers and
often represent counts of items or events. For example, the roll numbers of
students in a class can only be 1, 2, 3, 4, …, so on.
1. Election Exit Polls: Data collected through exit polls during an election,
capturing voter demographics, candidate preferences, and key issues
on Election Day.
1. Healthcare: In medical research, cross-sectional studies may be
conducted to assess the prevalence of a particular disease or condition
in a population at a given moment.
1. Social Sciences: Cross-sectional data is valuable for studying societal
issues, such as income inequality, education levels, and political
preferences.
Mutually Exclusive Categories: Each data point can belong to only one
category, and categories are mutually exclusive. For example, an
individual cannot be both male and female simultaneously.
No Inherent Order: The categories do not have a natural order or
ranking. In the gender example, there is no inherent order among
male, female, and non-binary.
No Arithmetic Operations: Nominal data does not support meaningful
mathematical operations like addition, subtraction, or multiplication.
You cannot perform calculations like finding the average (mean) or
taking the difference between categories.
Mode as the Measure of Central Tendency: The most suitable
measure of central tendency for nominal data is the mode, which
simply identifies the most frequently occurring category.
Star Ratings: Online reviews and rating systems often use ordinal
scales. For example, a product or service might be rated on a scale of 1
to 5 stars, with 5 stars indicating the highest rating. While these stars
represent a ranking, the difference in quality or satisfaction between,
say, 3 and 4 stars may not be the same as between 4 and 5 stars.
Ordered Data: Like the ordinal scale, data on the interval scale can be
ordered or ranked. You can say that one temperature is higher or lower
than another, or that one IQ score is higher or lower than another.
Age: Age in years is a ratio scale variable. It starts from zero at birth,
and one can perform meaningful arithmetic operations with age. For
instance, if someone is 40 years old and another person is 20 years old,
we can say that the first person is twice as old as the second person.
True Zero Point: The defining characteristic of the ratio scale is the
presence of a true zero point, which represents the complete absence
of the measured attribute. This allows for the calculation of meaningful
ratios (e.g., one value is twice as large as another).
Equal Intervals: Like the interval scale, the ratio scale has equal
intervals between values. This means that the numerical difference
between any two values on the scale is meaningful and consistent.
Order and Magnitude: The data on a ratio scale can be ordered or
ranked based on magnitude, and meaningful comparisons can be made
between values.