Unit 11 Introduction To Statistics: Structure
Unit 11 Introduction To Statistics: Structure
11.1 INTRODUCTION
As you know every subject has its origin, development stages, scope, uses and
limitations.
In this unit, we will discuss origin and development, definition, scope and uses,
and limitations of statistics. Different measurement scales and different types
of data also have been discussed in this unit.
Objectives
After completing this unit, you should be able to:
know origin and development stages of statistics;
know definition, scope, uses and limitations of statistics;
get an idea of different types of measurement scales; and
get an idea of different types of data.
56
institute unique internationally are Professor C.R. Rao, Professor R.C. Bose, Introduction to Statistics
Professor S.N. Roy, etc.
First post graduate course in Statistics was started by Kolkata University in
1941, while first under graduate course in Statistics was started by the
Presidency College Kolkata. With the passage of time some more
universities/institutes came up with courses in Statistics. Some of these are
University of Mumbai, University of Pune, University of Madras, University of
Mysore, University of Kerala and University of Lucknow. This list of
institutions went on increasing with time and at present more than 1100
institutes are there in the country, which are offering under graduate or post
graduate courses in statistics.
Statistics in World
Without going into more details, we will concentrate on only some major
discoveries in the area of Statistics at international level. A lot of theoretical
development in different areas of statistics took place in seventeenth and
eighteenth centuries in many countries of the world. John Graunt (1620-1674
born in London), being a haberdasher by profession, has the credit of
producing the first life table with probabilities of survival to each age. Due to
this great achievement, he is known as father of vital statistics. This was the
period when some other persons also did their contribution in the same area
such as Edmund Haller (1656-1742) prepared a life table on the basis of the
data collected by Casper Newman in 1691, relating to death records of Breslau.
Sir William Petty (1623-1687) also prepared mortality tables and calculated
expectation of life at different ages. G.F. Knapp (1842-1926) and W. lexis
(1837-1914) also did valuable work on the statistics of mortality. Study of
probability was also found to be very important in the area of Statistics,
quantitative measure of which was given by Galileo (1564-1642), an Italian
mathematician[For detail discussion on development of Probability Sec 1.1 of
Unit 1 of MST-003 may be referred to]. Guass (1777-1855) gave the principal
of least square and normal law of errors. J. Bernoulli (1654-1705) was the first
person who states the law of large numbers in his great work Ars conjectandi
published eight years after his death. Statistical methods in the field of
biometry were first introduced of Sir Francis Galton (1822-1911). Later on
Professor Karl Pearson (1857-1936) followed up the work of Galton and did
significant contribution to Anthropology and correlation coefficient theories.
Karl Pearson was also the founder of Statistical Research Laboratory in the
university college, London in 1911. Credit of discovery of Chi-Square test also
goes to Karl Pearson. Credit of discovery of ‘t test’ or ‘student t’ test goes to
W.S. Gosset who wrote under the pseudonym of student’s ‘t’.
List of contributors in the area of Statistics did not end here but we conclude by
throwing some light on the work done by Fisher. Credit of discovering of very
powerful test known as Analysis of Variance (ANOVA) goes to Sir Ronald A.
Fisher (1890-1962). Fisher also did a lot of work in the area of point
estimation. Due to his remarkable contribution in the field of Statistics, he is
known as Father of Statistics.
58
Biology and Statistics Introduction to Statistics
Professor Karl Pearson has stated that the whole doctrine of heredity rests on
statistical basis. This is generally said that height of the child is associated with
the height of the father. To test this type of hypothesis, statistics is the only
science which provides the scientific methods. Vital statistics is totally devoted
to the different aspects of human life like average life of men and women, birth
and death rates, etc. Those learners who will opt Bio-Statistics specialisation
will get these terms in detail in courses MSTE-003 and MSTE-004.
Statistics and Medicine
Statistics also plays an important role in the field of medicine. The hypothesis
of the types:
(i) Drug A is better than drug B.
(ii) Smoking and cancer are associated.
(iii) Smoking and TB are associated.
All are tested using t-test or 2 -test as the case may be. Statistics also find its
application in clinical trials.
Statistics and Planning
Every institution/organisation plans for its future targets. Now, a days for a
good planning, it has became necessary to analysis the statistical data
according to the field of interest such as availability of raw material,
consumption, investment, resources available, income, expenditure, quality
needed, etc. In order to analysis these types of data, one has to totally depend
on the statistical techniques. Thus statistics is essential for planning.
Statistics and Commerce
In present times, there is a very tough competition in almost every business.
Also fashions, likings/tastes, requirements, trends, levels of qualities,
technologies, etc. are changing very fast. So for the success of the business, it
has become necessary for a business man to know the coming trend of market
in advance or as soon as possible. This can also be achieved only with the help
of market survey, which requires statistical techniques.
Statistics and Agriculture
Presently there are a number of varieties of seeds for a particular crop. Also
different types of fertilizers are available in the market. For a good yield, it has
become necessary to know that which one is better. This job is again done by a
very popular and widely used test known as Analysis of variance (ANOVA)
discovered by Professor R.A. Fisher. You will learn about ANOVA in more
detail in block 2 of course MST-005. Complete Block 2 of MST-005 is totally
devoted to ANOVA.
Statistics and Insurance Sector
Whole insurance sector totally depends on the statistical data and different
concepts of probability theory. Life tables lies in the heart of human
insurances. Curtate future life time and complete future life time, of a life are
calculated using concept of random variables and their expected values. (you
will learn in detail about random variables and their expected values in block 2
of course MST-003). Due to the large use of statistics in insurance sector, a
new branch of statistics known as Actuarial statistics has been started in some
institutes throughout the world.
59
Matrices, Determinants Statistics and Research
and Collection of Data
Research is very important aspect in every discipline. In many disciplines such
as psychology, tourism, education, M.B.A., etc. one has to collect the data on
the characteristics of interest under study. Now a very important question
arises, related to the measurement of scale to be used and appropriate test to be
used. This requires the knowledge of different types of measurement scales and
accordingly suitable statistical tool need. (Different types of measurement
scales-nominal, ordinal, interval and ratio have been discussed in detail in Sec
11.6 of this unit). Also appropriate statistical tool to be needed in a given
situation have been listed in Table 11.1.
Statistics and Economics
In order to know about the development of a country, it has become necessary
to obtain the data related to its economical growth. Again, statistical tools are
needed to collect relevant data (such as related to agricultural, industrial,
literacy, etc) and for its analysis.
Statistics and Common Man
Statistics also plays an important role in the welfare of common man. Common
man of any country faces lot of problems in his routine life such as food
shortage, hygienic drinking water, unemployment, poverty, medical, shortage
of public transport, etc. Time to time statistical figures on these issues enables
the government to think and sort out these problems.
Statistics helps the common man in their day to day life in another way also,
e.g. in purchasing any good he/she used his/her past experience (actually based
on the data he/she faced/experienced) and take the decision to buy or not buy a
particular object. Similarly, a farmer decide about the crop to be yield based on
his past experience (actually based on the data he has faced) and labourer
choose one of the works which gives him more wages based on his past
experienced (actually based on the data he has faced).
List of fields/areas where statistics is used does not end here. We have just
touch some of the areas where statistics has its application. We close this
section by saying that there is hardly any field where statistics cannot be used.
Infact, statistics can be used in any field of interest.
60
(2) Dealingness with a Group Introduction to Statistics
Science of statistics deals with aggregates of objects not with individuals. The
individual’s figures, when taken separately do not come under the category of
statistical data. So, applicability of any statistical tool becomes meaningless.
For example, salary of one employee of an institute does give any message
related to the salaries of the employees of that particular institution.
(3) Lack of Exactness
Statistical results are not exactly true, but they are true on an average.
For example,
(i) If a statistical report says that 70% population of India lived in rural area. It
does not imply that if you visit at public place like bus stand, railway
station, etc. and asked the people about their living place. Results may
surprise you and may highly differ with the above figure. But you may
note that as sample size increases, the result will also come nearer and
nearer to exact figure 70%.
(ii) Consider another example, suppose past data show that 90% operations of
a doctor are successful. It does not imply that out of the next 100
operations, exactly 90 will be successful. It may happen that figure that
will obtain in future may be 90%, 80%, 87%, 95%, etc. But there are
sciences like mathematics where exactness is maintained. For example, if a
book seller get 5% profit on selling a particular book. Then it is sure that if
sell of that particular book is of Rs 200 he/she will get Rs 10 as profit and
in case of sale of Rs 300 profit will be Rs 15 and so on.
(4) Requirement of Experts Hands for Effective Use
Requirement of experts’ hands for effective and appropriate use is one of the
main draw backs of the science of statistics. There are many statistical tools of
similar type.
For example,
(i) To find average in a particular situation, which of the possible tools likes
mean, median, mode, geometric mean, harmonic mean, etc. is appropriate
needs the hands of experts.
(ii) Similarly to test a given statistical hypothesis which of the possible tools
like Z-test, t-test, 2 -test, F-test, ANOVA, median test, run test, sign test,
etc. is appropriate again needs the hands of experts. This limitation of the
statistics limits the range of its effective users.
61
Matrices, Determinants (3) Interval Scale
and Collection of Data
(4) Ratio Scale
Let us discuss these scales of measurement one by one:
(1) Nominal Scale
In Latin, ‘Nomen’ means name. The word nominal has come from this Latin
word, i.e. ‘Nomen’. Therefore, under nominal scale we divide the objects under
study into two or more categories by giving them unique names. The
classification of objects into atleast two or more categories is done in such a
way that
(a) Each object takes place only in one category, i.e. each object falls in a
unique category, i.e. it either belongs to a category or not. Mathematically,
we may use the symbol (“=”, “ ”) if an object falls in a category or not.
(b) Number of categories must be sufficient to include all objects, i.e. there
should not be scope for missing even a single object which does not fall in
any of the categories. That is, in statistical language categories must be
mutually exclusive and exhaustive.
Generally nominal scale is used when we want to categories the data based
on the characteristic such as gender, race, region, religion, etc.
To get more familiar with the idea of nominal scale, let us consider some
examples:
(i) Classification into Different Categories Based on Gender
This can be done by dividing the population into two categories male ‘M’
and female ‘F’
Category Name/Code
Male M
Female F
Here we have named male as ‘M’ and female as ‘F’. This is not the only
way, we can also code male by ‘0’ and female by ‘1’ or we may use any
other convenient symbols. So, we note that main thing is that we have to
give a unique name to each category.
(ii) Classification into Different Categories Based on Caste
Here also we can give a code to general, scheduled caste, scheduled tribes,
backward class and other categories by ‘0’, ‘1’, ‘2’, ‘3’, ‘4’ respectively.
(iii) Classification into Different Categories Based on Region
28 states and 7 union territories together classified India into 35 categories
which can be coded by their usual names or may be coded by using some
other symbols.
62
(iv) Classification into Different Categories Based on Religion Introduction to Statistics
To get more familiar with the concept of interval scale, let us consider some
examples:
(i) The measurement of time of an historical event comes under interval
scale because there is no fixed origin of time (i.e. ‘0’ year). As’0’ year
differ calendar to calendar or society/country to society/country e.g.
Hindus, Muslim and Hebrew calendars have different origin of time, i.e.
‘0’ year is not defined. In Indian history also, we may find BC (Before
Christ).
65
Matrices, Determinants includes , in addition to +, –, >, <, =, . But be careful never take ‘0’ in the
and Collection of Data
4
denominator while finding ratios. For example, is meaningless.
0
To get more familiar with the concept of ratio scale let us consider some
examples, where ratio scale is used:
(i) Measurement of temperature in Kelvin scale comes under ratio scale
because it has an absolute zero which is equivalent to 273.150 C . This
characteristic of origin allows us to make the statement like 50K (‘50K’
read as 50 degree Kelvin) is 5 time hot compare to 10K.
(ii) Measurement of money also comes under ratio scale because it satisfies
all the requirement of interval scale and has a natural zero. For example,
suppose there are 60 teachers in a particular school in Delhi. If we
associate a unique number to each teacher related to the cash (in rupees)
he/she has with him/her at the time of investigation. Then we have a
fixed whole number corresponding to each teacher. Of course two or
more teachers may have same cash (in rupees). These teachers will be
allotted the same whole number and will fall in one category. Here we
note that, the whole numbers allotted to the teachers can be ordered, have
an actual difference and also have origin (i.e. absolute zero ‘0’). Here
natural zero indicates the absence of money in the pocket of the teacher.
If a teacher has Rs 500 and another teacher has Rs 100 then we can say
that the teacher having Rs 500 has 5 times amount than a teacher having
Rs 100. Thus it satisfies all the requirement of ratio scale.
(iii) Both height (in cm.) and age (in days) of students of M.Sc. Statistics of a
particular university satisfy all the requirements of a ratio scale. Because
height and age both cannot be negative (i.e have an absolute zero).
Permissible Statistical Tools
One of the advantages of measurement scale is that these help us to decide
which statistical tool should be used in a given situation.
Table 11.1 shows the list of permissible statistical tools in case of nominal,
ordinal, interval and ratio scales. Based on information provided by these
scales, their levels from lowest to height are nominal, ordinal, interval and ratio
(see Fig 11.1). That is why all the Statistical tools applicable on the lower scale
will automatically be applicable on the next level scale. So, we will not repeat
the permissible statistical tools used in lower level scale. It is understood that
statistical tools which are permissible for nominal will be permissible in case of
ordinal and so on.
Fig. 11.1
66
Table 11.1 Introduction to Statistics
Before closing this section let us consider some situations and appropriate
measurement scale that can be used with the help of some examples followed
by some exercises.
Example 1: If you want to collect the data based on the characteristic of
literacy then which scale will be used? Explain with reasons.
Solution: Appropriate scale is nominal scale because population can be
categorised in two categories literate (L) and illiterate (I). The symbols for
literate and illiterate can be used according to our choice like 0, 1 or A, B or X,
Y, etc.
Example 2: At a picnic spot in India, 1000 tourists visit over a period of 7
days. Each tourist is asked the name of the country of his/her birth. Then the
data thus obtained come under which measurement scale.
Solution: Nominal scale, because the characteristic ‘name of the country’
divides the tourists into different categories each labels with the name of
his/her country.
Example 3: Answer the following questions:
(i) Which scale is at lowest level?
(ii) Which scale is at highest level?
(iii) Which scale has absolute zero?
(iv) Which scale is used to find the mean sea level (MSL)?
Solution:
(i) Nominal scale is at lowest level, because it has only one permissible
operation counting.
(ii) Ratio scale is at highest level, because it has all the four operations
counting, order, distance and absolute zero.
(iii) Ratio scale is only scale out of the four measurement scales nominal,
ordinal, interval and ratio scales which has absolute zero.
(iv) Because sea level has no absolute zero, so interval scale is used to find the
mean sea level (MSL).
67
Matrices, Determinants Example 4: Answer the following questions:
and Collection of Data
(i) In which scale median is not permissible?
(ii) In which scale(s) mean is not permissible?
(iii) In which scale(s) geometric mean and harmonic mean are not permissible?
(iv) In which scale geometric mean and harmonic mean are permissible?
Solution:
(i) In order to find median, we have to arrange the data in ascending or
descending order of magnitude. But in nominal scale order operation is not
present. So, in case of nominal scale data, median is not permissible.
(ii) In order to find mean, each observation of the data must be associated with
a numerical quantity (which exactly measure the quantity of the
characteristic). But, this requirement is not fulfilled by nominal and ordinal
scales data. So, mean is not permissible in case of nominal and ordinal
data.
(iii) In order to find geometric mean (G.M.) and harmonic mean (H.M.)
absolute zero must be defined so that one can talk of quotient/ratio of two
numbers. But as absolute zero is defined only in ratio scale, so G.M. and
H.M. are not permissible in nominal, ordinal and interval scales data.
(iv) As discussed in (iii), G.M. and H.M. are defined only in ratio scale.
68
Quantitative data Introduction to Statistics
based on the characteristic
Qualitative data
Discrete data
based on nature of the characteristic
Continuous data
Nominal data
Ordinal data
based on level of measurement
Interval data
Ratio data
TimeSeries data
based on time component
Cross Sectional data
Pr imary data
based on the ways of obtaining the data
Secondary data
Let us discuss different types of data one by one:
Quantitative Data
As the name quantitative itself suggests that it is related to the quantity. In fact,
data are said to be quantitative data if a numerical quantity (which exactly
measure the characteristic under study) is associated with each observation.
Generally, interval or ratio scales are used as a measurement of scale in case of
quantitative data. Data based on the following characteristics generally gives
quantitative type of data. Such as weight, height, ages, length, area, volume,
money, temperature, humidity, size, etc.
For example,
(i) Weights in kilogram (say) of students of a class.
(ii) Height in centimeter (say) of the candidates appearing in a direct
recruitment of Indian army organised by a particular cantonment.
(iii) Age of the females at the time of marriage celebrated over a
period of week in Delhi.
(iv) Length (in cm) of different tables in a showroom of furniture.
Here, is an exercise for you
E 3 Provide an example based on each of the following characteristic:
(i) Area (ii) Volume (iii) Money (iv) Temperature (v) Humidity (vi) Size
Qualitative Data
As the name qualitative itself suggests that it is related to the quality of an
object/thing. It is obvious that quality cannot be measured numerically in exact
terms. Thus, if the characteristic/attribute under study is such that it is
measured only on the bases of presence or absence then the data thus obtained
is known as qualitative data.
Generally nominal and ordinal scales are used as a measurement of scale in
case of qualitative data. Data based on the following characteristics generally
gives qualitative data. Such as gender, marital status, qualification, colour,
religion, satisfaction, types of trees, beauty, honesty, etc.
69
Matrices, Determinants For example,
and Collection of Data
(i) If the characteristic under study is gender then objects can be divided into
two categories, male and female.
(ii) If the characteristic under study is marital status then objects can be
divided into four categories married, unmarried, divorcee, widower.
(iii) If the characteristic under study is qualification (say) ‘matriculation’ then
objects can be divided into two categories as ‘Matriculation passed’ and
‘not passed’.
(iv) If the characteristic under study is ‘colour’ then the objects can be divided
into a number of categories Violet, Indigo, Blue, Green, Yellow, Orange
and Red.
Here, is an exercise for you.
E 4 Give an example based on the following characteristic:
(i) Religion (ii) Satisfaction
Discrete Data
If the nature of the characteristic under study is such that values of observations
may be at most countable between two certain limits then corresponding data
are known as discrete data (concept of countability have already been discussed
in Sec 2.6 of Unit 2 of this course).
For example,
(i) Number of books on the self of an elmira in a library form discrete data.
Because number of books may be 0 or 1 or 2 or 3,…. But number of
books cannot take any real values such as 0.8, 1.32, 1.53245, etc.
(ii) If there are 30 students in a class, then number of students presents in a
lecture forms discrete data. Because number of present students may be 1
or 2 or 3 or 4 or…or 30. But number of present students cannot take any
real values between 0 and 30 such as 1.8675, 22.56, 29.95, etc.
(iii) Number of children in a family in a locality forms discrete data. Because
number of children in a family may be 0 or 1 or 2 or 3 or 4 or…. But
number of children cannot take any real values such as 2.3, 3.75, etc.
(iv) Number of mistakes on a particular page of a book. Obviously number of
mistakes may be 0 or 1 or 2 or 3…. But cannot be 6.74, 3.9832, etc.
Continuous Data
Data are said to be continuous if the measurement of the observations of a
characteristic under study may be any real value between two certain limits.
For example,
(i) Data obtained by measuring the heights of the students of a class of say 30
students form continuous data, because if minimum and maximum heights
are 152cm and 175 cm then heights of the students may take any possible
values between 152 cm and 175 cm. For example, it may be 152.2375 cm,
160.31326… cm, etc.
(ii) Data obtained by measuring weights of the students of a class also form
continuous data because weights of students may be 48.25796…kg,
50.275kg, 42.314314314…kg, etc.
70
Here is an exercise for you. Introduction to Statistics
E 5) Identify whether the data are discrete or continuous in the following cases:
(i) Number of people present in a party.
(ii) Length of leafs of a plant.
(iii) Lifetime in hours of an electrical bulb.
(iv) Number of cars standing in a showroom over a period of 7 days.
(v) Number of patients visited to a hospital on a particular day.
Nominal Data
Data collected using nominal scale is called nominal data.
Similarly, data collected using ordinal scale, interval scale and ratio scale are
called ordinal data, interval data and ratio data respectively. These scales of
measurement have already been discussed in detail in Sec. 11.6.
Time Series Data
Collection of data is done to solve a purpose in hand. The purpose may have its
connection with time, geographical location or both. If the purpose of data
collection has its connection with time then it is known as time series data.
That is, in time series data, time is one of the main variables and the data
collected usually at regular interval of time related to the characteristic(s) under
study show how characteristic(s) changes over the time.
For example, quarterly profit of a company for last eight quarters, yearly
production of a crop in India for last six years, yearly expenditure of a family
on different items for last five years, weekly rate of inflation for last ten weeks,
etc. all form time series data.
Yearly expenditures (in Rs) for a family on different items from 2006 to 2010
are given in the following table.
Year Food Education Rent Miscellaneous Total
2006 40000 10000 36000 20000 106000
2007 45000 12000 40000 28000 125000
2008 54000 15000 45000 32000 146000
2009 60000 20000 50000 40000 170000
2010 70000 30000 55000 45000 2000000
Data given in above table is nothing but time series data.
Note 2: If the purpose of the data collection has its connection with
geographical location then it is known as Spatial Data.
For example,
(i) Price of petrol in Delhi, Haryana, Punjab, Chandigarh at a particular time.
(ii) Number of runs scored by a batsman in different matches in a one day
series in different stadiums.
Note 3: If the purpose of the data collection has its connection with both time
and geographical location then it is known as Spacio Temporal Data.
For example, data related to population of different states in India in 2001 and
2011 will be Spacio Temporal Data.
Note 4: In time series data, spatial data and spacio temporal data we see that
concept of frequency have no significance and hence known as non-frequency
71
Matrices, Determinants data. For instance, in the example discussed in case of time series data,
and Collection of Data expenditure of Rs 40000 on food in 2006 is itself important, here its frequency
say 3 (repeated three times) does not make any sense.
Note 5: Now consider the case of marks of 40 students in a class out of 10
(say). Here we note that there may be more than one student who score same
marks in the test. Suppose out of 40 students 5 score 10 out of 10, it means
marks 10 have frequency 5. This type of data where frequency is meaningful is
known as frequency data.
Cross Sectional Data
Sometimes we are interested to know that how a characteristic (such as income
or expenditure, population, votes in an election, etc.) under study at one point
in time is distributed over different subjects (such as families, countries,
political parties, etc.). This type of data which is collected at one point in time
is known as cross sectional data.
For example, annual income of different families of a locality, survey of
consumer’s expenditure conducted by a research scholar, opinion polls
conducted by an agency, salaries of all employees of an institute, etc.
Note 6:
(i) If you are interested to know the changes in a characteristic say
expenditure of a family over a period of time then you have to use time
series data.
(ii) If you are interested to know the changes in a characteristic say
expenditure of different families at single point in time you have to use
cross sectional data.
Primary Data
Data which are collected by an investigator or agency or institution for a
specific purpose and these people are first to use these data, are called primary
data. That is, these data are originally collected by these people and they are
first to use these data. Primary data have been discussed in Sec. 12.2 of next
unit (i.e. UNIT 12) of this course in detail.
For example, suppose a research scholar wants to know the mean age of
students of M.Sc. Chemistry of a particular university. If he collects data
related to the age of each student of M.Sc. Chemistry of that particular
university by contacting each student personally then data so obtained by the
research scholar is an example of primary data for the same research scholar.
Secondary Data
Data obtained/gathered by an investigator or agency or institution from a
source which already exists, are called secondary data. That is, these data were
originally collected by an investigator or agency or institution and has been
used by them at least once. And now, these data are going to be used at least
second time. Secondary data have been discussed in Sec. 12.3 of next unit (i.e.
UNIT 12) of this course in detail.
For example, consider the same example as discussed in case of primary data.
If the research scholar collects the ages of the students from the record of that
particular university, then the data thus obtained is an example of secondary
data. Note that, in both the cases data remain the same, only way of collecting
the data differs.
72
Introduction to Statistics
11.8 SUMMARY
In this unit, we covered following topics:
1) Origin and development of statistics.
2) Definitions of statistics by different authors.
3) Scope and uses of statistics.
4) Limitations of statistics.
5) Different measurement scales and types of data.
6) Frequency and non frequency data.
11.9 SOLUTIONS/ANSWERS
E 1 (i) Ratio scale is considered as the best measurement scale so far as the
criteria of information provide are concerned because all the four
operations counting, order, distance and absolute zero are defined on
the observations.
(ii) Measurement of height, weight, age requires absolute zero and only
ratio scale has absolute zero. So, appropriate scale of measurement for
height, weight, age is ratio scale.
(iii) Allotment of license plates to the different cars comes under nominal
scale measurement, because license plates categories the cars or
license plates only provide unique names to the cars. Further, the car
remains the same if some other registration number is provided to it.
(iv) Characteristic of equal distance between any two observations is
maintained by two scales of measurements interval and ratio scales.
For example, distance between temperatures of 18K and 13K is same
as distance between 100K and 105K.
E 2) Blood group just divides the objects/things into four categories named as
A, B, AB, O. So it comes under nominal scale.
E 3) Answers are not unique. There are a number of examples for each part,
here one answer is provided for each part.
(i) Area of each state (in km 2 ) of India.
(ii) Volume of different buckets available at a particular shop.
(iii) Income of each family over a period of one year in a particular
locality.
(iv) Highest or lowest temperature of a place over a period of 50 days.
(v) Level of humidity of a particular place at each hour of a particular
day.
(vi) Size of different shoes present at a particular showroom on a specified
day.
E 4) (i) If the characteristic under study is ‘religion’ then the objects can be
divided into five categories Hindu, Muslim, Sikh, Isaiah, and others.
(ii) If the characteristic under study is ‘satisfaction’ then the objects can be
divided into five categories (Likert scale) as shown on the next page:
73
Matrices, Determinants Highly Satisfied Neutral Dissatisfied Highly
and Collection of Data satisfied dissatisfied
5 4 3 2 1
OR
2 1 0 –1 –2
74