Brm file 3
Brm file 3
METHODOLOGY LAB
SPSS, which stands for Statistical Package for the Social Sciences, is a
powerful software tool used for statistical analysis in various fields.
Originally developed by Stanford University students in 1968, it has since
become one of the most widely used statistical software packages for
researchers, analysts, and social scientists.
SPSS is mainly used in the following areas like healthcare, marketing, and
educational research, market researchers, health researchers, survey
companies,
1. Variable View
2. Data View
Variable View
● Name: It is a column field that accepts a unique ID that helps in sorting the
data.
Some of the parameters for sorting data are name, gender, sex, educational
● Label: It gives the label and allows you to add special characters.
● Decimal: It helps us understand how to define the digits required after the
decimal
● Measure: It measures the data that is being entered in the tools, such as
Data View
The data view is displayed as rows and columns. You can import a file or add data
manually.
● When you open a dataset or create a new one in SPSS, you will typically
● You can switch between Data View and Variable View using the tabs at
2. Data Entry:
● You can directly enter data into cells, similar to working in a spreadsheet.
3. Variable Columns:
● Each column in Data View represents a variable. The top row often
● You can scroll horizontally to view additional variables if there are more
● Numeric variables contain numerical data (e.g., age, income), and string
● Depending on the variable type, SPSS will enforce appropriate data entry
rules.
5. Missing Values:
● You can designate missing values in the cells. SPSS provides codes for
● You can sort the data by one or more variables to arrange it in a specific
order.
7. Data Cleaning:
● Data View is often used for initial data cleaning tasks, such as
recoding variables.
9. Toolbar Operations:
● The toolbar at the top of the Data View provides various operations, such
● The leftmost column displays row numbers, helping you keep track of the
order of cases.
SPSS has its own data file format. Other file formats it easily deals with
include MS Excel, plaintext files, SQL, Stata and SAS.
Web analytics data -often downloaded as Excel files- can easily be opened and
further analysedin SPSS.
In real world research, raw data usually need some editing before they can be
properly analysed.Typical examples are creating means or sums as new
variables, restructuring data ordetecting and removing unlikely observations.
SPSS performs such tasks -and more complex
ones with amazing efficiency. For getting things done fast, SPSS
contains manynumericfunctions, string functions, date functions and
other handy routines.
All basic tables and charts can be created easily and fast in SPSS. Typical
examples
are demonstrated under Data Analysis. A real weakness of SPSS is that its
charts tend to be uglyand often have a clumsy layout. A great way to
overcome this problem is developing and applying SPSS chart templates.
Doing so, however, requires a fair amount of effort and expertise
The options for output are even more elaborate: charts are often copy-
pasted as images in png format. For tables, rich text format is often used
because it retains the tables' layout, fonts and borders Besides copy-pasting
individual output items, all output items can be exported in one gopdf,
HTML, MS Word and many other file formats.
2. Descriptive Statistics:
● SPSS can generate descriptive statistics such as mean, median, mode,
standard deviation, and range.
● Frequency distributions and cross-tabulations can be easily computed.
4. Inferential Statistics:
● SPSS supports a wide range of inferential statistical tests, including t-
tests, ANOVA, regression analysis, chi-square tests, and more.
● Users can perform hypothesis testing and assess relationships between
variables.
5. Data Transformation:
● SPSS allows users to create new variables, recode existing ones, and
transform data using various mathematical and logical operations.
7. Data Visualization:
pg. 11715501 YAMIKA 3
● In addition to basic charts, SPSS offers advanced data visualization
options, including 3D charts and interactive graphs.
9. Database Integration:
● SPSS can connect to various sources, including databases,
spreadsheets, and other data files, facilitating data import and
export.
S in ce it o ffe rs a w id e varie ty o f la ng ua ge s, a lo t o f p eo p le c an ac ce
ss
resu lting sta tistic al an alysis w ill no t g ive th e rig h t a nsw ers. If the g a p
Step II: O nc e afte r tra nsferring the d ata, the d ata w ill b e c
lassifie d into tw o typ e s-Da ta view & V a ria b le v ie w by the
SP S S.
DATA VIEW
UNDERSTANDING
PURCHASING PATTERN
OFPURFUMES OF
DIFFERENT AGEGROUP.
https://docs.google.com/forms/d/e/1FAIpQLSee7Q-
kdK6oAUZHo_aTIjAdpVYPg02vHrrToD6e0CRf TkzOd_hrrrrw/viewform?
It will look like this after you Apply ( IF Condition) in excel sheet.
isto g ram s
C ro sstab s & c hi sq ua re
In d ep e nd e nt sam p le
T -te st O ne -w ay A N O
VA
● FREQUENCY
Vali Cumulati
Frequency Perce d ve
nt
18 to 28 84.8 93.3Perce Percen
93.3
24 nt t
25 to 2 6.1 6.7 100.0
30
Total 30 90.9 100.0
pg. 11715501 YAMIKA 3
System 3 9.
1
Total 33 100.
0
Where do typically
Valid Cumulati
you Frequency Perce Perce ve perfumes
nt nt Percen
purchase t
your from Valid The fragrance
shop
7 21.2 23.3 23.3
In this table the first column lists the labels we assigned to the four
levels of this variable i.e., 1: The fragrance shop; 2: The Perfume Shop, 3:
Bodyshop and 4:
Online shop. The Frequency column displays the frequency of each this
score (in case, category). This shows that out of the 30 people 7
people purchase from fragrance shop, 11 people purchase from from
perfume shop, 1 person purchases
In this table the first column lists the labels we assigned to the four
levels of this variable i.e., 1: single bottles; 2: gift sets, 3: hampers and
4: deodorants/body sprays. The Frequency column displays the
frequency of each score (in this case, category). This shows that out of the
30 people 7 people purchase from fragrance shop, 11 people purchase
from perfume shop, 1 person purchases from Bodyshop
and the rest 11 people purchases from online shop. These frequencies are
converted to percentages in the Percent column i.e., 21.2% for 1st
Valid Cumulativ
e
Frequen Percent Perce Percent
cy nt
Valid male 16 48.5 53.3 53.3
female 14 42.4 46.7 100.0
Total 30 90.9 100.0
Missin System 3 9.1
g
Total 33 100.0
In this table the first column lists the labels we assigned to the four
levels of this variable i.e., 1: male and 2: female. The Frequency column
displays the frequency of each score (in this case, category). This shows
that out of the 30 people, 16 are male and the rest 14 are females. These
frequencies are converted to percentages in the Percent column i.e.,
48.5% for 1st category, 42.4% for 2nd category. Valid Percent column
shows values 53.3% for 1st category and 46.7% for 2nd category
Valid Cumulati
Frequen Percen Percen ve
cy t t Percent
Valid Floral 6 18.2 20.0 20.0
Oriental 1 3.0 3.3 23.3
woody 2 6.1 6.7 30.0
fresh 17 51.5 56.7 86.7
others 4 12.1 13.3 100.0
Total 30 90.9 100.0
Missin System 3 9.1
g
Total 33 100.0
In this table the first column lists the labels we assigned to the four
pg. 11715501 YAMIKA 3
levels of this variable i.e., 1: floral; 2: oriented, 3: woody, 4: fresh and 5:
others. The Frequency column displays the frequency of each score (in
this case, category). This shows
In this table the first column lists the labels we assigned to the four
levels of this variable i.e., 1: parfum; 2: eau de parfum, 3: eau de
cologne and 4: don’ t
know.
The Frequency column displays the frequency of each score (in this case,
category). This shows that out of the 30 people 10 people prefer parfum ,
4 people prefer eau de parfum , 2 people prefer eau de cologne and the
rest 14 people
don’ t know . These frequencies are converted to percentages in the
Percent column i.e., 30.3% for 1st category, 12.1% for 2nd category,
6.1% for 3rd category
pg. 11715501 YAMIKA 3
an 42.4% for 4th category. Valid Percent column shows values 33.3% for
d 1st
category, 13.3% for 2nd category, 6.7% for 3rd category and 46.7% for 4th
category.
In this table the first column lists the labels we assigned to the four
levels of this variable i.e., 1: often(daily); 2: sometimes, 3: rarely, 4: only
on special occasions
an 5: not at all. The Frequency column displays the frequency of (in
d each score case, category). This shows that out of the 30 13
thi people use perfume
s
often(daily), 9 people use sometimes, 2 people rarely use perfume, 4
people use only on special occasions and the rest 2 people doesn’ t
uses it at all. These frequencies are converted to percentages in the
Percent column i.e., 39.4% for 1st category, 27.3% for 2nd category, 6.1%
for 3rd category, 12.1% for 4th category and 6.1% for 5th category. Valid
Percent column shows values 43.3% for 1st category,
30.0 % for 2nd category, 6.7% for 3rd category, 13.3%% for 4th category
and 6.7% for others.
Crosstab
How much do you
typically spend
on one perfume/fragrance
3000- Tot
1000-2000 2000- 4 al
3000 000
What is m Count 15 0 1 16
your a % within 93.8% 0.0% 6.3% 100.0%
gender What
l is your
e gender
% within 62.5% 0.0% 25.0% 53.3%
How much
do you
typically
spend on
one
perfume/fr
agra
nce
f Count 9 2 3 14
e % within 64.3% 14.3% 21.4% 100.0%
What
m is your
gender
a % within 37.5% 100.0% 75.0% 46.7%
l How much
e do you
typically
spend on
one
perfume/fr
agra
nce
Total Count 24 2 4 30
% within 80.0% 6.7% 13.3% 100.0%
What
is your
gender
% within 100.0% 100.0% 100.0 100.0%
How much %
do you
typically
spend on
one
perfume/fr
pg. 11715501 YAMIKA 3
agra
nce
Chi-Square Tests
Asympto
tic
Value df Significa
nce (2-
sided)
Pearson Chi- 4.386a 2 .112
Square
Likelihood Ratio 5.202 2 .074
Linear-by-Linear 2.942 1 .086
Association
N of Valid Cases 30
a. 4 cells (66.7%) have expected count less
than 5. The minimum expected count is .93.
Crosstab
What is your age
18 to 25 to Total
24 30
What is your male Count 14 2 16
gender % within What is 87.5% 12.5% 100.0%
your
gender
% within What is 50.0% 100.0% 53.3%
your
age
pg. 11715501 YAMIKA 3
fema Count 14 0 14
le
% within What is 100.0% 0.0% 100.0%
your
gender
% within What is 50.0% 0.0% 46.7%
your
age
Total Count 28 2 30
50% of male
Chi-Square Tests
and Asympto 50% of
tic Exact Exact
Value df Significan Sig. (2- Sig. (1-
ce sided) sided)
(2-sided)
Pearson Chi-Square 1.875a 1 .171
Continuity .404 1 .525
Correctionb
Likelihood Ratio 2.639 1 .104
Fisher's Exact Test .485 .276
Linear-by-Linear 1.813 1 .178
2 cells (50.0%) have expected count less than 5. The minimum expected count is .93.
Computed only for a 2x2 table
pg. 11715501 YAMIKA 3
pg. 11715501 YAMIKA 3
The Pearson Chi-spare indicates that there are no significant
differences
between groups, given by the p-value more than 0.05 which is
0.171 therefore we can accept the alternative hypothesis.
CRO S STA B
Crosstab
How do you like purchasing your scents
Deodorants
Singl Hamp /
e Gift er Body Total
What is your m bottl sets s spra 16
gender Count es 0 0 ys
9 7
a % within What 56.3% 0.0% 0.0% 43.8% 100.0
l is %
your gender
e % within How 47.4% 0.0% 0.0% 87.5% 53.3%
do
you like
purchasing
your
scents
f Count 10 2 1 1 14
e % within What 71.4% 14.3% 7.1% 7.1% 100.0
is %
m your gender
a % within How 52.6% 100.0% 100.0 12.5% 46.7%
Total do %
l you like
e purchasing
your
scents
Count 19 2 1 8 30
% within What 63.3% 6.7% 3.3% 26.7% 100.0
is %
Chi-Square Tests
Asympto
tic
Value d Significan
f ce
(2-
sided)
Pearson Chi- 7.452a 3 .049
Square
Likelihood Ratio 9.140 3 .027
Linear-by-Linear 2.756 1 .097
Association
N of Valid Cases 30
a. 6 cells (75.0%) have expected count less than
5. The minimum expected count is .47.
Crosstab
Which scent(s) do you like best
Orien fre
Flora ta wood s other Total
l l y h s
What is your mCount 0 1 2 9 4 16
gender a % within 0.0% 6.3% 12.5% 56. 25.0% 100.0
l What is your 3 %
gender %
e % within Which 0.0% 100.0 100.0 52. 100.0 53.3%
% % 9 %
scent(s) do you %
like
Chi-Square Tests
Asymptot
ic
Value d f Significa
nce (2-
sided)
Pearson Chi- 12.983a 4 .011
Square
Likelihood Ratio 17.947 4 .001
Linear-by-Linear 6.728 1 .009
Association
N of Valid Cases 30
a. 8 cells (80.0%) have expected count less
than 5. The minimum expected count is
.47.
Crosstab
Which type of perfume do you
prefer
Eau Don't
de Eau know Tota
Parfum Parf de (unsure of l
u Colog the
m ne differenc
e)
What is your Count 5 3 1 7 16
gender a % within 31.3% 18.8% 6.3% 43.8% 100.0
What is %
l your gender
e % within 50.0% 75.0% 50.0% 50.0% 53.3%
Which type of
perfume
do you prefer
f Count 5 1 1 7 14
e % within 35.7% 7.1% 7.1% 50.0% 100.0
What is %
a your gender
% within 50.0% 25.0% 50.0% 50.0% 46.7%
l Which type of
e perfume
do you prefer
Total Count 10 4 2 14 30
% within 33.3% 13.3% 6.7% 46.7% 100.0
What is %
your gender
% within 100.0 100.0 100.0 100.0% 100.0
Which type of % % % %
perfume
do you prefer
Step III: p re ss O K to p ro c e ed
On Std.
e- Mean Error
Si
de Two- Differ Differ
Si e e
F Sig t d d p ded nce nce Lowe Uppe
. f p r r
How Equal 1.8 .18 -1.01 28 .15 .318 -.482 .474 -1.45 .489
6
often varian 1 3 7 9 3
ce
do s
you
use assum
e
perfu d
m
es/ Equal -.996 23.42 .16 .330 -.482 .484 -1.48 .519
frag 9
rance varian 5 3
s ce
s not
assum
e
d
Step III: p re ss O K to p ro c ee d
Step II:
Put all the scale va riab les or Dep end en t value in “ Variables” section ( 4),
and select
“ Pearson” as Correla tion coefficient (5 ), also “ Tw o-tailed ” as Test of
significance (6 ). Th en select“ Flag sig nifica nt
correla tions” at the b ottom (7 ).
Assumptions
When you choose to analyse your data using linear regression,
part of the process involves checking to make sure that the data
you want to analyse can actually be analysed using linear
regression. You need to do this because it is only appropriate to
use linear regression if your data "passes" seven assumptions
that are required for linear regression to give you a valid result. In
practice, checking for these seven assumptions just adds a little bit
more time to your analysis, requiring you to click a
few more buttons in SPSS Statistics when performing your analy
pg. 11715501 YAMIKA 3
sis, as well as a little bit more about your data, but it is not think
a difficult task.
Before we introduce you to these seven assumptions, do not be
surprised if, when analysing your own data using SPSS Statistics,
one or more of these assumptions is violated (i.e., not met). This is
not uncommon when working with real-world data rather than
textbook examples, which often only show you how to carry out
linear