Technologi Acceptance Method Davis, 1989
Technologi Acceptance Method Davis, 1989
Technologi Acceptance Method Davis, 1989
net/publication/200085965
CITATIONS READS
18,683 49,334
2 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Fred D. Davis on 07 January 2015.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Management Information Systems Research Center, University of Minnesota is collaborating with JSTOR to
digitize, preserve and extend access to MIS Quarterly.
http://www.jstor.org
Technology
Introduction
By: Fred D. Davis Information technologyoffersthe potentialforsub-
Computer and Information Systems stantially improving white collar performance
Graduate School of Business (Curley, 1984; Edelman, 1981; Sharda, et al.,
1988). But performance gains are often ob-
Administration structed by users' unwillingnessto accept and
University of Michigan use available systems (Bowen, 1986; Young,
Ann Arbor, Michigan 48109 1984). Because of the persistence and impor-
tance of this problem, explaining user accep-
tance has been a long-standingissue in MIS
research (Swanson, 1974; Lucas, 1975; Schultz
and Slevin, 1975; Robey, 1979; Ginzberg,1981;
Swanson, 1987). Althoughnumerousindividual,
organizational,and technologicalvariableshave
Abstract been investigated(Benbasat and Dexter, 1986;
Validmeasurement scales for predicting user Franz and Robey, 1986; Markus and Bjorn-
acceptance of computers are in short supply. Anderson, 1987; Robey and Farrow,1982), re-
Most subjective measures used in practice are search has been constrained by the shortage
unvalidated, and their relationship to system of high-qualitymeasures for key determinants
of user acceptance. Past research indicatesthat
usage is unknown. The present research de-
velops and validates new scales for two spe- many measures do not correlate highly with
cific variables, perceived usefulness and per- system use (DeSanctis, 1983; Ginzberg, 1981;
ceived ease of use, which are hypothesized to Schewe, 1976; Srinivasan, 1985), and the size
be fundamental determinants of user accep- of the usage correlationvaries greatlyfromone
tance. Definitionsfor these two variables were study to the next depending on the particular
used to develop scale items thatwere pretested measures used (Baroudi,et al., 1986; Barkiand
for content validityand then tested for reliability Huff,1985; Robey, 1979; Swanson, 1982, 1987).
and construct validityin two studies involving The developmentof improvedmeasures for key
a total of 152 users and four applicationpro- theoreticalconstructs is a research priorityfor
grams. Themeasures were refinedand stream- the informationsystems field.
lined, resultingin two six-item scales with reli- Aside fromtheir theoreticalvalue, better meas-
abilities of .98 for usefulness and .94 for ease ures for predictingand explaining system use
of use. The scales exhibited high convergent, would have great practicalvalue, both for ven-
discriminant, and factorialvalidity.Perceiveduse- dors who would like to assess user demand for
fulness was significantlycorrelatedwithbothself- new design ideas, and for informationsystems
reported current usage (r=.63, Study 1) and managers withinuser organizationswho would
self-predictedfutureusage (r= .85, Study2). Per- like to evaluate these vendor offerings.
ceived ease of use was also significantlycorre-
lated with currentusage (r=.45, Study 1) and Unvalidatedmeasures are routinelyused in prac-
futureusage (r=.59, Study 2). In both studies, tice today throughout the entire spectrum of
usefulness had a significantlygreater correla- design, selection, implementationand evaluation
tion with usage behavior than did ease of use. activities.For example: designers withinvendor
Regression analyses suggest that perceived organizationssuch as IBM(Gould,et al., 1983),
ease of use may actually be a causal antece- Xerox(Brewley,et al., 1983), and DigitalEquip-
ment Corporation(Good, et al., 1986) measure believe that the systems is too hardto use and
user perceptions to guide the development of that the performancebenefits of usage are out-
new informationtechnologies and products;in- weighed by the effort of using the application.
dustry publications often report user surveys That is, in additionto usefulness, usage is theo-
(e.g., Greenberg,1984; Rushinekand Rushinek, rizedto be influencedby perceived ease of use.
1986); several methodologies for software se-
lection call for subjective user inputs (e.g., Perceived usefulness is defined here as "the
Goslar, 1986; Kleinand Beck, 1987); and con- degree to which a person believes that using
temporarydesign principles emphasize meas- a particularsystem would enhance his or her
uringuser reactionsthroughoutthe entiredesign job performance."This follows from the defini-
process (Andersonand Olson 1985; Gould and tion of the word useful: "capableof being used
Lewis, 1985; Johansen and Baker,1984; Mantei advantageously."Withinan organizationalcon-
and Teorey, 1988; Norman,1983; Shneiderman, text, people are generally reinforcedfor good
1987). Despite the widespread use of subjec- performance by raises, promotions, bonuses,
tive measures in practice, littleattentionis paid and other rewards(Pfeffer,1982; Schein, 1980;
to the qualityof the measures used or how well Vroom,1964). A system high in perceived use-
they correlate with usage behavior. Given the fulness, in turn,is one for which a user believes
low usage correlations often observed in re- in the existence of a positive use-performance
search studies, those who base importantbusi- relationship.
ness decisions on unvalidatedmeasures may Perceived ease of use, in contrast,refersto "the
be gettingmisinformedabout a system's accept-
degree to which a person believes that using
abilityto users. a particularsystem would be free of effort."This
The purpose of this research is to pursue better follows from the definitionof "ease": "freedom
measures for predictingand explaininguse. The from difficultyor great effort."Effortis a finite
investigation focuses on two theoretical con- resource that a person may allocate to the vari-
structs, perceived usefulness and perceived ous activitiesfor which he or she is responsible
ease of use, which are theorized to be funda- (Radner and Rothschild, 1975). All else being
mental determinantsof system use. Definitions equal, we claim, an applicationperceived to be
forthese constructsare formulatedand the theo- easier to use than another is more likelyto be
reticalrationalefor their hypothesized influence accepted by users.
on system use is reviewed.New, multi-item meas-
urementscales for perceivedusefulness and per-
ceived ease of use are developed, pretested,
and then validatedin two separate empiricalstud- Theoretical Foundations
ies. Correlationand regression analyses exam- The theoreticalimportanceof perceived useful-
ine the empiricalrelationshipbetween the new ness and perceivedease of use as determinants
measures and self-reportedindicantsof system of user behavioris indicatedby several diverse
use. The discussion concludes by drawingim- lines of research. The impactof perceived use-
plicationsfor future research. fulness on system utilizationwas suggested by
the workof Schultzand Slevin (1975) and Robey
(1979). Schultz and Slevin (1975) conductedan
exploratoryfactor analysis of 67 questionnaire
Perceived Usefulness and items, which yielded seven dimensions. Of
these, the "performance" dimension,interpreted
Perceived Ease of Use by the authors as the perceived "effect of the
Whatcauses people to accept or rejectinforma- model on the manager'sjob performance,"was
tiontechnology?Amongthe manyvariablesthat most highlycorrelatedwithself-predicteduse of
may influencesystem use, previousresearchsug- a decision model (r=.61). Usingthe Schultzand
gests two determinantsthat are especially im- Slevinquestionnaire,Robey (1979) findsthe per-
portant.First,people tend to use or not use an formance dimensionto be most correlatedwith
applicationto the extent they believe it willhelp two objectivemeasures of system usage (r=.79
them performtheir job better. We refer to this and .76). Buildingon Vertinsky,et al.'s (1975)
firstvariableas perceived usefulness. Second, expectancy model, Robey (1979) theorizes that:
even if potentialusers believe that a given ap- "A system that does not help people perform
plicationis useful, they may, at the same time, their jobs is not likelyto be received favorably
large investmentsin test facilitiesand instrumen- retained, yielding 10 items for each construct.
tation.Althoughobjective ease of use is clearly Next, a field study (Study 1) of 112 users con-
relevantto user performancegiven the system cerning two differentinteractivecomputer sys-
is used, subjectiveease of use is more relevant tems was conducted in orderto assess the reli-
to the users' decision whetheror not to use the ability and construct validity of the resulting
system and may not agree with the objective scales. The scales were further refined and
measures (Carrolland Thomas, 1988). streamlined to six items per construct. A lab
study (Study2) involving40 participantsand two
graphics systems was then conducted. Data
fromthe two studies were then used to assess
Convergenceof findings the relationship between usefulness, ease of
There is a strikingconvergence among the wide use, and self-reportedusage.
range of theoretical perspectives and research
studies discussed above. Although Hill, et al. Psychometriciansemphasize that the validityof
a measurementscale is builtin fromthe outset.
(1987) examined learninga computerlanguage, As Nunnally(1978) pointsout, "Ratherthan test
Larckerand Lessig (1980) and Swanson (1982,
the validityof measures after they have been
1987) dealt with evaluating informationreports, constructed, one should ensure the validityby
and Hauser and Simmie (1981) studied com-
the plan and procedures for construction"(p.
municationtechnologies,all are supportiveof the
258). Carefulselection of the initialscale items
conceptualand empiricaldistinctionbetweenuse-
fulness and ease of use. The accumulatedbody helps to assure the scales willpossess "content
of knowledgeregardingself-efficacy,contingent validity,"defined as "the degree to which the
score or scale being used represents the con-
decision behavior and adoption of innovations
provides theoreticalsupport for perceived use- cept about which generalizations are to be
made" (Bohrnstedt,1970, p. 91). In discussing
fulness and ease of use as key determinants
content validity,psychometriciansoften appeal
of behavior.
to the "domainsampling model," (Bohrnstedt,
From multipledisciplinaryvantage points, per- 1970; Nunnally,1978) which assumes there is
ceived usefulness and perceived ease of use a domainof content correspondingto each vari-
are indicatedas fundamentaland distinctcon- able one is interested in measuring. Candidate
structsthat are influentialin decisions to use in- items representativeof the domain of content
formationtechnology. Althoughcertainlynot the should be selected. Researchers are advised to
only variables of interest in explaininguser be- begin by formulatingconceptual definitionsof
havior (for other variables, see Cheney, et al., what is to be measured and preparingitems to
1986; Davis, et al., 1989; Swanson, 1988), they fit the constructdefinitions(Anastasi, 1986).
do appear likelyto play a centralrole. Improved
measures are needed to gain furtherinsightinto Following these recommendations, candidate
items for perceived usefulness and perceived
the nature of perceived usefulness and per-
ease of use were generated based on theircon-
ceived ease of use, and their roles as determi-
nants of computer use. ceptual definitions,stated above, and then pre-
tested in order to select those items that best
fit the content domains. The Spearman-Brown
Prophecy formula was used to choose the
numberof items to generate for each scale. This
formulaestimates the numberof items needed
Scale Development and to achieve a given reliability based on the
Pretest number of items and reliabilityof comparable
existingscales. Extrapolatingfrompast studies,
A step-by-step process was used to develop the formula suggests that 10 items would be
new multi-itemscales having high reliabilityand needed for each perceptualvariableto achieve
validity.The conceptual definitionsof perceived reliabilityof at least .80 (Davis, 1986). Adding
usefulness and perceived ease of use, stated fouradditionalitems for each constructto allow
above, were used to generate 14 candidate for item elimination,it was decided to generate
itemsforeach constructfrompast literature.Pre- 14 items for each construct.
test interviewswere then conducted to assess
the semantic content of the items. Those items The initialitem pools for perceived usefulness
that best fitthe definitionsof the constructswere and perceived ease of use are given in Tables
1 and 2, respectively. In preparingcandidate other items in order to yield a more pure indi-
items, 37 publishedresearch papers dealingwith cant of the conceptual variable.
user reactions to interactivesystems were re-
viewed in other to identifyvarious facets of the Pretest interviewswere performedto furtheren-
constructs that should be measured (Davis, hance content validityby assessing the corre-
1986). The items are wordedin referenceto "the spondence betweencandidateitemsand the defi-
electronicmail system," which is one of the two nitions of the variables they are intended to
test applicationsinvestigatedin Study 1, reported measure. Itemsthatdon'trepresenta construct's
below. The items withineach pool tend to have contentvery well can be screened out by asking
a lot of overlap in their meaning, which is con- individualsto rankthe degree to whicheach item
sistent with the fact that they are intended as matches the variable's definition,and eliminat-
measures of the same underlying construct. ing items receiving low rankings.In eliminating
Thoughdifferentindividualsmay attributeslightly items, we want to make sure not to reduce the
differentmeaning to particularitem statements, representativeness of the item pools. Our item
the goal of the multi-itemapproachis to reduce pools may have excess coverage of some areas
any extranneous effects of individualitems, al- of meaning (or substrata;see Bohrnstedt,1970)
lowing idiosyncrasies to be cancelled out by withinthe content domain and not enough of
Table 1. InitialScale Items for Perceived Usefulness
others. By asking individualsto rate the similar- over work"was retainedsince, althoughit was
ity of items to one another, we can performa ranked fairly low, it fell in the top 9 and may
clusteranalysis to determinethe structureof the tap an importantaspect of usefulness.
substrata,remove items where excess coverage
is suggested, and add items where inadequate
Lookingnow at perceived ease of use (Table
coverage is indicated. 4), we again find three main clusters. The first
Pretest participantsconsisted of a sample of 15 relates to physical effort, while the second re-
experienced computer users from the Sloan lates to mental effort.Selecting the six highest-
School of Management,MIT,includingfive sec- priorityitems and eliminatingthe seventh pro-
retaries, five graduate students and five mem- vides good coverage of these two clusters. Item
bers of the professionalstaff. In face-to-face in- 11 ("understandable")was reworded to read
terviews,participantswere asked to performtwo "clearand understandable"in an effortto pick
tasks, prioritizationand categorization, which up some of the content of item 1 ("confusing"),
were done separately for usefulness and ease which has been eliminated.The thirdcluster is
of use. For prioritization,they were first given somewhat more difficultto interpretbut appears
a card containingthe definitionof the targetcon- to be tappingperceptionsof how easy a system
structand asked to read it. Next,they were given is to learn. Rememberinghow to performtasks,
13 index cards each having one of the items using the manual, and relyingon system guid-
forthat constructwrittenon it. The 14thor "over- ance are all phenomenaassociated withthe proc-
all" item for each construct was omitted since ess of learningto use a new system (Nickerson,
its wordingwas almost identicalto the label on 1981; Robertsand Moran,1983). Furtherreview
the definitioncard (see Tables 1 and 2). Partici- of the literaturesuggests that ease of use and
pants were asked to rankthe 13 cards accord- ease of learning are strongly related. Roberts
ing to how well the meaning of each statement and Moran(1983) find a correlationof .79 be-
matched the given definitionof ease of use or tween objective measures of ease of use and
usefulness. ease of learning. Whiteside, et al. (1985) find
that ease of use and ease of learning are
For the categorization task, participantswere stronglyrelatedand conclude that they are con-
asked to put the 13 cards intothree to five cate- gruent. Studies of how people learn new sys-
gories so that the statements withina category tems suggest that learning and using are not
were most similarin meaning to each other and separate, disjoint activities, but instead that
dissimilarin meaning from those in other cate- people are motivatedto begin performingactual
gories. This was an adaptationof the "owncate- workdirectlyand try to "learnby doing"as op-
gories" procedure of Sherif and Sherif (1967). posed to going throughuser manuals or online
Categorizationprovidesa simple indicantof simi- tutorials(Carrolland Carrithers,1984; Carroll,
laritythat requiresless time and effortto obtain et al., 1985; Carrolland McKendree,1987).
than other similaritymeasurement procedures
such as paid comparisons. The similaritydata
was cluster analyzed by assigning to the same In this study, therefore, ease of learningis re-
clusteritems thatseven or more subjects placed garded as one substratumof the ease of use
in the same category. The clusters are consid- construct, as opposed to a distinct construct.
ered to be a reflectionof the domain substrata Since items 4 and 13 provide a ratherindirect
for each constructand serve as a basis of as- assessment of ease of learning,they were re-
sessing coverage, or representativeness,of the placed with two items that more directlyget at
item pools. ease of learning:"Learningto operate the elec-
tronic mail system is easy for me," and "I find
The resultingrankand cluster data are summa- it takes a lot of effortto become skillfulat using
rized in Tables 3 (usefulness) and 4 (ease of electronic mail." Items 6, 9 and 2 were elimi-
use). Forperceivedusefulness, noticethat items nated because they did not cluster with other
fall into three main clusters. The firstcluster re- items, and they received low priorityrankings,
lates to job effectiveness, the second to produc- which suggests that they do not fit well within
tivityand time savings, and the thirdto the im- the content domain for ease of use. Together
portance of the system to one's job. If we with the "overall"items for each construct,this
eliminate the lowest-rankeditems (items 1, 4, procedureyielded a 10-itemscale for each con-
5 and 9), we see that the three majorclusters struct to be empiricallytested for reliabilityand
each have at least two items. Item 2, "control constructvalidity.
the sample, 10 percent were managers, 35 per- differenttraitor withdifferentitems used to meas-
cent were administrativestaff, and 55 percent ure a differenttrait(Campbelland Fiske, 1959).
were professionalstaff (whichincludeda broad For perceived usefulness, 1,800 such compari-
mixof marketanalysts,productdevelopmentana- sons were confirmedwithoutexception. Of the
lysts, programmers,financial analysts and re- 1,800 comparisons for ease of use there were
search scientists). 58 exceptions (3%). This represents an unusu-
ally high level of discriminantvalidity(Campbell
and Fiske, 1959; Silk, 1971) and implies that
the usefulness and ease of use scales possess
Reliabilityand validity a high concentrationof trait variance and are
The perceived usefulness scale attained Cron- not strongly influenced by methodological
bach alpha reliabilityof .97 for both the elec- artifacts.
tronicmailand XEDITsystems, while perceived
ease of use achieved a reliabilityof .86 for elec- Table 5 gives a summaryfrequencytable of the
tronic mail and .93 for XEDIT.When observa- correlationscomprisingthe MTMMmatricesfor
tions were pooled for the two systems, alpha usefulness and ease of use. Fromthis table it
was .97 for usefulness and .91 for ease of use. is possible to see the separation in magnitude
between monotraitand heterotraitcorrelations.
Convergentand discriminantvaliditywere tested The frequencytable also shows that the hetero-
using multitrait-multimethod (MTMM)analysis trait-heteromethod correlationsdo not appearto
(Campbelland Fiske, 1959). The MTMMmatrix be substantiallyelevated above the heterotrait-
containsthe intercorrelationsof items (methods)
monomethodcorrelations.This is an additional
appliedto the two differenttest systems (traits),
electronic mail and XEDIT.Convergentvalidity diagnostic suggested by Campbell and Fiske
refers to whether the items comprisinga scale (1959) to detect the presence of method
behave as if they are measuringa common un- variance.
derlyingconstruct.In orderto demonstratecon- The few exceptions to the convergent and dis-
vergent validity,items that measure the same criminantvaliditythatdid occur, althoughnot ex-
trait should correlate highly with one another
tensive enough to invalidatethe ease of use
(Campbelland Fiske, 1959). That is, the ele- scale, all involved negatively phrased ease of
ments in the monotraittriangles (the submatrix
use items.These "reversed"items tendedto cor-
of intercorrelationsbetween items intended to
relate more with the same item used to meas-
measure the same construct for the same
ure a differenttraitthan they didwithotheritems
system) withinthe MTMMmatrices should be of the same trait, suggesting the presence of
large.Forperceivedusefulness,the 90 monotrait- common method variance. This is ironic,since
heteromethodcorrelationswere all significantat
reversed scales are typicallyused in an effort
the .05 level. For ease of use, 86 out of 90,
corre- to reduce common methodvariance.Silk (1971)
or 95.6%, of the monotrait-heteromethod
lationswere significant.Thus, our data supports similarlyobserved minordepartures from con-
the convergent validityof the two scales. vergent and discriminantvalidityfor reversed
items. The five positively worded ease of use
Discriminantvalidityis concerned with the abil- items had a reliabilityof .92 compared to .83
ity of a measurement item to differentiatebe- for the five negative items. This suggests an im-
tween objects being measured. For instance, provementin the ease of use scale may be pos-
withinthe MTMMmatrix,a perceived usefulness sible with the eliminationor reversal of nega-
item appliedto electronicmail should not corre- tively phrased items. Nevertheless, the MTMM
late too highly with the same item applied to analysis supported the ability of the 10-item
XEDIT.Failureto discriminatemay suggest the scales for each constructto differentiatebetween
presence of "commonmethod variance,"which systems.
means that an item is measuringmethodological
artifactsunrelatedto the target construct(such Factorialvalidityis concerned with whetherthe
as individualdifferences in the style of respond- usefulness and ease of use items form distinct
ingto questions (see Campbell,et al., 1967; Silk, constructs. A principal components analysis
1971) ). The test for discriminantvalidityis that using oblique rotation was performed on the
an item should correlatemore highlywith other twenty usefulness and ease of use items. Data
items intended to measure the same traitthan were pooled across the two systems, for a total
with either the same item used to measure a of 184 observations. The results show that the
usefulness and ease of use items load on dis- how to performtasks"), which the pretest indi-
tinctfactors(Table6). The multitrait-multimethod cated was concerned withease of learning,was
analysisand factoranalysisbothsupportthe con- replaced by a reversal of item 9 ("easy to
struct validityof the 10-item scales. become skillful"),which was specifically de-
signed to more directly tap ease of learning.
These items include two from cluster C, one
Scale refinement each fromclusters A and B, and the overallitem.
In applied testing situations, it is importantto (See Table 4.) In order to improverepresenta-
keep scales as brief as possible, particularly tive coverage of the content domain, an addi-
when multiplesystems are going to be evalu- tional A item was added. Of the two remaining
ated. The usefulness and ease of use scales A items (#1, Cumbersome, and #5, Rigid and
were refined and streamlinedbased on results Inflexible),item5 is readilyreversedto form"flex-
from Study 1 and then subjected to a second ible to interactwith."This item was added to
roundof empiricalvalidationin Study2, reported form the sixth item, and the order of items 5
below. Applyingthe Spearman-Brownprophecy and 8 was permutedin order to prevent items
formulato the .97 reliabilityobtained for per- fromthe same cluster (items 4 and 5) fromap-
ceived usefulness indicatesthat a six-itemscale pearing next to one another.
composed of items having comparable reliabil-
In order to select six items to be used for the
ity would yield a scale reliabilityof .94. The five
positive ease of use items had a reliabilityof usefulness scale, an item analysis was per-
.92. Taken together, these findingsfrom Study formed. Corrected item-totalcorrelationswere
1 suggest that six items would be adequate to computed for each item, separately for each
achieve reliabilitylevels above .9 while main- system studied. Average Z-scores of these cor-
relationswere used to rankthe items. Items 3,
taining adequate validitylevels. Based on the
results of the field study, six of the 10 items for 5, 6, 8, 9 and 10 were top-rankeditems. Refer-
each construct were selected to form modified ring to the cluster analysis (Table 3), we see
scales. that this set is well-representativeof the content
domain, includingtwo items fromcluster A, two
For the ease of use scale, the five negatively from cluster B and one from cluster C, as well
worded items were eliminateddue to their ap- as the overall item (#10). The items were per-
parentcommon method variance, leaving items muted to prevent items from the same cluster
2, 4, 6, 8 and 10. Item 6 ("easy to remember fromappearingnext to one another.The result-
ing six-item usefulness and ease of use scales (.69), and overall(.64). Allcorrelationswere sig-
are shown in the Appendix. nificantat the .001 level.
Regression analyses were performedto assess
the joint effects of usefulness and ease of use
Relationshipto use on usage. The effect of usefulness on usage,
Participants were asked to self-report their controllingfor ease of use, was significantat the
degree of currentusage of electronic mail and .001 level for electronic mail (b=.55), XEDIT
XEDITon six-position categorical scales with (b=.69), and pooled (b=.57). In contrast, the
boxes labeled "Don'tuse at all,""Use less than effect of ease of use on usage, controllingfor
once each week," "Use aboutonce each week," usefulness, was non-significantacross the board
"Use several times a week," "Use about once (b=.01 for electronic mail; b=.02 for XEDIT;
each day," and "Use several times each day." and b=.07 pooled). In other words, the signifi-
Usage was significantlycorrelatedwithboth per- cant pairwise correlationbetween ease of use
ceived usefulness and perceived ease of use and usage vanishes when usefulness is con-
for both PROFS mail and XEDIT.PROFS mail trolledfor. The regression coefficients obtained
usage correlated.56 with perceived usefulness for each individualsystem within each study
and .32 with perceived ease of use. XEDIT were not significantly different (F3, 178= 1.95,
usage correlated .68 with usefulness and .48 n.s.). As the relationshipbetween independent
withease of use. Whendata were pooled across variablesin a regression approachperfectlinear
systems, usage correlated .63 with usefulness dependence, multicollinearity can degrade the
and .45 withease of use. The overallusefulness- parameterestimates obtained.Althoughthe cor-
use correlationwas significantlygreaterthan the relations between usefulness and ease of use
ease of use-use correlationas indicated by a are significant, according to tests for multi-
test of dependent correlations (t181=3.69, collinearitythey are not large enough to com-
p<.001) (Cohen and Cohen, 1975). Usefulness promise the accuracy of the estimated regres-
and ease of use were significantlycorrelated sion coefficientssince the standarderrorsof the
with each other for electronicmail (.56), XEDIT estimates are low (.08 for both usefulness and
ease of use), and the covariances between the The study involved evaluating two IBM PC-
parameter estimates are negligible (-.004) based graphics systems: Chart-Master(by De-
(Johnston, 1972; Mansfieldand Helms, 1982). cision Resources, Inc.of Westport,CN) and Pen-
Based on partialcorrelationanalyses, the vari- draw (by Pencept, Inc. of Waltham,MA).Chart-
ance in usage explained by ease of use drops Master is a menu-drivenpackage that creates
by 98% when usefulness is controlledfor. The numericalbusiness graphs, such as bar charts,
regressionand partialcorrelationresultssuggest line charts, and pie charts based on parameters
that usefulness mediates the effect of ease of defined by the user. Throughthe keyboardand
use on usage, i.e., that ease of use influences menus, the user inputsthe data for, and defines
usage indirectlythroughits effect on usefulness the desired characteristicsof, the chart to be
(J.A. Davis, 1985). made. The user can specify a wide variety of
options relatingto titlefonts, colors, plotorienta-
tion, cross-hatching pattern, chart format, and
so on. The chart can then be previewedon the
screen, saved, and printed.Chart-Masteris a
successful commercialproductthat typifies the
Study 2 categoryof numericbusiness chartingprograms.
A lab study was performedto evaluate the six-
item usefulness and ease of use scales result- Pendrawis quite differentfromthe typicalbusi-
ing from scale refinementin Study 1. Study 2 ness chartingprogram.Ituses bit-mappedgraph-
was designed to approximateapplied prototype ics and a "directmanipulation"interfacewhere
testing or system selection situations,an impor- users draw desired shapes using a digitizer
tant class of situations where measures of this tablet and an electronic "pen"as a stylus. The
kind are likelyto be used in practice. In proto- digitizertablet supplants the keyboard as the
type testing and system selection contexts, pro- inputmedium. By drawingon a tablet, the user
spective users are typicallygiven a briefhands- manipulatesthe image, which is visible on the
on demonstrationinvolvingless than an hourof screen as it is being created. Pendraw offers
actually interactingwith the candidate system. capabilities typical of PC-based, bit-mapped
Thus, representativeusers are asked to rate the "paint"programs (see Panko, 1988), allowing
future usefulness and ease of use they would the user to performfreehanddrawingand select
expect based on relativelylittleexperience with from among geometric shapes, such as boxes,
the systems being rated. We are especially in- lines, and circles. A varietyof line widths, color
terested in the propertiesof the usefulness and selections and title fonts are available. The
ease of use scales when they are worded in digitizeris also capable of performingcharacter
a prospective sense and are based on limited recognition,converting hand-printercharacters
experience with the target systems. Favorable into various fonts (Ward and Blesser, 1985).
psychometric properties under these circum- Pencept had positionedthe Pendrawproductto
stances would be encouraging relative to their complete with business chartingprograms.The
use as early warning indicants of user accep- manualintroducesPendrawby guidingthe user
tance (Ginzberg,1981). throughthe process of creating a numericbar
chart. Thus, a key marketing issue was the
The lab study involved40 voluntaryparticipants
extent to whichthe new productwouldcompete
who were evening MBAstudents at Boston Uni-
favorablywithestablishedbrands,such as Chart-
versity. They were paid $25 for participatingin Master.
the study. They had an average of five years'
workexperience and were employed full-timein
several industries,includingeducation (10 per- Participantswere given one hour of hands-on
cent), government(10 percent),financial(28 per- experience with Chart-Masterand Pendraw,
cent), health (18 percent),and manufacturing(8 using workbooksthat were designed to follow
percent). They had a range of priorexperience the same instructionalsequence as the user
with computers in general (35 percent none or manuals for the two products,while equalizing
limited;48 percent moderate; and 17 percent the style of writingand eliminatingvalue state-
extensive) and personal computers in particular ments (e.g., "See how easy that was to do?").
(35 percent none or limited;48 percent moder- Half of the participantstried Chart-Masterfirst
ate; and 15 percent extensive) but were unfa- and half tried Pendraw first. After using each
miliarwith the two systems used in the study. package, a questionnairewas completed.
system that does not performa useful function. tern of results in a two-wave study (Tables 8
The prominenceof usefulness over ease of use and 9). Inthatstudy, MBAstudentsubjects were
has importantimplicationsfordesigners, particu- asked to fillout a questionnaireaftera one-hour
larly in the human factors tradition,who have introductionto a word processing program,and
tended to overemphasize ease of use and over- again 14 weeks later. Usage intentions were
look usefulness (e.g., Branscomband Thomas, measured at both time periods, and self-
1984; Chin, et al., 1988; Shneiderman, 1987). reportedusage was measured at the latertime
Thus, a major conclusion of this study is that period. Intentionswere significantlycorrelated
perceived usefulness is a strong correlate of with usage (.35 and .63 for the two points in
user acceptance and should not be ignored by time, respectively).Unlikethe results of Studies
those attemptingto design or implementsuc- 1 and 2, Davis, et al. (1989) found a significant
cessful systems. directeffect of ease of use on usage, controlling
for usefulness, after the one-hour trainingses-
From a causal perspective, the regression re- sion (Table9), althoughthis evolved into a non-
sults suggest that ease of use may be an ante- significanteffect as of 14 weeks later. In gen-
cedent to usefulness, rather than a parallel, eral, though, Davis, et al. (1989) found useful-
direct determinant of usage. The significant ness to be more influentialthan ease of use in
pairwise correlationbetween ease of use and drivingusage behavior,consistent withthe find-
usage all but vanishes when usefulness is con- ings reportedabove.
trolledfor. This, coupled with a significantease
of use-usefulness correlationis exactly the pat- Furtherresearch willshed more lighton the gen-
tern one would expect if usefulness mediated eralityof these findings.Anotherlimitationis that
between ease of use and usage (e.g., J.A. the usage measures employed were self-
Davis, 1985). That is, the results are consistent reported as opposed to objectively measured.
with an ease of use --> usefulness --> usage Not enough is currentlyknownabout how accu-
chain of causality. These results held both for rately self-reportsreflect actual behavior.Also,
pooled observations and for each individual since usage was reportedon the same ques-
system (Table 8). The causal influenceof ease tionnaireused to measure usefulness and ease
of use on usefulness makes sense conceptu- of use, the possibilityof a halo effect should not
ally, too. All else being equal, the easier a be overlooked. Futureresearch addressing the
system is to interactwith,the less effortneeded relationshipbetween these constructs and ob-
to operate it, and the more effortone can allo- jectivelymeasured use is needed before claims
cate to other activities (Radner and Rothschild, about the behavioral predictiveness can be
1975), contributingto overall job performance. madeconclusively.These limitations notwithstand-
Goodwin(1987) also argues forthis flowof cau- ing, the results represent a promising step
sality, concludingfromher analysis that: "There towardthe establishmentof improvedmeasures
is increasing evidence that the effective func- for two importantvariables.
tionalityof a system depends on its usability"
(p. 229). This intriguinginterpretationis prelimi-
nary and should be subjected to furtherexperi-
mentation. If true, however, it underscores the Research implications
theoreticalimportanceof perceived usefulness. Futureresearch is needed to address how other
variables relate to usefulness, ease of use, and
This investigationhas limitationsthat should be acceptance. Intrinsicmotivation,for example,
pointed out. The generality of the findings re- has received inadequate attentionin MIStheo-
mains to be shown by futureresearch. The fact ries. Whereas perceived usefulness is con-
that similarfindingswere observed, withrespect cerned withperformanceas a consequence use,
to boththe psychometricpropertiesof the meas- intrinsicmotivationis concerned with the rein-
ures and the patternof empiricalassociations, forcementand enjoymentrelatedto the process
across two differentuser populations,two differ- of performinga behaviorper se, irrespectiveof
ent systems, and two differentresearch settings whatever external outcomes are generated by
(lab and field), providessome evidence favoring such behavior (Deci, 1975). Although intrinsic
externalvalidity. motivationhas been studiedin the design of com-
putergames (e.g., Malone, 1981), it is just be-
In addition, a follow-upto this study, reported ginningto be recognized as a potentialmecha-
by Davis, et al. (1989) found a very similarpat- nism underlyinguser acceptance of end-user
Appendix
Final MeasurementScales for Perceived Usefulness and
Perceived Ease of Use
Perceived Usefulness
Using CHART-MASTER in my job would enable me to accomplish tasks more quickly.
likely I I I I I I I unlikely
extremely quite slightly neither slightly quite extremely
Using CHART-MASTERwould improve my job performance.
likely 1- I I I I I unlikely
extremely quite slightly neither slightly quite extremely
Using CHART-MASTER in my job would increase my productivity.
likely I I I I I unlikely
extremely quite slightly neither slightly quite extremely
Using CHART-MASTER would enhance my effectiveness on the job.
likely I I I unlikely
extremely quite slightly neither slightly quite extremely
Using CHART-MASTERwould make it easier to do my job.
likely II I I I I I I unlikely
extremely quite slightly neither slightly quite extremely
I would find CHART-MASTER useful in my job.
likely II I I I I I I unlikely
extremely quite slightly neither slightly quite extremely