9781849668200
9781849668200
9781849668200
Bloomsbury Academic
CIP records for this book are available from the British Library and the
Library of Congress
This book is produced using paper that is made from wood grown in managed,
sustainable forests. It is natural, renewable and recyclable. The logging and
manufacturing processes conform to the environmental regulations of
the country of origin.
Printed and bound in Great Britain by MPG Books Group, Bodmin, Cornwall.
www.bloomsburyacademic.com
Contents
1 Introduction 1
2 History of social network analysis 7
3 Key concepts and measures 31
4 Applications of network analysis 57
5 Criticisms and frequently asked questions 85
6 Software for social network analysis 103
Bibliography 111
Index 123
v
This page intentionally left blank
List of Figures
1 An acquaintanceship sociogram 10
2 Sociometric structures: star, Y, chain, circle 13
3 Strong and weak ties 15
4 Clique structure 18
5 Overlapping cliques in a social hierarchy 19
6 Blockmodel role structure 21
7 Affective distance in friendship 23
8 Multidimensional scaling of a network 24
vii
This page intentionally left blank
Series foreword
ix
x What is social network analysis?
Graham Crow
Series editor
1 Introduction
1
2 What is social network analysis?
face the newcomer. The book can usefully begin, therefore, with
a contrast between how it was done before and how it is done
now.
My personal interest in social network analysis began because
I had an interest in economic power and a fascination with the
work of those who claimed to have identified the key financial
groups in contemporary economies and to have shown the
webs of connection that tied them together through class-based
links of schooling, club membership, and kinship. It seemed to
me that this could best be studied through an investigation of
interlocking directorships and through the use of systematic
network methods. Small studies by Richard Whitley (1973) and
by Phil Stanworth and Tony Giddens (1975) were beginning to
appear at this time, and these authors had drawn diagrams of
board connections among small numbers of companies and
had reported the measured ‘density’ of their networks. These
studies seemed to offer an advantage over the more popular
diagrammatic representations produced by radical journalists
in which the nature of the connections and the implications of
the patterns were not considered. However, I felt that there had
to be a way of extending these systematic methods to the larger
selections of companies that I wanted to study.
The early 1970s were a time at which British sociology was
beginning to make advances in the use of mathematical tech-
niques to study individuals and their relationships. The British
Sociological Association Quantitative Sociology Group had
been formed as a focus for this work and I wrote a short piece
for its newsletter in which I requested help in my search for
useable techniques of social network analysis. This brought me
Introduction 3
data onto punched cards which could then be read into the
university’s mainframe Control Data Corporation (CDC) Cyber
computer. The British project came to form a part of an inter-
national project headed by Frans Stokman, who headed the
team that had produced a network program called GRAph
Definition and Analysis Package (GRADAP), also stored on a
larger CDC computer at Gröningen University. My punched
cards were transferred to a large data tape, and this was posted
to Gröningen for analysis there—no other computer was large
enough to handle the program and its data. A few weeks later
I would begin to receive the output in the form of reams of
computer printout on concertinaed paper.
The output received was, however, a revelation. Most of
the now-standard network concepts could be generated by
GRADAP, and I could begin to report the basic structural features
of my network—a mere three years or so after I had begun to
compile the data. All the researchers in the ten participating
countries worked in the same way and the Gröningen team
produced equivalent output for each participant, so ensuring
strict comparability in the analysis of our various data sets.
Some measures were not, however, included in GRADAP. For
these I relied on a program written by Clyde Mitchell. Clyde
provided me with a copy of the program—CONCOR—on
punched cards and the university computer staff wrote a small
program that could convert my GRADAP files into a form suit-
able for use with CONCOR. The computer demands for CONCOR
were so great that it could run only because my university rented
time on the Manchester super-computer: which was almost as
powerful as one of today’s mobile phones! Each time I used the
6 What is social network analysis?
7
8 What is social network analysis?
13 14
15 16
7 19
8 18
10 3
23
17
6
21 11
5
24
2
22
4
1 12 20
I1 I3
W1 W2 W3 W4 W5 W6 W7 W8 W9
S1 S2 S4
–
Figure 4 Clique structure
Source: Roethlisberger and Dickson (1939: 507)
help when it was needed and drew this as a diagram (see Figure
4). Cliques were seen as formal representations of the commonly
expressed idea that people may feel they are part of an ‘in-crowd’
or that they are outsiders to it.
Warner decided to pursue similar issues through a community
study in the New England town of Newburyport. Regarding this
as a typical American town with its roots in the early colonial
period, he referred to it as ‘Yankee City’ in his published studies
(Warner and Lunt 1941; 1942; Warner and Srole 1945; Warner and
Low 1947; Warner 1963). Later in the 1930s he supervised a similar
study in the equally old southern town of Natchez, referring
to this as ‘Old City’ (Davis et al. 1941). In these studies, Warner
and his teams explored the formation of ‘cliques’, understood as
informal communal groupings based on feelings of intimacy and
solidarity and that existed alongside the formal associations of
church, business, leisure, and politics. This relationship between
communal and associational patterns, informal and formal ties,
History of social network analysis 19
Fathers Mothers
Children Teachers
self
people you
know well
distant friends
Further reading
31
32 What is social network analysis?
Such a data file might show, for example, eighteen women and
their participation in a number of social events. Each woman
would be represented by a row in the file, and the row would
show the particular events that she has attended. From this
initial file it is possible to produce a file listing the women and the
others who they meet at the various events, and also a file listing
the various events that are similar because the same women
attend them. Similarly, a file for the analysis of interlocking
directorships would show each company as a row, with the rows
listing the directors of the company. From this can be produced
listings of individuals and their meetings with each other and of
companies and their board-level connections to each other. A
file showing the friendship choices of schoolchildren would typi-
cally record the connections among the children directly, each
row showing the other children that are chosen as friends.
Standard social network analysis packages can read these
so-called linked lists and can convert them into the various other
files that are required for specialist analyses. This normally occurs
behind the scenes with there being little or no need for the
investigator to directly manipulate the files. It is to the particular
concepts and measures available in these packages that I now
turn.
for example, has its limits. Other things being equal, then, an
increase in the size of a network is likely to mean a reduction
in its density. This is exacerbated by the fact that the ability to
sustain relationships varies with the type of relationship. We
may, for example, be able to ‘recognise’ far more people than we
would acknowledge as ‘friends’, and we are likely to ‘love’ even
fewer. A comparison of the density of the recognition, friendship,
and love networks, even if they are of a similar size, will therefore
be difficult. Nevertheless, for networks of a similar type, the
global density can be a useful comparative measure if reported
alongside measures of the size and inclusiveness of the graph.
A further measure of global cohesion is the centralisation of
a network. Where centrality relates to the position of particular
points, centralisation relates to the overall structure of a network.
Centralisation measures the extent to which the cohesion of a
network is organised around a specific point or set of connected
points. The spokes on a bicycle wheel, for example, form a highly
centralised network around its hub. Measures of centralisation
can be based on the degree, distance, or betweenness of points,
and extensions of these concepts have involved the idea that it
is possible to identify the sets of points that comprise the centre,
margin, and periphery of the network as a whole.
Statistical inference
Much work in social network analysis has been both static and
descriptive. Recent work, however, has shown how it is possible
to move towards dynamic understandings of social change and
to construct explanations of the patterns described. This has led
to much greater attention being given to statistical methods for
assessing the significance of results and the validity of explana-
tions. This work follows the general and well-established prin-
ciples of statistical inference and hypothesis testing, adapting
these to the specific requirements of using relational data.
A hypothesis to explain observations must be produced by
expert use of the sociological imagination, but any hypothesis
proffered must be tested before it can be accepted with any
confidence. Statistical methods of hypothesis testing involve
comparing the observed results from particular patterns of
change to the results that might be expected to occur as a result
of random variations alone. That is, the methods attempt to
show the likelihood that any result could have occurred simply
by chance. If this probability is low, then some confidence can be
taken in the mechanisms that have been hypothesised.
54 What is social network analysis?
Further reading
57
58 What is social network analysis?
Ryan and Gross showed that the rate of adoption over the
period of the study followed the ‘S’ curve suggested by Tarde.
Although sales publicity could make most farmers aware of the
new products, there were initially only a small number of early
adopters. Most of those who eventually used the new corn
did so only after discussion with neighbouring farmers who
could persuade them that using the new corn was worthwhile.
Persuasion through discussion, rather than simple imitation,
was the key to adoption, and people were especially likely to be
persuaded by the early adopting opinion leaders whose views
they valued. When a number of neighbours had adopted it
and advocated its use, other farmers were very likely to take it
up. Thus, the initial spread was slow until there were sufficient
adoptees for many farmers to have at least one adopter in
their neighbourhoods. Take-up then accelerated rapidly as the
number of new adopters increased across the State. Eventually,
however, a point was reached at which few non-adopting farm-
ers remained and the rate of adoption slowed down.
Only in the 1960s was there any advance on this work. James
Coleman and his associates (1966) undertook a study of the
introduction of the new antibiotic Tetracycline (referred to
in the study as ‘Gammanym’) and of its adoption by general
practitioners. Though unaware of the earlier work of Ryan and
Gross, they also discovered both that the pattern of adoption
followed an ‘S’ curve and that the opinion leaders were of key
importance in this process. Coleman also found that those with
the highest neighbourhood degrees, as measured by hospital
affiliation, attendance at staff meetings, and sharing an office
with other doctors, were more likely to be early innovators and
Applications of network analysis 61
cite the work of other members. It was also found, however, that
there was a tendency for citation to reflect intellectual affinities
of theory or method.
The possibilities of using systematic network methods are
vividly illustrated in work by Moody and Light (2006), who used
citation data to investigate the overall structure of sociology as a
discipline within the social sciences. They look at the co-citation
of social science journals in order to discover how similar any
two journals are in their contents in terms of citations to them
by papers in other journals. The raw data on similarities among
1,657 journals were mapped into a multidimensional space and
the authors devised a way of drawing contour lines that surround
journals with specified levels of similarity. Peaks in the contour
map, they argue, can be seen as ‘discipline’-specific clusters of
journals.
The mapping produced by Moody and Light (2006: 71, Fig 1)
shows strong and closely linked clusters for law and economics,
closely connected to similar clusters for political science. On the
immediately opposite side of the map is the strong cluster of
psychology journals. Spread between law and psychology are a
series of lower peaks corresponding to management and organi-
sational behaviour. In the opposite direction, the ‘foothills’ from
psychology trail through education and social work. Sociology
appeared as a moderately high peak in the dead-centre of the
map, while geography and anthropology appeared as low foot-
hills between sociology and political science. The height of the
peaks in the Moody and Light map correspond to the sharp-
ness of the boundaries defining a journal’s authors, and hence
the boundaries of a discipline. Sociology, they found, is not so
Applications of network analysis 75
Further reading
85
86 What is social network analysis?
Network analysis, like all social research methods, has been the
subject of much criticism by those whose preferences and intel-
lectual concerns lie in different directions. Indeed, social network
analysis has, perhaps, received more than its fair share of criti-
cism, and perhaps some of this criticism has been provoked by
the overstatements—or even the naivety—of some of its
advocates and users. Much criticism, however, is based upon
88 What is social network analysis?
can map them out, but unless it can be shown that people take
some account of them or that things are different because of
them, then the research findings are merely documents of the
obvious and the irrelevant. The most radical of these critics
go on to assert that even if the researcher did try to assess the
importance of network connections, he or she would inevitably
find that they have no importance. All this research is, therefore,
completely unnecessary.
It has to be said that much social network analysis has,
in fact, remained at the purely descriptive level and has not
gone on to assess the significance of the connections for the
individuals involved. Some early computer-based research did
report measures simply because the available software made
them easy to compute: ‘if it can be measured then it must be
significant’ was their assumption. This kind of naïve empiricism
also marked early uses of SPSS for survey analysis, where the
option ‘Statistics all’ spewed out an array of statistics that were
assumed to be important simply because they were a part of
the package. However, this assumption is far from typical. I
have argued in Chapter 3 that it is essential that the structural
analyst know and understand the concepts that are being used
and that he or she can justify them as valid and appropriate.
When there is a prima facie reason to expect the social relations
to have an effect, then a descriptive mapping plays an essential
part in the research process. This is typically the case when a
project is firmly grounded in an on-going tradition of research
in which this significance has been explored or inferred. I have
shown in Chapter 3 that there have been good technical reasons
why conventional statistical significance tests cannot be used in
Criticisms and frequently asked questions 93
are being more widely used. We all have to walk before we can
run, and the new mathematical concepts and software programs
allow the social network analyst to take this further step.
Further reading
103
104 What is social network analysis?
PAJEK
111
112 What is social network analysis?
White, Howard D., Wellman, Barry and Nazer, Nancy 2004. ‘Does
citation reflect social structure? Longitudinal evidence from
the “Globenet” interdisciplinary reserach group’. Journal of the
American Society for Information Science and Technology 55, 2:
111–126.
Whitley, Richard D. 1973. ‘Commonalities and connections
among directors of large financial institutions’ in Scott, J. (ed.)
The Sociology of Elites, Volume 1. Aldershot: Edward Elgar, 1990.
Wiese, Leopold von 1931. ‘Outlines of the “Theory of Social
Relations”’ in von Wiese, L. (ed.) Sociology. New York: Oskar
Piest, 1941.
This page intentionally left blank
Index
123
124 What is social network analysis?