Analysis of Use Need Image Archives
Analysis of Use Need Image Archives
Analysis of Use Need Image Archives
Linda H. Armitage and Peter G.B. Enser visual images. The synthesis of these processes is
generating rapidly developing capabilities in the realm
University of Brighton, UK
of the electronic image and operates with accelerating
force on our ability to gain physical access to our
Received 14 November 1996 visual heritage.
However, our ability to gain logical or subject access
to that visual heritage has not enjoyed comparable
advances. In part, this is due to the well-recognised
Abstract.
problems which beset the indexer of still and moving
This paper describes a project in which an analysis was image material [1–13]. As with any information
undertaken of user queries addressed to seven libraries retrieval function, the indexing problem is condi-
which manage archives of widely varying still and moving tioned by the characteristics of user need for the
image material. The sampling procedure is described, in source material.
which queries obtained from each library were broadly
Surprisingly, user need for visual imagery has been
categorised by image content, identification and accessi-
the subject of very little enquiry. Keistler [14] has
bility. Attention is focused on the image content requests,
for which a categorisation based on facet analysis is devel- studied queries addressed to the photographic collec-
oped. The analytical tool which is used for this purpose is tion of the National Library of Medicine and Ornager
based on a schema already well established for the analysis [15] has described journalists’ requests submitted to a
of levels of meaning in images. number of newspaper archives. A systematic repre-
The project demonstrates the possibility of formulating a sentation of information need in the visual domain,
general categorisation of requests which seek widely incorporating an in-depth analysis of some 2,700
different still and moving image material. The paper requests serviced by a major, general-purpose picture
concludes with observations on the potential value of archive has been reported by Enser and Mcgregor [16]
embedding such a schema within the user interface of and Enser [17]. The project reported here, in extending
unmediated-query visual information retrieval systems.
the latter analysis, sought to develop a general-purpose
categorisation of user requests for still and moving
visual images. In generating a greater understanding
1. Introduction of user need for such knowledge records, it sought
also to contribute towards a better-informed view of
There is widespread recognition that we live in an effective retrieval techniques for visually encoded
increasingly visually-oriented society; one in which information.
textual representation no longer has to be the domi-
nant paradigm for information and instruction. This
paradigm shift has been reinforced by major techno- 2. Data collection
logical thrusts which have been applied to the capture,
storage, compression, transmission and display of
2.1. Participating archives
Correspondence to: Dr P.G.B. Enser, School of Information Seven picture libraries participated in the study. Two of
Management, University of Brighton, Watts Building, the archives, the National Film and Television Archive
Moulescomb, Brighton BN2 4GJ, UK. Tel: +44 1273 600900. (NFTVA), maintained by the British Film Institute
Fax: +44 1273 642405. E-mail: P.G.B.ENSER@brighton.ac.uk (BFI), and the BBC Natural History Unit (NHU),
Table 5
Panofsky-Shatford mode/facet matrix
emphasis on non-unique subject searching compared imagery on carnivals clearly addresses a non-unique
with the BFI NFTVA. event. A request for images of the Rio Carnival, 1986,
In contrast, the NMR sample is dominated by unique however, certainly relates to a unique time, place and
over non-unique subject searching. This is attributable event. A request for imagery on Rio carnivals straddles
to custom and practice in the library, which insists on the two categories uneasily.
users defining a geographical region within which In the Hulton study, the anomaly posed by subject
searches are to be made. As such, every query is ring- matter which contained elements of both uniqueness
fenced into a singular location. and non-uniqueness was resolved by the concept of
In the case of the Mitchell and Birmingham pair, ‘refinement’ [16, 17]. Thus, Rio carnivals would have
requests for locally-based imagery necessarily seek been categorised as a request for a non-unique subject
unique locations. However, the almost equal domi- (carnivals) refined by location (Rio). In essence, the
nance of known-item searching in the Birmingham process of adding refiners is an application of facet
sample is accounted for by queries which address their analysis. The high incidence of such queries in the
nationally important collections by renowned photog- current investigation encouraged the authors to adopt
raphers, which transcend local history. a more formal facet-based analytic technique in order
The apparent simplicity and functionality of the to determine if a robust, functional representation
four-element categorisation of queries described above could be achieved, while also revealing broad unifying
masks the complex nature of the relationship between characteristics across the range of project libraries.
unique and non-unique subjects in the visual domain.
At the root of the problem is the fact that, in the
3.2. Categorisation based on mode/facet analysis
natural order of physical existence, an entity can
always be interpreted into an hierarchy of related The problem was approached by adopting Panofsky’s
super-concepts and sub-concepts; as one navigates a modes of image analysis [19], as explicated by Markey
path through such a concept hierarchy, it may not be [9] and further refined by Shatford [5]. The analytical
obvious at what level one encounters the property of tool which was derived took the form of the template
uniqueness. This problem was manifest in queries shown in Table 5, which is adapted from Shatford.
which appeared to invoke subject matter which was By representing each cell of the matrix as a code
either unique or non-unique, depending upon the (shown in brackets), and returning to the ‘carnivals’
vantage point from which the relevant concept hier- example given earlier, single and multi-faceted
archy was being viewed. For example, a request for requests, including those which incorporate uneasy
Fig. 1. Image facet content data cumulative percentage chart – mode/facet analysis.
combinations of unique and non-unique subject faceted queries. The percentage sub-totals and total per
matter, can be represented in a straightforward manner library are given as an indication of the level of
as mode/facet combinations: overlap between categories.
● carnivals: G2 Fig. 1 gives the cumulative percentage spread across
● Rio carnivals: S3 + G2 the seven project libraries, illustrating the general
● the Rio Carnival, 1986: S2 + S4. similarity of pattern of query form.
Image content data from all the library samples was As can be seen from the diagram, the mode/facet
processed using the matrix as a tool of analysis. Table analysis reveals both the broad unifying characteris-
6 gives selected examples. tics of the pairs of libraries and the different emphases
Table 7 shows the overall categorisation of the between pairs, as discussed earlier. In addition, it
project data which was achieved as a result of mode/ gives a clearer picture of any individual library’s char-
facet analysis. Percentage totals per library add up to acteristics. For example, the BBC NHU queries are
more than 100% because of the incidence of multi- strongly characterised by requests which ask for
Table 6
Examples of queries categorised by mode/facet analysis
Table 7
Sample percentages of mode/facet categorisation by library
Specifics:
S1: who? ind. named person, group,
thing 45.5 44.9 33.9 39.9 58.3 64.4 13.9
S2: what? ind. named event, action 3.4 4.2 0.6 2.5 1.7 9.0 1.5
S3: where? ind. named geographical
location 10.2 8.9 50.6 54.0 11.6 8.7 87.1
S4: when? linear time: date, period 20.5 5.1 13.1 24.5 0.4 8.3 17.5
Sub totals 79.6 63.1 98.2 120.9 72.0 90.4 120.0
Generics:
G1: who? kind of person, thing 45.5 22.9 46.4 14.7 54.5 14.4 0.5
G2: what? kind of event, action,
condition 18.8 39.3 4.8 7.4 30.2 15.1 0
G3: where? kind of place: geographical,
architectural 4.5 5.1 15.5 1.2 13.6 1.6 36.3
G4: when? cyclical time: season,
time of day 0 0.5 0 0 1.7 0 0
generic groups of animals, but which also supply events as to need no extra detail. An S2 code in these
suggested named titles of programmes either for cases seems appropriate.
content extraction or as examples of similarity of In contrast, the Queen’s Christmas Day message
proposed coverage. The significance of the specifics (1988) might be seen as an event in itself, but, because
group in the Birmingham sample reflects the concen- it is a repeated event, it must be differentiated by a
tration on genealogical tracing, which works by clari- particular year’s broadcast. In this case, an S1 + S4
fying a particular link in a chain at a time. Its coding might be more suitable. However, the query
significance in the NMR sample reflects the nature of cheetahs running on a greyhound course in Haringey
the individualised query process. in 1932 satisfies specific location and time facets, but
The generally low incidence of category S2 among deals with a generic group of animals involved in
the library samples warrants some comment. It might some general activity. Clearly though, the request
be argued that an individually named event can be relates to a one-off event and differs markedly, because
described adequately with some combination of S1: of its startling nature, from queries asking for dogs
named person(s)/thing(s) + S3: location + S4: linear running, of which there may be many such events at
time – but, for example, the query Churchill’s funeral the same place within the same time span. An S2
or a request such as the Moon landing mention neither coding seems more fitting than a cumbersome G1 + G2
place nor time, but represent sufficiently memorable + S3 + S4. Thus, although the S2 category is not well
represented among the query samples, there does seem Another general problem of any model is the need
to be a need to cater for unique events. This is espe- sometimes to incorporate attributes to do with the
cially so in the case of the BFI NFTVA, which gener- medium of the imagery. For example, images of a
ated a significant number of queries such as Bombing well-known modern personality may include paint-
of Guernica, 1937 and landspeed record of ’56. ings, drawings, etchings, photographs, film, cartoons,
A more general problem is posed by queries which collages and so on. These might be regarded as sub-
specify geographical location. Any named location can selections (G1) within a general request for images of
be regarded as unique (and therefore specific) simply that personality (S1). However, a request for images of,
because it occupies different space to any other named say, a gallery’s exhibition catalogue is quite different
location. However, there is a significant difference from images of the exhibition itself or the gallery. In
between a query such as scenes of London and St this sense, the catalogue form (G1) drives the query,
John’s Cottage, Monument Lane, Edgbaston. The latter over and above the specific information on the event
describes an entity which is distinguishable from its (S2) or place (S3). This is a particular feature of the
surroundings and which could be placed easily within Witt Library data. Thus, object attributes can become
the S3 category. The former begs many more questions a necessary part of the subject description.
before it can become a meaningful query, although The low incidence of assignments to the Abstracts
it is concerned with a named location. As it stands, it categories is a partial reflection of the nature of the
may represent, for example, a request for the sort of libraries involved in the study. All seven (together
cityscape skyline found in many cities; in which case, with the Hulton library in the earlier study) are picture
the query aligns more with the generics G3 category. or film archives, as distinct from stock-shot libraries.
Alternatively, it may be asking for the sort of tourist The emphasis in the latter lies with the provision of
attractions that do distinguish this city from another, a rapid-response service to requests for images, where
whereupon it could be placed in the S3 category. It it is frequently the case that the client is an advertising
may, of course, represent any number of other inter- agency and the request addresses the iconologic
pretations. content of an image. Among stock-shot libraries, there
The following two queries illustrate a less stark is little tradition of indexing or classifying their collec-
contrast, but generate the same problem. Images of the tions, or recording the queries addressed to them.
Birchfield area of Birmingham in the 1930s is equiva- Accordingly, such organisations were not included in
lent to the ‘London scenes’ request, in that easily the study reported here. The fact that the Witt Library
definable boundaries are lacking in terms of geog- sample demonstrates the highest concentration of A-
raphy, architecture or topic, even though the area category facets is understandable, given the nature of
asked for may be quite small. Similarly, the finite the Witt’s collection and user base.
boundaries of Home Farm, Thorneaugh, Cambs may
be visually indistinguishable from any neighbouring
farm’s fields, especially if aerial photography is the 4. Conclusion
medium in question. Both these queries ask for indi-
vidually named locations, but could involve a variety The project reported here demonstrated that there are
of different types of image. observable similarities in image query formulation
The problem is one of defining the scale and size of across a range of different libraries and that, despite
what a single geographical unit should encompass and variations in library rationale, stock and user profile,
therefore any overarching geographical framework a general characterisation of queries can be formu-
within which queries could be graded as specific or lated.
generic. Different library environments develop ways The analytical framework which supports the query
of handling the geographical element according to the characterisation reported here is rooted in a schema
kind of stock they hold and the sort of queries already well established for the analysis of image
addressed. For example, within the project libraries, content. The dual applicability of the schema to the
the BFI NFTVA indexes countries, cities and towns. characterisation both of images and the queries which
The NMR ensures definable limits by a combination address those images offers some encouragement to
of grid reference and scale of the aerial photography. the view that the embedding of such a schema within
For the purposes of consistency within the project the user interface might offer a pathway towards the
data, the decision was taken to assign an S3 code to effective processing of unmediated transactions within
any named location. a visual information retrieval system.
The incidence of such unmediated transactions is Victorian paintings on, or incorporating the subject of,
increasing under the influence of those advances in ‘Invention’.
information and communications technology which Romney portrait of C.F. Greville. 1781.
have brought CD products and networked delivery of
Bernard Samual Morles – images of foundlings in
digitised images to the marketplace. Some of the
1870s.
libraries which participated in the study reported here,
for example, while continuing to offer a traditional Don Juan in French and English painting (19th
user interface in which the human intermediary plays century) as figure in literature/opera (Mozart, Hoffman,
a crucial role in receiving and processing queries, Byron, Shaw, etc) and figure/symbol of the hero.
already provide, or are engaged in the development of, Representation of the Muse ‘Clio’ or allegorical repre-
such image delivery mechanisms. The project reported sentation of ‘History’, 17th and 18th C Images of
in this paper, in seeking to further our understanding Death, Grief, Mourning – 19th C British.
about visual information need, aims to contribute
A2 Sample of queries collected from the Wellcome
towards our developing perceptions of good interface
Library
design for such visual information retrieval systems.
Urology.
Corsetry.
Appendix Industrial health.
Synthetic antimalarial agents.
A1 Sample of queries collected from the Witt
Remmelin anatomy.
Library
Smallpox and fevers.
Gambling – cards and dice.
video: Battle for the Mind (WIHM/IC/70).
‘Atlas’.
Deans of St Thomas’ Hospital Medical School.
. . . images of Native Americans or others murdering
colonists’ children especially babies, in the New Typhus in England and Ireland.
World, e.g. Hannah Durton’s children . . . Material is x-ray.
needed for article on infanticide with ref to Cooper’s Bethlem, Ashe’s and St George’s hospitals.
‘Last of the Mohicans’.
women in medicine.
Any aspect of lavender cultivation – plants, lavender
Childbirth in history, midwifery, obstetrics.
fields, selling, etc.
Sir Henry Thompson operating on Napoleon III.
Depictions of Lucy from ‘The Bride of Lamermoor’ or
Lucinde Lamermoor. Hermaphrodite, giants.
Reynolds painting of Wm Boyce mentioned in the Nelson lying in state.
DNB and ‘Portraits of Musicians Exhibition’ at South history of adolescence.
Kensington Museum 1885 handwritten catalogue by plague of Marseilles, late 18th cent. anatomical prac-
Sir George Scharf. tices.
The depiction of vanity in painting, the depiction of Sir Joseph Fayrer (Indian Medical service).
the female figure looking in the mirror, etc.
Victorian asylums.
Railways.
A3 Sample of queries collected from the Mitchell
Ann Wharton – wife of Thomas, Lord Wharton
Library
1659–1680.
Dance halls – photos.
Likenesses of Capt. John Money and other balloonists
+ balloon accidents. Scottish Daily Express – 1951 illustration.
Lawyers + legal subjects. Green’s Playhouse – dancing.
Weapons in use. Copeland Rd (Govan) area – photos.
Portraits of Harriet Houner. HMS Volunteer – photo.
Illustrations of menagerie in Stowe landscape Gardens Sir R. Balfour – photo.
19th C. 1938 Exhibition – posters.
Theatre Royal photocopies. Specific interior sections. Sir Benjamin Stone – photograph of two mutes (1901)
Clyde cruisers – photos. Sir Benjamin Stone – photograph of the funeral of a
Balshagary School – photo. Birmingham shopkeeper (1901).
Horsedrawn fire engine, ambulance, etc – photos. . . . 35mm slides of the following:
Clyde Dock 1925/26 strikers – print. a) the frontispiece of Charles Pye’s Birmingham
Glasgow Port 18th century – photo. Directory of 1797 (. . . photocopy attached from the
Belgian steam locos. Birm. Arch. Soc. Transactions 1937 . . .)
Irish immigration to Glasgow – photos of weavers’ b) a portrait of Matthew Boulton.
social conditions 18th, 19th century. . . . either a postcard or photograph of Aston Manor
‘A Venetian fish market’ – painting. . . . [believed to be] HQ of the 8th Battalion Royal
Warwickshire Regiment, formed in 1908. . . .
Sketch plan of Glasgow 1547.
First moving pictures in Glasgow. . . . 1. Church Street, Hereford, between 1875 and 1930
2. Colwall, Herefordshire, particularly a property
Portraits of ministers.
called Colwall Mill, Old Colwall in the 1860s or 1870s
NBL: sketch of a brass plate from a locomotive; 3. Any photo of a blacksmith at Colwall in the 1870s,
specific, info. on train. 1880s or 1890s
Queen Street. Specific street number, illustrations of. 4. Any of Welland or Castlemorton, Worcestershire
A4 Sample of queries collected from the 5. Any of the Verger of Hereford Cathedral between
Birmingham Central Library 1875 & 1930.
. . . enquiries about the Francis Frith photo collection . . . one or two good pictures of the latest Bull Ring
. . . to obtain old photos of Denver, in Norfolk . . . to Shopping Centre, plus confirmation that some part of
research how it [Manor Farm] formerly appeared to aid this lies on or near an earlier Market Cross and/or
a planning application, as it is now a listed building Market House fro Mediaeval times? . . . [plus] any
. . . [or] any photos of Denver. picture postcards of Shopping Arcades . . . around the
turn of the century. . . .
. . . the picture of Sir Charles Rivers Wilson (a relative,
1831–1916) in ‘Vanity Fair’ 9 Nov 1878 by Pellegrini. . . . early photographs of the area around St Mary’s
Chapel, Whittall Street, including St Mary’s Row, Love-
. . . research is the history of Japanese photography in day Street, and a street that once existed before con-
the UK collections whether there is [sic] any Japanese struction of the General Hospital, ‘Weaman Row’. . . .
photographs especially before 1945, in your collec-
tions. . . . . . . whether the Collection [Benjamin Stone
Collection] contains any photographs that may exist of
. . . [re] Charge of the Light Brigade at Balaclava 1853 street parties taken at the time. . . .
. . . [to] trace a photograph of Corporal Martin
. . . [re] the 50th anniversary of VE day . . . any
Langfield, a bugler in the above charge, . . . depicted
photographs that may exist of street parties taken at
in one of the several books written on the battle.
the time . . . .
. . . if you have any negatives of scenes in North Curry,
. . . photographs of the roof garden of Lewis’s depart-
Somerset, Witham Friary, Somerset, Alfriston, Sussex
ment store as it was in the 1950s and 1960s. . . .
or Wyke Regis, Dorset.
. . . Aston Grammar School . . . any prints of the school
. . . i) . . . any photographs of motoring subjects taken
at this time [i.e. 100 years ago]?. . . .
by F.R. Logan (successor to Blyth Clayton) of Cannon
Passage, Birmingham. He photographed motoring . . . research into Royal burial places in Europe . . . [so
subjects extensively, including photos of trials, speed request for] photographs of the Royal vaults under the
trials, hill-climbs etc outside the city . . . Albert Memorial Chapel, St George Chapel, Windsor . . . .
ii) motoring photographs, 1900–1930, by any other . . . [to] look in the index for the Frith Collection to
photographers. see if there are any pictures of windmills in the
. . . a photograph of Mr Hugh Gilzean-Reid – MP in following Leicestershire villages: Eaton, Hoton,
1885 for Aston Manor. Sewstern, Ab Kettleby, Seagrave, Foxton. . . .
. . . one photograph of each of the following works . . . to acquire a photographic copy of St Judas Church
from your collection: (sepia finish) for family research files. . . .
A5 Sample of queries collected from the BBC NHU MOTHS – ESPECIALLY IN THE SAME SHOT. 12]
Library BOGS + CREATURES RELATED TO BOGS [DRAGON-
UNDER WATER NO HUMANS. 20–30". FLIES, ETC] – NOT ‘LIVING ISLES’! WOO NIGHT LIFE
– GET WATERWALKERS. TICKET TO THE WILD.
BATS. BUTTERFLIES. TIME-LAPSE – SENSE OF
TIMING. ALL TIGERS. DOLPHINS SWIMMING. GALAPAGOS
IS. MONTAGED. LIVE ACTION. STUNNING SHOTS.
JAPANESE EXPO – 1996. CITIES.
KILLER WHALE. GOLDEN EAGLE. KUMEDO
CLOSE SHOTS OF CRABS BREEDING. UK ONLY, DRAGON. ENTER THE DRAGONS.
MED SHOTS OKAY TOO [TO CUT INTO FOOTAGE
SHOTS OF MOON.
OF LINCOLNSHIRE COAST FOR ITEM ON AREA
IMPORTANT FOR CRABS, UNDER THREAT. A6 Sample of queries collected from the BFI
OIL FROM UNDERNEATH – BIG OIL. BELUGA FILM. NFTVA Library
LP OPEN OCEAN – WAVES, DOLPHIN LEAP, 1937 television recording of Rothschild sale.
BELUGA.
BBC Healthwatch last night on microwave hazards of
KOMODO DRAGONS, SEXUAL ENCOUNTERS OF mobile phones.
THE FLORAL KIND. LP: WORLD’S APART. TNW:
On the Waterfront for open air screening o/s RFH
ENTER THE DRAGONS.
[Royal Festival Hall].
KILLING FOR A LIVING. 1] LION SAVANNAH. 2]
Urban life.
ARMY ANTS – JUNGLE. 3] PINNEPEDS. 4] KILLER
WHALES. Frida: Naturaleza Vita [Mexican film].
WOLVES FEEDING. 421.310.11. old footage for the Lincolnshire Life Museum.
SURVIVAL PROG ON MONGOOSES. TOGETHER Liners 1920s.
THEY STAND 22866. Late Show Special on produc[tion] of Coriolanus.
NATURE – TURKEY – GENERAL SHOTS. [for entertainer who impersonates Queen Victoria]
ARMADILLOS. Queen Victoria and Romanov dynasty.
BIRTH OF EUROPE 1. Liverpudlians/Liverpool in film and TV.
BLUEBOTTLE. ANY OTHER FLYING BUZZING ‘Right the First Time’ training video.
INSECT. Cheetahs running on a greyhound course in Haringey
SWARM TRIMS – PORT MAN O’WAR ON BEACH/ in 1932.
WITH PEOPLE Interferon [the drug].
VHS RONI POSSUM – A NEW ZEALAND NIGHT- Doc on the rebuilding of Glyndebourne 22/5/94.
MARE [WILD SOUTH]\PLAYING POSSUM – ABC. films of D Day, H. Bogart, C. Gable.
ANY OTHER POSSUM PROGS. people who are against or phobic about new tech-
EIDER DUCK ALL KILLED. [1] NEED ––––- HORIZON nology.
FILM – HAVE IT. [2] 4 Spp of CEM. [3] oystercatchers. man/women working in nuclear power stations +
FROM ‘THE NATURAL WORLD – SKYHUNTERS’ wearing protective clothing.
ARE THERE ANY TRIMS OF TURKEY VULTURES/ Queen’s Christmas Day message (1988).
BLACK VULTURES CIRCLING OVER A CORPSE?
Rejuvenation treatments; cryonics; immortality.
PLEASE FIND ME THE FOLLOWING CLOSE UP/BIG
Earliest footage of St. Paul’s.
CLOSE UPS MID CLOSE UPS IN A BRITISH OR
BRITISH LOOKING ENVIRONMENT: SECRET John Hurt + Miranda Richardson.
NATIVE LYMN. 1] BADGER BEING ELUSIVE [NOT
BROCKSIDE OR LIVING ISLES]. 2] WOOD MICE. 3] A7 Sample of queries collected from the NMR Library
BANK VOLE – IF YOU CAN GET THESE ITEMS ONLY Note: each request assumes the form: initials of
SELECT TO (sic) MORE EXAMPLES OF researcher/year/month/query number within month.
ITEMS 4 TO 8. 4] WEASEL. 5] ROE DEER. 6] STOAT. Place name or feature: map sheet number, grid reference
7] MOLES – WOO UNEARTHING THE MOLE. 8] (+ expanded search area). The specialist collection refers
SHREW. 9] BRITISH BUTTERFLIES. 10] BATS. 11] to the collection of oblique aerial photographs.