Coling82, Z Homck) (Ed.) North - Hollandpublishingcompany © Academ 1982
Coling82, Z Homck) (Ed.) North - Hollandpublishingcompany © Academ 1982
Coling82, Z Homck) (Ed.) North - Hollandpublishingcompany © Academ 1982
)
North.HollandPublishingCompany
Academ~ 1982
A DECISION THEORY
162
G. K N O R Z .
This paper w i l l only give reference to details of the particular solution published elsewhere. The objective of this work is to present the general framework
derivable from the basic principles underlying the WAI and the AIR project:
(I) Knowledgebases are very important for problem solving. But to presuppose
knowledge for an automatic system must notquestionits a p p l i c a b i l i t y , caused
by non-existent procedures for construction of knowledge bases of an indispensable size. The r e a l i s t i c appropriate solution is the main aim rather than a
perfect one.
(2) Controlling the quality and expenditure of e f f o r t of a system must not wait
until i t is put into practice. System development has to be guided by a control
derivable from the task to be performed,
(3) The algorithms that make the bases of the procedure should not be assumed to be
perfect. Applied to complex tasks, i t is a fundamental fact that they are
based on simplified models.
The principles can be considered to be a guideline for designing application
oriented systems. With good reason i t is claimed that the quality of such a system
can be determined by evaluation in application environments only (see for example
[ 7 ] , [ 8 ] ) . This cannot be done without empirical studies of the user-system interaction.
The paradigm of recognizing abstract objects presented here is an approach to
integrate the evaluation aspect into system development. I t is also an approach
to problems, for which no perfect solutions exist or seem to be applicable.
163
the
a strong theoretical
is, that both models ME and MI are essentially independent This fact causes every
164
G, KNORZ
objects
. ~
describing
l ;l
deciding
setof
~ o b j e c t
classes
K
( ~
RSMI - provided i t
x ~
and i t s environment
by mE . The desision theory approach appropriate to the given s i t u a t i o n is d e s c r i bed in [ 5 ] and [ 6 ] with respect to the indexing problem. The approach requires that
every s i n g l e d e c i s i o n of RS is c l a s s i f i e d . This task is f o r the most p a r t a n t i cipated by ME , which defines the set of object classes K. K determines the scope
of possible f a u l t s . Those
cause
4 DISCUSSION
The approach of recognizing a b s t r a c t objects is evaluated using the p a r a d i g nf
automatic indexing. The model ~,IE refers - f o r p r a c t i c a l reasons - not to the
165
The recognition problem causes one to regard two independent models: one with
respect to retrieval and one with respect to analysis of abstracts. This point
of view is important for an approach to optimal indexing [6 ], but i t is not
self-obvious. In [14] the retrieval oriented approach of Robertson and the
indexing oriented approach of Harter [13] are brought together. The result
is
ensures, that the gap between the optimal system RsM~t and the ideal system
FOOTNOTES
WAI means W~rterbuchentwicklung fUr automatisches Indexing (dictionary construction for automatic indexing), [3]. The research was supported by the BMFT
contract PT 131.05 to Technische Hochschule Darmstadt (march I , 1978 - december
12, 1981).
AIR means Weiterentwicklung der automatischen Indexierung und des Information
Retrieval (further development of automatic indexing and information r e t r i e v a l ) .
Supported by the BMFT contract PT 131.10 to Technische Hochschule Darmstadt
(march I , 1981 - december 31, 1983).
The order of magnitude of the two dictionaries may be characterized as follows:
about 13.000 single words, 20.000 phrases and 100.000 term-descriptor relations
each.
166
G. KNORZ
The two volumes 3 and 4 of the abstract journal Food Science and Technology
Abstracts (FSTA 71/72) containing about 33.000 documents were used as a basis
for dictionary.construction.