B08 - last change: 17-01-2007

BOBCATSSS 2008
Providing Access to Information for Everyone

Speakers
Mariàngels Granados
Anna Nicolau
Schedule
Day 2
Room Funimation Novi Park
Start time 09:30
Duration 00:30
Info
ID 55
Event type Lecture
Track S04 - Issues of information retrieval
Language English

Improving subject searching in databases through a combination of descriptors and UDC

Problems with subject access to online catalogues and databases are not new. Studies on the use of OPACs have revealed two apparently endemic problems: on the one hand, the large number of searches with zero hits (failed searches) and on the other, the retrieval of an excessive amount of bibliographic records (information overload). In this paper we describe a new information retrieval technique based on the combination of descriptor weighting and the use of the Universal Decimal Classification (UDC) call numbers. The use of classification call numbers in order to search the catalogue has traditionally been very restricted. In most catalogues, call numbers are used only as topographical indicators and are not searchable. The new system described here makes much fuller use of them. The system is based on the hypothesis that a set of descriptors correspond to a UDC call number. Through the analysis of the frequency of distribution of descriptors and call numbers, we create a set of clusters that allow increasing precision and recall. At the same time, these clusters offer alternative search modes, making it possible to systematize the indexing process and increase its consistency. We present a case study of the use of the system with the ERIC database.