© 1999 by British Computer Society
A Model for Representing and Retrieving Heterogeneous Structured Documents Based on Evidential Reasoning
A1 Department of Computer Science, Queen Mary & Westfield College, University of London, London E1 4NS, UK. Email: mounia@dcs.qmw.ac.uk
A2 This work was carried out when the author was at Informatik VI, University of Dortmund, Germany.
Documents often display an internal structure; they are composed of components. For example, a journal contains several articles, which themselves contain paragraphs, tables, etc. With structured documents, the retrievable units should be the document components as well as the whole document. The components of a structured document can be heterogeneous: they may be in various media, located at a number of sites, or written in several languages. An information retrieval model for heterogeneous structured documents must take this disparity among document components into account. We present a model for representing and retrieving heterogeneous structured documents, that is, multimedia, distributed and multilingual documents. The model is based on evidential reasoning, a formal theory that allows for the representation and the combination of knowledge. Here, knowledge is the content of document components. We show that the model provides for an appropriate representation and retrieval of heterogeneous structured documents.
Received 23 September, 1998. Revised 10 September, 1999.