© 1987 by British Computer Society
Accessing Textual Documents using Compressed Indexes of Arrays of Small Bloom Filters
Department of Computer Science, University of Western Ontario, London, Ontario, Canada N6A 5B7
A highly compressed index for a collection of variable-sized documents is described. Arrays of small Bloom filters are used to efficiently locate documents where the search probe contains ANDed and ORed combinations of words. Theoretical and experimental results are reported. The method is applicable to unplanned searching of large text files. We further describe a method to provide an index to the filters, so that only a small proportion of the compressed filter need be examined. The method is highly amenable to parallel processing.
Received November 1985, revised May 1986.
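The search idea summarised in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes one small, uncompressed Bloom filter per document, and it omits both the compression and the index to the filters. The names `FILTER_BITS`, `NUM_HASHES`, `make_filter`, `may_contain`, and `search`, the parameter values, and the use of SHA-256 as the hash are all hypothetical choices made for illustration. A query is taken to be a list of OR-groups that are ANDed together.

```python
import hashlib

# Hypothetical parameters, chosen only for illustration.
FILTER_BITS = 64   # bits in each small Bloom filter
NUM_HASHES = 3     # hash functions applied to each word


def bit_positions(word):
    """Yield the NUM_HASHES bit positions that a word maps to."""
    for i in range(NUM_HASHES):
        digest = hashlib.sha256(f"{i}:{word.lower()}".encode()).digest()
        yield int.from_bytes(digest[:8], "big") % FILTER_BITS


def make_filter(text):
    """Build one small Bloom filter (stored as an int) from a document's words."""
    bits = 0
    for word in text.split():
        for pos in bit_positions(word):
            bits |= 1 << pos
    return bits


def may_contain(bits, word):
    """True if the word may be present; False means it is definitely absent."""
    return all((bits >> pos) & 1 for pos in bit_positions(word))


def search(filters, query):
    """Return indices of candidate documents.

    `query` is a list of OR-groups that are ANDed together, e.g.
    [["bloom"], ["index", "parallel"]] means bloom AND (index OR parallel).
    """
    return [
        doc_id
        for doc_id, bits in enumerate(filters)
        if all(any(may_contain(bits, w) for w in group) for group in query)
    ]


documents = [
    "bloom filters give a compressed index",
    "parallel processing of text files",
    "bloom filters suit parallel hardware",
]
filters = [make_filter(d) for d in documents]
candidates = search(filters, [["bloom"], ["index", "parallel"]])
```

A Bloom filter never reports a present word as absent, so `candidates` always includes every document that truly satisfies the query. It may also include false positives, which is why candidate documents still have to be checked against the text itself. Each document's filter is tested independently of the others, which is one way to read the abstract's remark about parallel processing.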