Iterative Bucket Inverted Files for Web Engines
 
Verónica Gil Costa and Marcela Printista
LIDIC, Computer Science Department, University of San Luis,
San Luis, Argentina
{gvcosta,mprinti}@unsl.edu.ar
 
Abstract
 
This article describes an iterative bucket strategy that improves the efficiency of a classical index data structure for parallel textual databases. Our main interest is the parallel processing of queries on a cluster of PCs, so the paper focuses on optimizing communication and synchronization. The server that processes the queries is designed on top of the Bulk Synchronous Parallel (BSP) model of computing, which lets us study how query performance is affected by the index organization. Our results, based on the Chilean Web collection, indicate that the iterative bucket organization outperforms the classical organizations.
 
Keywords: BSP, Inverted Files, Queries Indexed, Iterative Buckets.