November 25-29, 2002

Montevideo, Uruguay

Radisson Victoria Plaza Hotel

 
CL19
 
GADBMS - Restricted Genetic Algorithm to induce a set of rules from Relational Databases

Andréa de Fatima Cavalheiro
Federal University of Paraná, Computer Science Department
andrea.cavalheiro@gvt.net.br
Aurora Ramirez Pozo
Federal University of Paraná, Computer Science Department
aurora@inf.ufpr.br
 
Abstract

This work introduces GADBMS, a system that uses a restricted genetic algorithm (GA) to induce a set of rules from relational databases. The choice of the GA paradigm is partly justified by its capacity to deal with noisy, invalid, or inexact data and by its easy adaptation to different data domains. The GA uses Tabu lists to restrict the selection process; this restriction allows the creation of a set of potential rules for the classifier. The tool was tested on four datasets and compared with twenty-three other rule-based algorithms. Noise was then added to the databases and a new set of experiments was performed. The results show that the proposed algorithm is efficient and robust. The strategy used to maintain diversity was considered valid, since the algorithm kept its classification accuracy even with smaller populations.

Keywords: Genetic Algorithms, Tabu Search, Classification, Knowledge Discovery in Databases.




infoUYclei 2002