Back

Ślęzak D., Wróblewski J., 2007. Roughfication of Numeric Decision Tables: The Case Study of Gene Expression Data. Proc. of RSKT 2007 (JRS 2007), Toronto, Canada. Springer-Verlag (LNAI 4481), Berlin, Heidelberg 2007, pp. 316 - 323


ABSTRACT

We extend the standard rough set-based approach to be able to deal with huge amounts of numeric attributes versus small amount of available objects. We transform the training data using a novel way of non-parametric discretization, called roughfication (in contrast to fuzzification known from fuzzy logic). Given roughfied data, we apply standard rough set attribute reduction and then classify the testing data by voting among the obtained decision rules. Roughfication enables to search for reducts and rules in the tables with the original number of attributes and far larger number of objects. It does not require expert knowledge or any kind of parameter tuning or learning. We illustrate it by the analysis of the gene expression data, where the number of genes (attributes) is enormously large with respect to the number of experiments (objects).