Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics
Universitat Politècnica de Catalunya. SOCO - Soft Computing
2011
Machine learning methods have recently been applied with considerable effort to multidisciplinary problems in cancer classification from microarray gene expression data. These tasks are characterized by a large number of features and few observations, which makes modeling a non-trivial undertaking. In this work we apply entropic filter methods for gene selection in combination with several off-the-shelf classifiers. Bootstrap resampling techniques are introduced to obtain more stable performance estimates. Our findings show that the proposed methodology permits a drastic reduction in dimension and offers attractive solutions in terms of both prediction accuracy and number of explanatory genes. A dimensionality reduction technique that preserves discrimination capabilities is used to visualize the selected genes.
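The pipeline outlined in the abstract (an entropy-based filter for gene selection, a classifier, and bootstrap resampling for performance estimation) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the chapter: mutual information on median-binarized expression values stands in for the entropic filter, a nearest-centroid rule stands in for the off-the-shelf classifiers, accuracy is measured on out-of-bag samples, and the data are synthetic. All function names are hypothetical.

```python
import math
import random
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def mutual_information(feature, labels):
    """I(X;Y) = H(X) + H(Y) - H(X,Y) for two discrete sequences."""
    joint = list(zip(feature, labels))
    return entropy(feature) + entropy(labels) - entropy(joint)


def discretize(column):
    """Binarize an expression column at its median (illustrative choice)."""
    s = sorted(column)
    median = s[len(s) // 2]
    return [int(v >= median) for v in column]


def rank_genes(X, y, k):
    """Indices of the k genes with the highest mutual information with y."""
    n_genes = len(X[0])
    scores = [
        mutual_information(discretize([row[g] for row in X]), y)
        for g in range(n_genes)
    ]
    return sorted(range(n_genes), key=lambda g: scores[g], reverse=True)[:k]


def nearest_centroid_predict(X_train, y_train, x, genes):
    """Predict the class whose centroid (on the selected genes) is closest to x."""
    centroids = {}
    for label in set(y_train):
        rows = [r for r, l in zip(X_train, y_train) if l == label]
        centroids[label] = [sum(r[g] for r in rows) / len(rows) for g in genes]
    return min(
        centroids,
        key=lambda l: sum((x[g] - c) ** 2 for g, c in zip(genes, centroids[l])),
    )


def bootstrap_accuracy(X, y, k, n_rounds=50, seed=0):
    """Mean out-of-bag accuracy over bootstrap resamples.

    Gene selection is repeated inside every resample so that the
    held-out samples never influence which genes are chosen.
    """
    rng = random.Random(seed)
    n = len(X)
    accuracies = []
    for _ in range(n_rounds):
        idx = [rng.randrange(n) for _ in range(n)]
        in_bag = set(idx)
        oob = [i for i in range(n) if i not in in_bag]
        # Skip degenerate resamples (no held-out data or a single class).
        if not oob or len({y[i] for i in idx}) < 2:
            continue
        Xb, yb = [X[i] for i in idx], [y[i] for i in idx]
        genes = rank_genes(Xb, yb, k)
        hits = sum(nearest_centroid_predict(Xb, yb, X[i], genes) == y[i] for i in oob)
        accuracies.append(hits / len(oob))
    if not accuracies:
        raise ValueError("no usable bootstrap resamples")
    return sum(accuracies) / len(accuracies)


def make_toy_data(n_samples=20, n_genes=30, seed=1):
    """Synthetic data: many noise genes, few samples, gene 0 informative."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n_samples)]
    X = [[rng.gauss(0, 1) for _ in range(n_genes)] for _ in range(n_samples)]
    for row, label in zip(X, y):
        row[0] += 10.0 * label  # large shift so gene 0 separates the classes
    return X, y
```

Calling `X, y = make_toy_data()` followed by `bootstrap_accuracy(X, y, k=3)` returns the mean out-of-bag accuracy for a three-gene model. Repeating the selection step inside each resample is a deliberate design choice in this sketch: selecting genes once on the full data would let the evaluation samples leak into the filter and inflate the estimate.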
Postprint (author’s final draft)
Part of book or chapter of book
English
UPC subject areas::Computer science::Computer applications::Bioinformatics; Computational biology; Data mining; Cancer -- Research; Biological data mining and knowledge discovery; Gene expression analysis; Tools and methods for computational biology and bioinformatics; Cancer informatics
Springer
https://link.springer.com/book/10.1007/978-1-4419-7046-6
Open Access