Feature selection for microarray gene expression data using simulated annealing guided by the multivariate joint entropy

Other authors

Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics

Universitat Politècnica de Catalunya. SOCO - Soft Computing

Publication date

2013

Abstract

In this work a new way to calculate the multivariate joint entropy is presented. This measure is the basis for a fast information-theoretic based evaluation of gene relevance in a Microarray Gene Expression data context. Its low complexity is based on the reuse of previous computations to calculate current feature relevance. The mu-TAFS algorithm --named as such to differentiate it from previous TAFS algorithms-- implements a simulated annealing technique specially designed for feature subset selection. The algorithm is applied to the maximization of gene subset relevance in several public-domain microarray data sets. The experimental results show a notoriously high classification performance and low size subsets formed by biologically meaningful genes.


Postprint (published version)

Document Type

External research report

Language

English

Related items

LSI-13-2-R

Recommended citation

This citation was generated automatically.

Rights

Open Access

This item appears in the following Collection(s)

E-prints [72987]