Kernel-PCA data integration with enhanced interpretability

dc.contributor.author
Reverter Comes, Ferran
dc.contributor.author
Vegas Lozano, Esteban
dc.contributor.author
Oller i Sala, Josep Maria
dc.date.issued
2014-04-07T13:42:11Z
dc.date.issued
2014-04-07T13:42:11Z
dc.date.issued
2014-03
dc.date.issued
2014-04-07T13:42:12Z
dc.identifier
1752-0509
dc.identifier
https://hdl.handle.net/2445/53298
dc.identifier
637088
dc.identifier
25032747
dc.description.abstract
Background Nowadays, combining the different sources of information to improve the biological knowledge available is a challenge in bioinformatics. One of the most powerful methods for integrating heterogeneous data types are kernel-based methods. Kernel-based data integration approaches consist of two basic steps: firstly the right kernel is chosen for each data set; secondly the kernels from the different data sources are combined to give a complete representation of the available data for a given statistical task. Results We analyze the integration of data from several sources of information using kernel PCA, from the point of view of reducing dimensionality. Moreover, we improve the interpretability of kernel PCA by adding to the plot the representation of the input variables that belong to any dataset. In particular, for each input variable or linear combination of input variables, we can represent the direction of maximum growth locally, which allows us to identify those samples with higher/lower values of the variables analyzed. Conclusions The integration of different datasets and the simultaneous representation of samples and variables together give us a better understanding of biological knowledge.
dc.format
9 p.
dc.format
application/pdf
dc.language
eng
dc.publisher
BioMed Central
dc.relation
Reproducció del document publicat a: http://dx.doi.org/10.1186/1752-0509-8-S2-S6
dc.relation
BMC Systems Biology, 2014, vol. 8(S2), num. s6, p. 1-9
dc.relation
http://dx.doi.org/10.1186/1752-0509-8-S2-S6
dc.rights
cc-by (c) Reverter Comes, Ferran et al., 2014
dc.rights
http://creativecommons.org/licenses/by/3.0/es
dc.rights
info:eu-repo/semantics/openAccess
dc.source
Articles publicats en revistes (Genètica, Microbiologia i Estadística)
dc.subject
Estadística
dc.subject
Bioinformàtica
dc.subject
Mètodes estadístics
dc.subject
Programes d'ordinador
dc.subject
Processament de dades
dc.subject
Statistics
dc.subject
Bioinformatics
dc.subject
Statistical methods
dc.subject
Computer programs
dc.subject
Data processing
dc.title
Kernel-PCA data integration with enhanced interpretability
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)