dc.contributor.author
Psomopoulos, Fotis
dc.date.accessioned
2026-01-09T02:02:06Z
dc.date.available
2026-01-09T02:02:06Z
dc.date.issued
2022-09-29
dc.identifier
Psomopoulos, F. The unappreciated role of K-mers in bioinformatics. A: Severo Ochoa Research Seminars at BSC. «8th Severo Ochoa Research Seminar Lectures at BSC, Barcelona, 2022-23». Barcelona: Barcelona Supercomputing Center, 2022, p. 16-17.
dc.identifier
https://hdl.handle.net/2117/449802
dc.identifier.uri
http://hdl.handle.net/2117/449802
dc.description.abstract
K-mers are used on a daily basis in bioinformatics. Although
they have existed at the core of several popular tools for
genome assembly for quite some time, until recently they have
been woefully underutilized. Although k-mer counting is
simple and straightforward, it becomes a real challenge when
attempting to deal with the huge amounts of data generated in
high-throughput sequencing. However, having a simple
representation of the actual data with few degrees of freedom
(i.e. the k-value and the 4 letters – when dealing with nucleotide
sequences), does provide the perfect opportunity to investigate
novel mixes of methods and techniques derived from various
fields. In that context, the real challenge is to map the biological
questions to a corresponding modelling approach. Such
examples could be the application of Gödel numbering as a
means of transforming the search space for sequence similarity,
application of pruned trees and entropy for identifying novel
features in sequences, and binning methods for metagenomics
classification.
dc.format
application/pdf
dc.publisher
Barcelona Supercomputing Center
dc.rights
http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights
Attribution-NonCommercial-NoDerivatives 4.0 International
dc.subject
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject
High performance computing
dc.subject
Càlcul intensiu (Informàtica)
dc.title
The unappreciated role of K-mers in bioinformatics
dc.type
Conference report