The unappreciated role of K-mers in bioinformatics

Psomopoulos, Fotis

The unappreciated role of K-mers in bioinformatics

dc.contributor.author

Psomopoulos, Fotis

dc.date.accessioned

2026-01-09T02:02:06Z

dc.date.available

2026-01-09T02:02:06Z

dc.date.issued

2022-09-29

dc.identifier

Psomopoulos, F. The unappreciated role of K-mers in bioinformatics. A: Severo Ochoa Research Seminars at BSC. «8th Severo Ochoa Research Seminar Lectures at BSC, Barcelona, 2022-23». Barcelona: Barcelona Supercomputing Center, 2022, p. 16-17.

dc.identifier

https://hdl.handle.net/2117/449802

dc.identifier.uri

http://hdl.handle.net/2117/449802

dc.description.abstract

K-mers are used on a daily basis in bioinformatics. Although they have existed at the core of several popular tools for genome assembly for quite some time, until recently they have been woefully underutilized. Although k-mer counting is simple and straightforward, it becomes a real challenge when attempting to deal with the huge amounts of data generated in high-throughput sequencing. However, having a simple representation of the actual data with few degrees of freedom (i.e. the k-value and the 4 letters – when dealing with nucleotide sequences), does provide the perfect opportunity to investigate novel mixes of methods and techniques derived from various fields. In that context, the real challenge is to map the biological questions to a corresponding modelling approach. Such examples could be the application of Gödel numbering as a means of transforming the search space for sequence similarity, application of pruned trees and entropy for identifying novel features in sequences, and binning methods for metagenomics classification.

dc.format

2 p.

dc.format

application/pdf

dc.language

eng

dc.publisher

Barcelona Supercomputing Center

dc.rights

http://creativecommons.org/licenses/by-nc-nd/4.0/

dc.rights

Open Access

dc.rights

Attribution-NonCommercial-NoDerivatives 4.0 International

dc.subject

Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors

dc.subject

High performance computing

dc.subject

Càlcul intensiu (Informàtica)

dc.title

The unappreciated role of K-mers in bioinformatics

dc.type

Conference report

Fitxers en aquest element

Fitxers	Grandària	Format	Visualització
No hi ha fitxers associats a aquest element.

Aquest element apareix en la col·lecció o col·leccions següent(s)

Congressos [11159]