dc.contributor.author
Casanellas, M.
dc.contributor.author
Fernández-Sánchez, J.
dc.contributor.author
Garrote-López, M.
dc.contributor.author
Sabaté-Vidales, M.
dc.date.accessioned
2023-08-28T13:45:29Z
dc.date.accessioned
2024-09-19T14:35:49Z
dc.date.available
2023-08-28T13:45:29Z
dc.date.available
2024-09-19T14:35:49Z
dc.date.issued
2023-06-13
dc.identifier.uri
http://hdl.handle.net/2072/536855
dc.description.abstract
Homogeneity across lineages is a general assumption in phylogenetics according to which nucleotide substitution rates are common to all lineages. Many phylogenetic methods relax this hypothesis but keep a simple enough model to make the process of sequence evolution more tractable. On the other hand, dealing successfully with the general case (heterogeneity of rates across lineages) is one of the key features of phylogenetic reconstruction methods based on algebraic tools. The goal of this paper is twofold. First, we present a new weighting system for quartets (ASAQ) based on algebraic and semi-algebraic tools, thus especially indicated to deal with data evolving under heterogeneous rates. This method combines the weights of two previous methods by means of a test based on the positivity of the branch lengths estimated with the paralinear distance. ASAQ is statistically consistent when applied to data generated under the general Markov model, considers rate and base composition heterogeneity among lineages and does not assume stationarity nor time-reversibility. Second, we test and compare the performance of several quartet-based methods for phylogenetic tree reconstruction (namely QFM, wQFM, quartet puzzling, weight optimization and Willson’s method) in combination with several systems of weights, including ASAQ weights and other weights based on algebraic and semi-algebraic methods or on the paralinear distance. These tests are applied to both simulated and real data and support weight optimization with ASAQ weights as a reliable and successful reconstruction method that improves upon the accuracy of global methods (such as neighbor-joining or maximum likelihood) in the presence of long branches or on mixtures of distributions on trees. © 2023, The Author(s).
eng
dc.description.sponsorship
We would like to thank the reviewers of the paper for important contributions that improved the final version of the manuscript. MC, JFS and MGL were partially supported by Spanish State Research Agency grant PID2019-103849GB-I00. MC and JFS were also supported by AEI through the Severo Ochoa and María de Maeztu Program for Centers and Units of Excellence in R &D (project CEX2020-001084-M) and by the AGAUR project 2021 SGR 00603 Geometry of Manifolds and Applications, GEOMVAP.
dc.format.extent
11 p.
cat
dc.publisher
Springer
cat
dc.relation.ispartof
Bulletin of Mathematical Biology
cat
dc.rights
L'accés als continguts d'aquest document queda condicionat a l'acceptació de les condicions d'ús establertes per la següent llicència Creative Commons: https://creativecommons.org/licenses/by/4.0/
dc.source
RECERCAT (Dipòsit de la Recerca de Catalunya)
dc.subject.other
Algebraic methods for topology reconstruction; General Markov model; Heterogeneity across lineages; Paralinear method; Quartet-based methods
cat
dc.title
Designing Weights for Quartet-Based Methods When Data are Heterogeneous Across Lineages
cat
dc.type
info:eu-repo/semantics/article
cat
dc.type
info:eu-repo/semantics/publishedVersion
cat
dc.identifier.doi
10.1007/s11538-023-01167-y
cat
dc.rights.accessLevel
info:eu-repo/semantics/openAccess