dc.contributor.author
Larivière, Delphine
dc.contributor.author
Abueg, Linelle Ann L.
dc.contributor.author
Brajuka, Nadolina
dc.contributor.author
Gallardo, Cristobal
dc.contributor.author
Grüning, Björn
dc.contributor.author
Ko, Byung June
dc.contributor.author
Ostrovsky, Alexander
dc.contributor.author
Palmada-Flores, Marc
dc.contributor.author
Pickett, Brandon
dc.contributor.author
Rabbani, Keon
dc.contributor.author
Balacco, Jennifer
dc.contributor.author
Chaisson, Mark
dc.contributor.author
Cheng, Haoyu
dc.contributor.author
Collins, Joanna
dc.contributor.author
Denisova, Alexandra
dc.contributor.author
Fedrigo, Olivier
dc.contributor.author
Gallo, Guido
dc.contributor.author
Giani, Alice Maria
dc.contributor.author
Gooder, Grenville MacDonald
dc.contributor.author
Jain, Nivesh
dc.contributor.author
Johnson, Cassidy
dc.contributor.author
Kim, Heebal
dc.contributor.author
Lee, Chul
dc.contributor.author
Marquès i Bonet, Tomàs
dc.contributor.author
O'Toole, Brian
dc.contributor.author
Rhie, Arang
dc.contributor.author
Secomandi, Simona
dc.contributor.author
Sozzoni, Marcella
dc.contributor.author
Tilley, Tatiana
dc.contributor.author
Uliano da Silva, Marcela
dc.contributor.author
van den Beek, Marius
dc.contributor.author
Waterhouse, Robert
dc.contributor.author
Phillippy, Adam M.
dc.contributor.author
Jarvis, Erich
dc.contributor.author
Schatz, Michael
dc.contributor.author
Nekrutenko, Anton
dc.contributor.author
Formenti, Giulio
dc.identifier
https://ddd.uab.cat/record/283858
dc.identifier
urn:10.1101/2023.06.28.546576
dc.identifier
urn:oai:ddd.uab.cat:283858
dc.identifier
urn:pmcid:PMC10327048
dc.identifier
urn:pmc-uid:10327048
dc.identifier
urn:pmid:37425881
dc.identifier
urn:oai:pubmedcentral.nih.gov:10327048
dc.description.abstract
Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhanced reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals).
dc.format
application/pdf
dc.relation
European Commission 101059492
dc.relation
Larivière, D., Abueg, L., Brajuka, N. et al. "Scalable, accessible and reproducible reference genome assembly and evaluation in Galaxy". Nature Biotechnology. Vol. 42, (March 2024), p. 367-370 ;
dc.relation
https://doi.org/10.1038/s41587-023-02100-3
dc.rights
Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial, la distribució, i la comunicació pública de l'obra, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. No es permet la creació d'obres derivades.
dc.rights
https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject
Genome assembly
dc.subject
Reproducibility
dc.title
Scalable, accessible, and reproducible reference genome assembly and evaluation in Galaxy