Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes.

dc.contributor.author
Guigó, Roderic
dc.contributor.author
Dermitzakis, E.T.
dc.contributor.author
Agarwal, Pankaj
dc.contributor.author
Ponting, Chris P
dc.contributor.author
Parra, Genís
dc.contributor.author
Reymond, Alexandre
dc.contributor.author
Abril Ferrando, Josep Francesc, 1970-
dc.contributor.author
Keibler, Evan
dc.contributor.author
Lyle, Robert
dc.contributor.author
Ucla, Catherine
dc.contributor.author
Antonarakis, Stylianos E.
dc.contributor.author
Brent, Michael R.
dc.date.issued
2023-01-26T09:29:22Z
dc.date.issued
2023-01-26T09:29:22Z
dc.date.issued
2003
dc.date.issued
2023-01-26T09:29:22Z
dc.identifier
0027-8424
dc.identifier
https://hdl.handle.net/2445/192641
dc.identifier
555709
dc.description.abstract
A primary motivation for sequencing the mouse genome was to accelerate the discovery of mammalian genes by using sequence conservation between mouse and human to identify coding exons. Achieving this goal proved challenging because of the large proportion of the mouse and human genomes that is apparently conserved but apparently does not code for protein. We developed a two-stage procedure that exploits the mouse and human genome sequences to produce a set of genes with a much higher rate of experimental verification than previously reported prediction methods. RT-PCR amplification and direct sequencing applied to an initial sample of mouse predictions that do not overlap previously known genes verified the regions flanking one intron in 139 predictions, with verification rates reaching 76%. On average, the confirmed predictions show more restricted expression patterns than the mouse orthologs of known human genes, and two-thirds lack homologs in fish genomes, demonstrating the sensitivity of this dual-genome approach to hard-to-find genes. We verified 112 previously unknown homologs of known proteins, including two homeobox proteins relevant to developmental biology, an aquaporin, and a homolog of dystrophin. We estimate that transcription and splicing can be verified for >1,000 gene predictions identified by this method that do not overlap known genes. This is likely to constitute a significant fraction of the previously unknown, multiexon mammalian genes.
dc.format
6 p.
dc.format
application/pdf
dc.format
application/pdf
dc.language
eng
dc.publisher
National Academy of Sciences
dc.relation
Reproducció del document publicat a: https://doi.org/10.1073/pnas.0337561100
dc.relation
Proceedings of the National Academy of Sciences of the United States of America - PNAS, 2003, vol. 100, num. 3, p. 1140-1145
dc.relation
https://doi.org/10.1073/pnas.0337561100
dc.rights
(c) National Academy of Sciences, 2003
dc.rights
info:eu-repo/semantics/openAccess
dc.source
Articles publicats en revistes (Genètica, Microbiologia i Estadística)
dc.subject
Mamífers
dc.subject
Gens
dc.subject
Rates (Animals de laboratori)
dc.subject
Mammals
dc.subject
Genes
dc.subject
Rats as laboratory animals
dc.title
Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes.
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)