To access the full text documents, please follow this link: http://hdl.handle.net/2117/98194

A Compositional Approach to Allele Sharing Analysis
Galván Femenía, Iván; Graffelman, Jan
Universitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa; Universitat Politècnica de Catalunya. COSDA-UPC - COmpositional and Spatial Data Analysis
Relatedness is of great interest in population-based genetic association studies. These studies search for genetic factors related to disease. Many statistical methods used in population-based genetic association studies (such as standard regression models, t-tests, and logistic regression) assume that the observations (individuals) are independent. These techniques can fail if independence is not satisfied. Allele sharing is a powerful data analysis technique for analyzing the degree of dependence in diploid species. Two individuals can share 0, 1, or 2 alleles for any genetic marker. This sharing may be assessed for alleles identical by state (IBS) or identical by descent (IBD). Starting from IBS alleles, it is possible to detect the type of relationship of a pair of individuals by using graphical methods. Typical allele sharing analysis consists of plotting the fraction of loci sharing 2 IBS alleles versus the fraction of sharing 0 IBS alleles. Compositional data analysis can be applied to allele sharing analysis because the proportions of sharing 0, 1 or 2 IBS alleles (denoted by $p_0$, $p_1$, and $p_2$) form a 3-part-composition. This chapter provides a graphical method to detect family relationships by plotting the isometric log-ratio transformation of $p_0, p_1$, and $p_2$. On the other hand, the probabilities of sharing 0, 1, or 2 IBD alleles (denoted by $k_0, k_1, k_2$), which are termed Cotterman’s coefficients, depend on the relatedness: monozygotic twins, full-siblings, parent-offspring, avuncular, first cousins, etc. It is possible to infer the type of family relationship of a pair of individuals by using maximum likelihood methods. As a result, the estimated vector $\bf{k} = (k_0, k_1, k_2)$ for each pair of individuals forms a 3-part-composition and can be plotted in a ternary diagram to identify the degree of relatedness. An R package has been developed for the study of genetic relatedness based on genetic markers such as microsatellites and single nucleotide polymorphisms from human populations, and is used for the computations and graphics of this contribution.
Peer Reviewed
-Àrees temàtiques de la UPC::Matemàtiques i estadística::Estadística aplicada
-Mathematical statistics
-Molecular biology
-Genetics
-Allele sharing
-identical by state
-identical by descent
-Cotterman's coefficients
-ternary diagram
-isometric log-ratio transformation
-Estadística matemàtica -- Aplicacions
-Genètica
-Biologia molecular
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Article - Submitted version
Book Part
Springer
         

Show full item record

Related documents

Other documents of the same author

Graffelman, Jan; Galván Femenía, Iván; de Cid, Rafael; Barceló Vidal, Carles
Galván Femenía, Iván; Graffelman, Jan; Barceló i Vidal, Carles
Blay Magriñá, Natàlia; Casas, Eduard; Galván Femenía, Iván; Graffelman, Jan; de Cid, Rafael; Vavouri, Tanya
Galván Femenía, Iván; Graffelman, Jan; Barceló Vidal, Carles
 

Coordination

 

Supporters