Detecting heterogeneity in generalized linear modeling

Home | About RECERCAT | Contact

Català | Castellano

All of RECERCAT

By Communities &
Collections By Defense Date By Authors By Titles By Subject

This Collection

By Defense Date By Authors By Titles By Subject

Statistics

View Statistics All RECERCAT

My RECERCAT

Other repositories directory

RECERCAT Home > Universitat Politècnica de Catalunya > Tesines i projectes i treballs de final de carrera > View document

To access the full text documents, please follow this link: http://hdl.handle.net/2117/100826

dc.contributor	Universitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa
dc.contributor	Aluja Banet, Tomàs
dc.contributor.author	Hernández Potiomkin, Yaroslav
dc.date	2017-02
dc.identifier.citation	123241
dc.identifier.uri	http://hdl.handle.net/2117/100826
dc.description.abstract	In classical model fitting techinques, such as traditional Multiple Linear Regression models (MLR) or Generalized Linear Models (GLM), the assumption is that the individuals come from homogeneous population. However, this condition may be not necessarily met, as there may be many factors that influence the behaviour of the individuals and therefore, biasing the model estimations. For instance, let us consider that we want to study the salaries among a certain set of individuals that come from relatively defined professional sector. The first approach would be to collect all possible modeling variables and fit the model. But it may happen that this could lead us to inaccurate estimations, since the salaries can be driven differently according to gender, region, ethnicity, among others. These variables are called segmentation variables and their number may grow very fast. In this case arises a combinatorial problem giving many possibilities of how to group those individuals. Our main goal in this work, is to go deeper in this kind of problems, and present an automatic solution to detect homogeneous segments among the heterogeneous population in the GLM context. The PATHMOX methodology is a powerful method proposed by Gastón (2009) [19] to automate the task of finding segments. The statistical tests needed to guide the PATHMOX algorithm and discover the constructs that differentiate those segments, are proposed by Lamberti (2015) [8]. First, we provide several solutions to detect heterogeneity, by means of moderating variables as in Covariance Analysis or by means of comparison of coefficients using parametric or non-parametric approaches, in section 2. Additionally, we present the method to characterize classes or continuous response by taking into account only segmentation variables in section 4. Then, we concentrate on the Generalized Linear Modeling context to define the automatic heterogeneity detection method. Then, we accurately present all the needed hypothesis test procedures in section 3. Finally, we also carry out a quite extensive simulation studies and a real problem application in sections 6 and 7, respectively.
dc.language.iso	eng
dc.publisher	Universitat Politècnica de Catalunya
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Àrees temàtiques de la UPC::Informàtica
dc.subject	Linear models (Statistics)
dc.subject	Heterogeneity
dc.subject	Generalized Linear Modeling
dc.subject	F-statistic
dc.subject	Likelihood Ratio Test
dc.subject	Pathmox
dc.subject	class characterization
dc.subject	valeur-test
dc.subject	non-parametric tests
dc.subject	Analysis of Covariance
dc.subject	statistical test
dc.subject	Models lineals (Estadística)
dc.title	Detecting heterogeneity in generalized linear modeling
dc.title	Detecting heterogeneity in generalized linear modeling and principal component analysis
dc.type	info:eu-repo/semantics/masterThesis

Show simple item record

All of RECERCAT

This Collection

Statistics

My RECERCAT

Related documents

Other documents of the same author