FCCC LOGO Faculty Publications
Huerta-Chagoya A , Moreno-Macias H , Fernandez-Lopez JC , Ordonez-Sanchez ML , Rodriguez-Guillen R , Contreras A , Hidalgo-Miranda A , Alfaro-Ruiz LA , Salazar-Fernandez EP , Moreno-Estrada A , Aguilar-Salinas CA , Tusie-Luna T
A panel of 32 AIMs suitable for population stratification correction and global ancestry estimation in Mexican mestizos
BMC Genet. 2019 Jan 8;20(1) :5
PMID: 30621578    PMCID: PMC6323778    URL: https://www.ncbi.nlm.nih.gov/pubmed/30621578
Back to previous list
Abstract
BACKGROUND: Association studies are useful to unravel the genetic basis of common human diseases. However, the presence of undetected population structure can lead to both false positive results and failures to detect genuine associations. Even when most of the approaches to deal with population stratification require genome-wide data, the use of a well-selected panel of ancestry informative markers (AIMs) may appropriately correct for population stratification. Few panels of AIMs have been developed for Latino populations and most contain a high number of markers (> 100 AIMs). For some association studies such as candidate gene approaches, it may be unfeasible to genotype a numerous set of markers to avoid false positive results. In such cases, methods that use fewer AIMs may be appropriate. RESULTS: We validated an accurate and cost-effective panel of AIMs, for use in population stratification correction of association studies and global ancestry estimation in Mexicans, as well as in populations having large proportions of both European and Native American ancestries. Based on genome-wide data from 1953 Mexican individuals, we performed a PCA and SNP weights were calculated to select subsets of unlinked AIMs within percentiles 0.10 and 0.90, ensuring that all chromosomes were represented. Correlations between PC1 calculated using genome-wide data versus each subset of AIMs (16, 32, 48 and 64) were r(2) = 0.923, 0.959, 0.972 and 0.978, respectively. When evaluating PCs performance as population stratification adjustment covariates, no correlation was found between P values obtained from uncorrected and genome-wide corrected association analyses (r(2) = 0.141), highlighting that population stratification correction is compulsory for association analyses in admixed populations. In contrast, high correlations were found when adjusting for both PC1 and PC2 for either subset of AIMs (r(2) > 0.900). After multiple validations, including an independent sample, we selected a minimal panel of 32 AIMs, which are highly informative of the major ancestral components of Mexican mestizos, namely European and Native American ancestries. Finally, the correlation between the global ancestry proportions calculated using genome-wide data and our panel of 32 AIMs was r(2) = 0.972. CONCLUSIONS: Our panel of 32 AIMs accurately estimated global ancestry and corrected for population stratification in association studies in Mexican individuals.
Notes
1471-2156 Huerta-Chagoya, Alicia Moreno-Macias, Hortensia Fernandez-Lopez, Juan Carlos Ordonez-Sanchez, Maria Luisa Rodriguez-Guillen, Rosario Contreras, Alejandra Hidalgo-Miranda, Alfredo Alfaro-Ruiz, Luis Alberto Salazar-Fernandez, Edgar Pavel Moreno-Estrada, Andres Aguilar-Salinas, Carlos Alberto Tusie-Luna, Teresa ORCID: http://orcid.org/0000-0003-4478-038X 128877/conacyt Journal Article England BMC Genet. 2019 Jan 8;20(1):5. doi: 10.1186/s12863-018-0707-7.