Dimensionality reduction reveals fine-scale structure in the Japanese population with consequences for polygenic risk prediction
Open Access
- 26 March 2020
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Communications
- Vol. 11 (1), 1-11
- https://doi.org/10.1038/s41467-020-15194-z
Abstract
The diversity in our genome is crucial to understanding the demographic history of worldwide populations. However, it remains unknown whether subtle genetic differences within a population can be disentangled, or whether they have an impact on complex traits. Here we apply dimensionality reduction methods (PCA, t-SNE, PCA-t-SNE, UMAP, and PCA-UMAP) to biobank-derived genomic data of a Japanese population (n = 169,719). Dimensionality reduction reveals fine-scale population structure, conspicuously differentiating adjacent insular subpopulations. We further elucidate the demographic landscape of these Japanese subpopulations using population genetics analyses. Finally, we perform phenome-wide polygenic risk score (PRS) analyses on 67 complex traits. Differences in PRS between the deconvoluted subpopulations are not always concordant with those in the observed phenotypes, suggesting that the PRS differences might reflect biases from the uncorrected structure, in a trait-dependent manner. This study suggests that such an uncorrected structure can be a potential pitfall in the clinical application of PRS.
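As an illustration of the PRS comparison described in the abstract, the following is a minimal sketch, not the authors' pipeline. It assumes the common definition of a PRS as a weighted sum of risk-allele dosages and compares the mean score between two subpopulation labels. The effect sizes, genotypes, cluster names, and the function `polygenic_risk_score` are all hypothetical toy values chosen for the example.

```python
from statistics import mean


def polygenic_risk_score(dosages, effect_sizes):
    """Weighted sum of risk-allele dosages (0, 1, or 2) across variants."""
    if len(dosages) != len(effect_sizes):
        raise ValueError("dosages and effect_sizes must have the same length")
    return sum(d * b for d, b in zip(dosages, effect_sizes))


# Hypothetical per-variant effect sizes (toy values, not from the study).
effect_sizes = [0.10, -0.05, 0.20]

# Hypothetical genotypes: one dosage vector per individual, grouped by
# an assumed subpopulation label such as a cluster from a 2-D embedding.
genotypes = {
    "cluster_A": [[0, 1, 2], [1, 1, 2], [0, 2, 1]],
    "cluster_B": [[2, 0, 0], [1, 0, 1], [2, 1, 0]],
}

# Mean PRS per subpopulation.
mean_prs = {
    label: mean([polygenic_risk_score(g, effect_sizes) for g in individuals])
    for label, individuals in genotypes.items()
}

# Difference in mean PRS between the two toy clusters.
prs_gap = mean_prs["cluster_A"] - mean_prs["cluster_B"]
print(mean_prs, prs_gap)
```

A gap like `prs_gap` does not by itself indicate a true difference in genetic liability. As the abstract notes, it would need to be checked against the observed phenotype difference, since uncorrected structure could produce such a gap.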
Funding Information
- Qatar National Research Fund (4-344-3-105)
- Ministry of Education, Culture, Sports, Science and Technology (15H05911, 19H01021)
- Japan Agency for Medical Research and Development (19gm6010001h0004, 19ek0410041h0003, 19ek0109413h0001, 19km0405211h0001)
- Takeda Science Foundation