Analysis of Stop-Gain and Frameshift Variants in Human Innate Immunity Genes
Open Access
- 24 July 2014
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 10 (7), e1003757
- https://doi.org/10.1371/journal.pcbi.1003757
Abstract
Loss-of-function variants in innate immunity genes are associated with Mendelian disorders in the form of primary immunodeficiencies. Recent resequencing projects report that stop-gains and frameshifts are collectively prevalent in humans and could be responsible for some of the inter-individual variability in innate immune response. Current computational approaches evaluating loss-of-function in genes carrying these variants rely on gene-level characteristics such as evolutionary conservation and functional redundancy across the genome. However, innate immunity genes represent a particular case because they are more likely to be under positive selection and duplicated. To create a ranking of severity that would be applicable to innate immunity genes we evaluated 17,764 stop-gain and 13,915 frameshift variants from the NHLBI Exome Sequencing Project and 1,000 Genomes Project. Sequence-based features such as loss of functional domains, isoform-specific truncation and nonsense-mediated decay were found to correlate with variant allele frequency and validated with gene expression data. We integrated these features in a Bayesian classification scheme and benchmarked its use in predicting pathogenic variants against Online Mendelian Inheritance in Man (OMIM) disease stop-gains and frameshifts. The classification scheme was applied in the assessment of 335 stop-gains and 236 frameshifts affecting 227 interferon-stimulated genes. The sequence-based score ranks variants in innate immunity genes according to their potential to cause disease, and complements existing gene-based pathogenicity scores. Specifically, the sequence-based score improves measurement of functional gene impairment, discriminates across different variants in a given gene and appears particularly useful for analysis of less conserved genes. There are well-characterized severe immunodeficiencies associated with loss-of-function variants in innate immunity genes. Genome sequencing projects identify rare stop-gain and frameshift variants in innate immunity genes whose phenotype is uncharacterized. Current methods to estimate the severity of rare stop-gains and frameshifts are based on evolutionary conservation of the gene, the likelihood for redundancy in its function or mutational burden. These parameters are not always applicable to innate immunity genes. We evaluated sequence-level characteristics of more than 30'000 stop-gains and frameshifts and prioritized variants according to their predicted functional consequences. Our scoring approach complements existing tools in the prediction of innate immunity OMIM disease variants and associates with functional readouts such as gene expression. In this framework, we show that many individuals do carry highly pathogenic variants in genes participating in antiviral defense. The clinical assessment of these variants is of significant interest.Keywords
This publication has 39 references indexed in Scilit:
- Guidelines for investigating causality of sequence variants in human diseaseNature, 2014
- A general framework for estimating the relative pathogenicity of human genetic variantsNature Genetics, 2014
- Functional annotation of noncoding sequence variantsNature Methods, 2014
- Towards Precision Medicine: Advances in Computational Approaches for the Analysis of Human VariantsJournal of Molecular Biology, 2013
- Integrative Annotation of Variants from 1092 Humans: Application to Cancer GenomicsScience, 2013
- Genic Intolerance to Functional Variation and the Interpretation of Personal GenomesPLoS Genetics, 2013
- ANNOVAR: functional annotation of genetic variants from high-throughput sequencing dataNucleic Acids Research, 2010
- A method and server for predicting damaging missense mutationsNature Methods, 2010
- Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithmNature Protocols, 2009
- Immunology in natura: clinical, epidemiological and evolutionary genetics of infectious diseasesNature Immunology, 2007