Credibility Analysis of Putative Disease-Causing Genes Using Bioinformatics
Open Access
- 5 June 2013
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 8 (6), e64899
- https://doi.org/10.1371/journal.pone.0064899
Abstract
Genetic studies are challenging in many complex diseases, particularly those that have limited diagnostic certainty, have low prevalence, or are diseases of old age. As a result, genes may be reported as disease-causing with varying levels of evidence, and in some cases the data may be so limited as to be indistinguishable from chance findings. When there are large numbers of such genes, an objective method for ranking the evidence is useful. Using the complex neurodegenerative disease amyotrophic lateral sclerosis (ALS) as a model, and the disease-specific database ALSoD, our objective was to develop a method that uses publicly available data to generate a credibility score for putative disease-causing genes. Genes with at least one publication suggesting involvement in adult-onset familial ALS were collated following an exhaustive literature search. SQL was used to generate a score from information extracted from the publications, and this score was combined with a pathogenicity analysis using bioinformatics tools. The resulting score allowed us to rank genes in order of credibility. To validate the method, we compared the objective ranking with a ranking generated by ALS genetics experts, using Spearman's rho to compare the rankings produced by the different methods. The automated method ranked ALS genes in the following order: SOD1, TARDBP, FUS, ANG, SPG11, NEFH, OPTN, ALS2, SETX, FIG4, VAPB, DCTN1, TAF15, VCP, DAO. This agreed well with the ranking by ALS genetics experts, with a Spearman's rho of 0.69 (P = 0.009). We have presented an automated method for scoring the level of evidence for a gene being disease-causing. In developing the method we used ALS as the model disease, but it could equally be applied to any disease in which there is genotypic uncertainty.
This publication has 25 references indexed in Scilit:
- Expanded GGGGCC Hexanucleotide Repeat in Noncoding Region of C9ORF72 Causes Chromosome 9p-Linked FTD and ALS. Neuron, 2011
- A Hexanucleotide Repeat Expansion in C9ORF72 Is the Cause of Chromosome 9p21-Linked ALS-FTD. Neuron, 2011
- The risk to relatives of patients with sporadic amyotrophic lateral sclerosis. Brain, 2011
- Keeping up with genetic discoveries in amyotrophic lateral sclerosis: The ALSoD and ALSGene databases. Amyotrophic Lateral Sclerosis, 2011
- Clinical phenotypes of a large Chinese multigenerational kindred with autosomal dominant familial ALS due to Ile149Thr SOD1 gene mutation. Amyotrophic Lateral Sclerosis, 2006
- Power of the Mann–Kendall and Spearman's rho tests for detecting monotonic trends in hydrological series. Journal of Hydrology, 2002
- Mutations in Cu/Zn superoxide dismutase gene are associated with familial amyotrophic lateral sclerosis. Nature, 1993
- Critical Values for Spearman’s Rank Order Correlation. Journal of Educational Statistics, 1989
- The variance of Spearman's rho in normal samples. Biometrika, 1961
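
The validation step described in the abstract, comparing two rankings of the same genes with Spearman's rho, can be illustrated with a minimal sketch. The gene order below is the automated ranking quoted in the abstract. The expert ranking is not given on this page, so the second ranking used here is a hypothetical stand-in (the automated order with its top two genes swapped), and the function name `spearman_rho` is likewise an assumption for illustration, not the authors' code. The sketch uses the tie-free formula rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)).

```python
# Automated ranking as quoted in the abstract (most to least credible).
AUTOMATED = ["SOD1", "TARDBP", "FUS", "ANG", "SPG11", "NEFH", "OPTN", "ALS2",
             "SETX", "FIG4", "VAPB", "DCTN1", "TAF15", "VCP", "DAO"]

# HYPOTHETICAL comparison ranking, for illustration only: the automated
# order with its first two genes swapped. It is NOT the experts' ranking.
HYPOTHETICAL = [AUTOMATED[1], AUTOMATED[0]] + AUTOMATED[2:]


def spearman_rho(order_a, order_b):
    """Spearman's rho for two tie-free rankings of the same items.

    Each argument lists the items from rank 1 downwards.
    """
    if len(order_a) != len(order_b) or set(order_a) != set(order_b):
        raise ValueError("both rankings must contain the same items")
    if len(set(order_a)) != len(order_a):
        raise ValueError("rankings must not contain duplicates")
    n = len(order_a)
    if n < 2:
        raise ValueError("need at least two items to compare rankings")

    # Rank of each item in the second ordering (1-based).
    rank_b = {item: rank for rank, item in enumerate(order_b, start=1)}

    # Sum of squared rank differences across all items.
    d_squared = sum((rank_a - rank_b[item]) ** 2
                    for rank_a, item in enumerate(order_a, start=1))

    return 1 - 6 * d_squared / (n * (n * n - 1))


if __name__ == "__main__":
    print(spearman_rho(AUTOMATED, AUTOMATED))        # identical rankings
    print(spearman_rho(AUTOMATED, AUTOMATED[::-1]))  # fully reversed ranking
    print(spearman_rho(AUTOMATED, HYPOTHETICAL))     # one adjacent swap
```

Identical rankings give a rho of 1 and a fully reversed ranking gives -1, so the reported value of 0.69 indicates substantial but imperfect agreement between the automated and expert rankings. A ranking that contains ties would need averaged ranks and the Pearson correlation of those ranks rather than this shortcut formula.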