Machine Learning Methods for COVID-19 Prediction Using Human Genomic Data
Open Access
- 16 March 2021
- conference paper
- conference paper
- Published by MDPI AG in Proceedings
- Vol. 74 (1), 20
- https://doi.org/10.3390/proceedings2021074020
Abstract
Accurate identification of COVID-19 is now a critical task since it has seriously damaged daily life, public health, and the economy. It is essential to identify the infected people to prevent the further spread of the pandemic and to treat infected patients quickly. Machine learning techniques have a significant role in predicting of COVID-19. In this study, we performed binary classification (COVID-19 vs. other types of coronavirus) by extracting features from genome sequences. Support vector machines, naive Bayes, K-nearest neighbor, and random forest methods were used for classification. We used viral gene sequences from the 2019 Novel Coronavirus Resource Database. Experimental results presented show that a decision tree method achieved 93% accuracy.Keywords
This publication has 17 references indexed in Scilit:
- An open-source k-mer based machine learning tool for fast and accurate subtyping of HIV-1 genomesPLOS ONE, 2018
- An investigation into inter- and intragenomic variations of graphic genomic signaturesBMC Bioinformatics, 2015
- Mapping the Space of Genomic SignaturesPLOS ONE, 2015
- Applications of alignment-free methods in epigenomicsBriefings in Bioinformatics, 2013
- An experimental comparison of classification algorithms for imbalanced credit scoring data setsExpert Systems with Applications, 2012
- Isolation and Characterization of Viruses Related to the SARS Coronavirus from Animals in Southern ChinaScience, 2003
- Alignment-free sequence comparison—a reviewBioinformatics, 2003
- CpGProD: identifying CpG islands associated with transcription start sites in large genomic mammalian sequencesBioinformatics, 2002
- Basic local alignment search toolJournal of Molecular Biology, 1990
- [5] Rapid and sensitive sequence comparison with FASTP and FASTAMethods in Enzymology, 1990