BnBeeEpi: An Approach of Epistasis Mining Based on Artificial Bee Colony Algorithm Optimizing Bayesian Network

Abstract
Mining epistatic gene locus which influence complex disease has great research significance. Bayesian network (BN) has been widely used in many researches of epistasis mining. However, Bayesian network methods have disadvantages of being easily trapped into local optimum, low learning efficiency and not being able to handle large-scale network. In this work, we propose an epistasis mining approach based on artificial bee colony algorithm optimizing Bayesian network (BnBeeEpi). We apply artificial bee colony algorithm into the heuristic search strategy of Bayesian network, and then use two kinds of BN scoring functions (BIC and MIT) to calculate the network fitness value to avoid overfitting and reduce false positive rate. Moreover, we introduce decomposable BIC scoring to solve the large-scale network learning problem. Finally, we compare BnBeeEpi with current popular epistasis mining algorithms by using both simulated and real datasets. Experiment results show that omb-Fast has very short running time with its accuracy is as good as other methods, and BnBeeEpi has better F1-score and lower false positive rate compared to others. Availability and implementation: codes and visualization platform are available at: http://106.14.132.202/.