gMSR: A Multi-GPU Algorithm to Accelerate a Massive Validation of Biclusters
Open Access
- 27 October 2020
- journal article
- research article
- Published by MDPI AG in Electronics
- Vol. 9 (11), 1782
- https://doi.org/10.3390/electronics9111782
Abstract
Nowadays, Biclustering is one of the most widely used machine learning techniques to discover local patterns in datasets from different areas such as energy consumption, marketing, social networks or bioinformatics, among them. Particularly in bioinformatics, Biclustering techniques have become extremely time-consuming, also being huge the number of results generated, due to the continuous increase in the size of the databases over the last few years. For this reason, validation techniques must be adapted to this new environment in order to help researchers focus their efforts on a specific subset of results in an efficient, fast and reliable way. The aforementioned situation may well be considered as Big Data context. In this sense, multiple machine learning techniques have been implemented by the application of Graphic Processing Units (GPU) technology and CUDA architecture to accelerate the processing of large databases. However, as far as we know, this technology has not yet been applied to any bicluster validation technique. In this work, a multi-GPU version of one of the most used bicluster validation measure, Mean Squared Residue (MSR), is presented. It takes advantage of all the hardware and memory resources offered by GPU devices. Because of to this, gMSR is able to validate a massive number of biclusters in any Biclustering-based study within a Big Data context.Keywords
This publication has 38 references indexed in Scilit:
- Rough assessment of GPU capabilities for parallel PCC-based biclustering method applied to microarray data setsbams, 2015
- BiTrinA—multiscale binarization and trinarization with quality analysisBioinformatics, 2015
- Effective biclustering on GPU - capabilities and constraintsPRZEGLĄD ELEKTROTECHNICZNY, 2015
- Machine learning applications in genetics and genomicsNature Reviews Genetics, 2015
- A New Study on Biclustering Tools, Biclusters Validation and Evaluation FunctionsInternational Journal of Computer Science & Engineering Survey, 2015
- CloudNMF: A MapReduce Implementation of Nonnegative Matrix Factorization for Large-scale Biological DatasetsGenomics, Proteomics & Bioinformatics, 2014
- PRC2 overexpression and PRC2-target gene repression relating to poorer prognosis in small cell lung cancerScientific Reports, 2013
- A biclustering algorithm for extracting bit-patterns from binary datasetsBioinformatics, 2011
- MapReduceCommunications of the ACM, 2008
- Using GOstats to test gene lists for GO term associationBioinformatics, 2006