GDASC: a GPU parallel-based web server for detecting hidden batch factors
- 5 May 2020
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 36 (14), 4211-4213
- https://doi.org/10.1093/bioinformatics/btaa427
Abstract
Summary: We developed GDASC, a web version of our former DASC algorithm implemented with GPU. It provides a user-friendly web interface for detecting batch factors. Based on the good performance of DASC algorithm, it is able to give the most accurate results. For two steps of DASC, data-adaptive shrinkage and semi-non-negative matrix factorization, we designed parallelization strategies facing convex clustering solution and decomposition process. It runs more than 50 times faster than the original version on the representative RNA sequencing quality control dataset. With its accuracy and high speed, this server will be a useful tool for batch effects analysis. Availability and implementation: http://bioinfo.nankai.edu.cn/gdasc.php. Contact: zhanghan@nankai.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
Funding Information
- National Natural Science Foundation of China (61973174, 31728013)
- Computational facilities
This publication has 10 references indexed in Scilit:
- MPI-FAUN: An MPI-Based Framework for Alternating-Updating Nonnegative Matrix FactorizationIEEE Transactions on Knowledge and Data Engineering, 2017
- Detecting hidden batch factors through data-adaptive adjustment for biological effectsBioinformatics, 2017
- Removal of batch effects using distribution-matching residual networksBioinformatics, 2017
- A reanalysis of mouse ENCODE comparative gene expression dataF1000Research, 2015
- Removing batch effects from purified plasma cell gene expression microarrays with modified ComBatBMC Bioinformatics, 2015
- A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control ConsortiumNature Biotechnology, 2014
- PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generationParallel Computing, 2011
- Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable AnalysisPLoS Genetics, 2007
- On the design and analysis of gene expression studies in human populationsNature Genetics, 2007
- Adjusting batch effects in microarray expression data using empirical Bayes methodsBiostatistics, 2006