Scedar: A scalable Python package for single-cell RNA-seq exploratory data analysis
Open Access
- 27 April 2020
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 16 (4), e1007794
- https://doi.org/10.1371/journal.pcbi.1007794
Abstract
In single-cell RNA-seq (scRNA-seq) experiments, the number of individual cells has increased exponentially, and the sequencing depth of each cell has decreased significantly. As a result, analyzing scRNA-seq data requires extensive considerations of program efficiency and method selection. In order to reduce the complexity of scRNA-seq data analysis, we present scedar, a scalable Python package for scRNA-seq exploratory data analysis. The package provides a convenient and reliable interface for performing visualization, imputation of gene dropouts, detection of rare transcriptomic profiles, and clustering on large-scale scRNA-seq datasets. The analytical methods are efficient, and they also do not assume that the data follow certain statistical distributions. The package is extensible and modular, which would facilitate the further development of functionalities for future requirements with the open-source development community. The scedar package is distributed under the terms of the MIT license at https://pypi.org/project/scedar.This publication has 64 references indexed in Scilit:
- Bayesian approach to single-cell differential expression analysisNature Methods, 2014
- Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian CellsScience, 2014
- Unique transcriptome signature of mouse microgliaGlia, 2013
- DNMT3A mutations and clinical features in Chinese patients with acute myeloid leukemiaCancer Cell International, 2013
- Mammalian Genes Are Transcribed with Widely Different Bursting KineticsScience, 2011
- MapReduceCommunications of the ACM, 2008
- On the Surprising Behavior of Distance Metrics in High Dimensional SpaceLecture Notes in Computer Science, 2001
- Model Selection and the Principle of Minimum Description LengthJournal of the American Statistical Association, 2001
- Generalized k-nearest neighbor rulesFuzzy Sets and Systems, 1986
- Comparing partitionsJournal of Classification, 1985