scSampler: fast diversity-preserving subsampling of large-scale single-cell transcriptomic data.
- Song, Dongyuan;
- Xi, Nan Miles;
- Li, Jingyi Jessica;
- Wang, Lin
- Editor(s): Vitek, Olga
Published Web Location
https://www.biorxiv.org/content/10.1101/2022.01.15.476407v1
No data is associated with this publication.
Abstract
Summary
The number of cells measured in single-cell transcriptomic datasets has grown rapidly in recent years. For such large-scale data, subsampling is a powerful and often necessary tool for exploratory data analysis. However, simple random subsampling is not ideal for preserving rare cell types. Diversity-preserving subsampling is therefore required for fast exploration of cell types in a large-scale dataset. Here, we propose scSampler, an algorithm for fast diversity-preserving subsampling of single-cell transcriptomic data.
Availability and implementation
scSampler is implemented in Python and is published under the MIT source license. It can be installed with "pip install scsampler" and used with the Scanpy pipeline. The code is available on GitHub: https://github.com/SONGDONGYUAN1994/scsampler. An R interface is available at: https://github.com/SONGDONGYUAN1994/rscsampler.
Supplementary information
Supplementary data are available at Bioinformatics online.
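Illustrative sketch
The Summary's point that random subsampling tends to lose rare cell types can be illustrated with a small toy example. The sketch below is not the scSampler algorithm and does not use the scsampler package; it swaps in a generic greedy farthest-point (maximin) selection as a stand-in for a diversity-preserving sampler. The synthetic data, cluster sizes, and all function names (random_subsample, farthest_point_subsample, make_toy_data) are assumptions made for illustration only.

```python
import numpy as np


def random_subsample(n_cells, k, rng):
    """Uniform random subsampling: k cell indices without replacement."""
    return rng.choice(n_cells, size=k, replace=False)


def farthest_point_subsample(X, k, rng):
    """Greedy farthest-point (maximin) subsampling.

    Illustrative stand-in for a diversity-preserving sampler;
    this is NOT the scSampler algorithm.
    """
    n = X.shape[0]
    selected = [int(rng.integers(n))]  # random starting cell
    # Distance from every cell to its nearest already-selected cell.
    min_dist = np.linalg.norm(X - X[selected[0]], axis=1)
    for _ in range(k - 1):
        nxt = int(np.argmax(min_dist))  # cell farthest from the current subsample
        selected.append(nxt)
        dist_to_nxt = np.linalg.norm(X - X[nxt], axis=1)
        min_dist = np.minimum(min_dist, dist_to_nxt)
    return np.array(selected)


def make_toy_data(rng, n_common=5000, n_rare=20, n_dims=10):
    """Hypothetical data: one large 'common' cluster and one small 'rare' cluster."""
    common = rng.normal(0.0, 1.0, size=(n_common, n_dims))
    rare = rng.normal(8.0, 1.0, size=(n_rare, n_dims))
    X = np.vstack([common, rare])
    labels = np.array([0] * n_common + [1] * n_rare)  # 1 marks a rare cell
    return X, labels


rng = np.random.default_rng(0)
X, labels = make_toy_data(rng)
k = 50

rand_idx = random_subsample(len(X), k, rng)
fps_idx = farthest_point_subsample(X, k, rng)

print("rare cells kept (random):        ", int(labels[rand_idx].sum()))
print("rare cells kept (farthest-point):", int(labels[fps_idx].sum()))
```

With about 0.4% rare cells, a uniform sample of 50 cells is expected to contain well under one rare cell on average, whereas a sampler that spreads its picks across the feature space reaches the distant rare cluster almost immediately. In practice such a selection would presumably be run on a low-dimensional representation (for example, principal components) rather than on raw counts; that choice is likewise an assumption here, not a statement about scSampler's internals.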