Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The . gov means it’s official. Federal government websites often end in VSports app下载. gov or . mil. Before sharing sensitive information, make sure you’re on a federal government site. .

Https

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely. V体育官网.

. 2018 Feb 6;19(1):15.
doi: 10.1186/s13059-017-1382-0.

SCANPY: large-scale single-cell gene expression data analysis

Affiliations

SCANPY: large-scale single-cell gene expression data analysis

VSports最新版本 - F Alexander Wolf et al. Genome Biol. .

Abstract

SCANPY is a scalable toolkit for analyzing single-cell gene expression data. It includes methods for preprocessing, visualization, clustering, pseudotime and trajectory inference, differential expression testing, and simulation of gene regulatory networks VSports手机版. Its Python-based implementation efficiently deals with data sets of more than one million cells ( https://github. com/theislab/Scanpy ). Along with SCANPY, we present ANNDATA, a generic class for handling annotated data matrices ( https://github. com/theislab/anndata ). .

Keywords: Bioinformatics; Clustering; Differential expression testing; Graph analysis; Machine learning; Pseudotemporal ordering; Scalability; Single-cell transcriptomics; Trajectory inference; Visualization. V体育安卓版.

PubMed Disclaimer

Conflict of interest statement (VSports app下载)

Ethics approval and consent to participate

Ethics approval was not applicable for this study.

Competing interests

None of the authors declare competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
aSCANPY’s analysis features. We use the example of 68,579 peripheral blood mononuclear cells of [6]. We regress out confounding variables, normalize, and identify highly variable genes. TSNE and graph-drawing (Fruchterman–Reingold) visualizations show cell-type annotations obtained by comparisons with bulk expression. Cells are clustered using the Louvain algorithm. Ranking differentially expressed genes in clusters identifies the MS4A1 marker gene for B cells in cluster 7, which agrees with the bulk labels. We use pseudotemporal ordering from a root cell in the CD34+ cluster and detect a branching trajectory, visualized with TSNE and diffusion maps. b Speedup over CELL RANGER R kit. We consider representative steps of the analysis [6]. c Visualizing and clustering 1.3 million cells. The data, brain cells from E18 mice, are publicly available from 10x Genomics. PCA = principal component analysis, DC = diffusion component

References

    1. Wagner A, Regev A, Yosef N. Revealing the vectors of cellular identity with single-cell genomics. Nat Biotechnol. 2016;34:1145–60. doi: 10.1038/nbt.3711. - DOI - PMC - PubMed
    1. Satija R, Farrell JA, Gennert D, Schier AF, Regev A. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015;33:495–502. doi: 10.1038/nbt.3192. - DOI (V体育官网入口) - PMC - PubMed
    1. Trapnell C, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32:381–6. doi: 10.1038/nbt.2859. - DOI - PMC - PubMed
    1. Kharchenko PV, Silberstein L, Scadden DT, Bayesian approach to single-cell differential expression analysis Nat Methods. 2014;11:740–2. doi: 10.1038/nmeth.2967. - DOI - PMC - PubMed
    1. Finak, G et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 2015;16:278. doi: 10.1186/s13059-015-0844-5. - DOI - PMC - PubMed

Publication types