Published August 21, 2020 | Version 2.0
Book Open

Single Cell Symphony



This is my first open-source book for single-cell RNA-Seq learners. Inspired by the Bioconductor team and I decided to translate it also added new content such as the instruction of CellRanger, Seurat, Monocle, and so on. Definitely I will keep it updating to be better.

Full online version please visit:

Of course, this book is free to read and share, but the modification of the content should contact the author firstly.


Files (68.8 kB)

Name Size Download all
1.9 kB Preview Download
835 Bytes Preview Download
3.7 kB Preview Download
3.9 kB Preview Download
2.0 kB Preview Download
2.2 kB Preview Download
1.9 kB Preview Download
1.9 kB Preview Download
2.7 kB Preview Download
2.2 kB Preview Download
2.2 kB Preview Download
2.8 kB Preview Download
1.7 kB Preview Download
1.9 kB Preview Download
3.5 kB Preview Download
1.9 kB Preview Download
1.3 kB Preview Download
1.5 kB Preview Download
2.4 kB Preview Download
997 Bytes Preview Download
2.3 kB Preview Download
1.1 kB Preview Download
2.1 kB Preview Download
1.9 kB Preview Download
1.4 kB Preview Download
1.0 kB Preview Download
1.6 kB Preview Download
1.5 kB Preview Download
2.0 kB Preview Download
1.9 kB Preview Download
4.2 kB Preview Download
4.6 kB Preview Download

Additional details


  • Anders, S., and W. Huber. 2010. "Differential expression analysis for sequence count data." Genome Biol. 11 (10): R106.
  • Aran, D., A. P. Looney, L. Liu, E. Wu, V. Fong, A. Hsu, S. Chak, et al. 2019. "Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage." Nat. Immunol. 20 (2): 163–72.
  • Bach, K., S. Pensa, M. Grzelak, J. Hadfield, D. J. Adams, J. C. Marioni, and W. T. Khaled. 2017. "Differentiation dynamics of mammary epithelial cells revealed by single-cell RNA sequencing." Nat Commun 8 (1): 2128.
  • Berger, R. L., and J. C. Hsu. 1996. "Bioequivalence Trials, Intersection-Union Tests and Equivalence Confidence Sets." Statist. Sci. 11 (4). The Institute of Mathematical Statistics: 283–319. doi:10.1214/ss/1032280304.
  • Bertoli, C., J. M. Skotheim, and R. A. de Bruin. 2013. "Control of cell cycle transcription during G1 and S phases." Nat. Rev. Mol. Cell Biol. 14 (8): 518–28.
  • Brennecke, P., S. Anders, J. K. Kim, A. A. Kołodziejczyk, X. Zhang, V. Proserpio, B. Baying, et al. 2013. "Accounting for technical noise in single-cell RNA-seq experiments." Nat. Methods 10 (11): 1093–5.
  • Buettner, F., K. N. Natarajan, F. P. Casale, V. Proserpio, A. Scialdone, F. J. Theis, S. A. Teichmann, J. C. Marioni, and O. Stegle. 2015. "Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells." Nat. Biotechnol. 33 (2): 155–60.
  • Butler, A., P. Hoffman, P. Smibert, E. Papalexi, and R. Satija. 2018. "Integrating single-cell transcriptomic data across different conditions, technologies, and species." Nat. Biotechnol. 36 (5): 411–20.
  • Chen, Y., A. T. Lun, and G. K. Smyth. 2016. "From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline." F1000Res 5: 1438.
  • Conboy, C. M., C. Spyrou, N. P. Thorne, E. J. Wade, N. L. Barbosa-Morais, M. D. Wilson, A. Bhattacharjee, et al. 2007. "Cell cycle genes are the evolutionarily conserved targets of the E2F4 transcription factor." PLoS ONE 2 (10): e1061.
  • Crowell, H. L., C. Soneson, P.-L. Germain, D. Calini, L. Collin, C. Raposo, D. Malhotra, and M. D. Robinson. 2019. "On the Discovery of Population-Specific State Transitions from Multi-Sample Multi-Condition Single-Cell Rna Sequencing Data." bioRxiv. doi:10.1101/713412.
  • Dahlin, J. S., F. K. Hamey, B. Pijuan-Sala, M. Shepherd, W. W. Y. Lau, S. Nestorowa, C. Weinreb, et al. 2018. "A single-cell hematopoietic landscape resolves 8 lineage trajectories and defects in Kit mutant mice." Blood 131 (21): e1–e11.
  • Fehlings, M., S. Chakarov, Y. Simoni, B. Sivasankar, F. Ginhoux, and E. W. Newell. 2018. "Multiplex peptide-MHC tetramer staining using mass cytometry for deep analysis of the influenza-specific T-cell response in mice." J. Immunol. Methods 453 (February): 30–36.
  • Finak, G., J. Frelinger, W. Jiang, E. W. Newell, J. Ramey, M. M. Davis, S. A. Kalams, S. C. De Rosa, and R. Gottardo. 2014. "OpenCyto: an open source infrastructure for scalable, robust, reproducible, and automated, end-to-end flow cytometry data analysis." PLoS Comput. Biol. 10 (8): e1003806.
  • Georgiou, G., G. C. Ippolito, J. Beausang, C. E. Busse, H. Wardemann, and S. R. Quake. 2014. "The promise and challenge of high-throughput sequencing of the antibody repertoire." Nat. Biotechnol. 32 (2): 158–68.
  • Glare, E. M., M. Divjak, M. J. Bailey, and E. H. Walters. 2002. "beta-Actin and GAPDH housekeeping gene expression in asthmatic airways is variable and not suitable for normalising mRNA levels." Thorax 57 (9): 765–70.
  • Godec, J., Y. Tan, A. Liberzon, P. Tamayo, S. Bhattacharya, A. J. Butte, J. P. Mesirov, and W. N. Haining. 2016. "Compendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation." Immunity 44 (1): 194–206.
  • Grun, D., M. J. Muraro, J. C. Boisset, K. Wiebrands, A. Lyubimova, G. Dharmadhikari, M. van den Born, et al. 2016. "De Novo Prediction of Stem Cell Identity using Single-Cell Transcriptome Data." Cell Stem Cell 19 (2): 266–77.
  • Guimaraes, J. C., and M. Zavolan. 2016. "Patterns of ribosomal protein expression specify normal and malignant human cells." Genome Biol. 17 (1): 236.
  • Haghverdi, L., A. T. L. Lun, M. D. Morgan, and J. C. Marioni. 2018. "Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors." Nat. Biotechnol. 36 (5): 421–27.
  • Hastie, T., and W. Stuetzle. 1989. "Principal Curves." J Am Stat Assoc 84 (406): 502–16.
  • Ilicic, T., J. K. Kim, A. A. Kołodziejczyk, F. O. Bagger, D. J. McCarthy, J. C. Marioni, and S. A. Teichmann. 2016. "Classification of low quality cells from single-cell RNA-seq data." Genome Biol. 17 (1): 29.
  • Islam, S., A. Zeisel, S. Joost, G. La Manno, P. Zajac, M. Kasper, P. Lonnerberg, and S. Linnarsson. 2014. "Quantitative single-cell RNA-seq with unique molecular identifiers." Nat. Methods 11 (2): 163–66.
  • Ji, Z., and H. Ji. 2016. "TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis." Nucleic Acids Res. 44 (13): e117.
  • Johnstone, I. M., and A. Y. Lu. 2009. "On Consistency and Sparsity for Principal Components Analysis in High Dimensions." J Am Stat Assoc 104 (486): 682–93.
  • Kang, H. M., M. Subramaniam, S. Targ, M. Nguyen, L. Maliskova, E. McCarthy, E. Wan, et al. 2018. "Multiplexed droplet single-cell RNA-sequencing using natural genetic variation." Nat. Biotechnol. 36 (1): 89–94.
  • Kotliar, D., A. Veres, M. A. Nagy, S. Tabrizi, E. Hodis, D. A. Melton, and P. C. Sabeti. 2019. "Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq." Elife 8 (July).
  • Langfelder, P., and S. Horvath. 2007. "Eigengene networks for studying the relationships between co-expression modules." BMC Syst Biol 1 (November): 54.
  • Law, C. W., Y. Chen, W. Shi, and G. K. Smyth. 2014. "voom: Precision weights unlock linear model analysis tools for RNA-seq read counts." Genome Biol. 15 (2): R29.
  • Lawlor, N., J. George, M. Bolisetty, R. Kursawe, L. Sun, V. Sivakamasundari, I. Kycia, P. Robson, and M. L. Stitzel. 2017. "Single-cell transcriptomes identify human islet cell signatures and reveal cell-type-specific expression changes in type 2 diabetes." Genome Res. 27 (2): 208–22.
  • Leek, J. T., W. E. Johnson, H. S. Parker, A. E. Jaffe, and J. D. Storey. 2012. "The sva package for removing batch effects and other unwanted variation in high-throughput experiments." Bioinformatics 28 (6): 882–83.
  • Lin, Y., S. Ghazanfar, K. Y. X. Wang, J. A. Gagnon-Bartsch, K. K. Lo, X. Su, Z. G. Han, et al. 2019. "scMerge leverages factor analysis, stable expression, and pseudoreplication to merge multiple single-cell RNA-seq datasets." Proc. Natl. Acad. Sci. U.s.a. 116 (20): 9775–84.
  • Love, M. I., W. Huber, and S. Anders. 2014. "Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2." Genome Biol. 15 (12): 550.
  • Lun, A. 2018. "Overcoming Systematic Errors Caused by Log-Transformation of Normalized Single-Cell Rna Sequencing Data." bioRxiv.
  • Lun, A. T. L., and J. C. Marioni. 2017. "Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data." Biostatistics 18 (3): 451–64.
  • Lun, A. T. L., F. J. Calero-Nieto, L. Haim-Vilmovsky, B. Gottgens, and J. C. Marioni. 2017. "Assessing the reliability of spike-in normalization for analyses of single-cell RNA sequencing data." Genome Res. 27 (11): 1795–1806.
  • Lun, A. T. L., D. J. McCarthy, and J. C. Marioni. 2016. "A Step-by-Step Workflow for Low-Level Analysis of Single-Cell RNA-seq Data." F1000Res. 5 (August).
  • Lun, A. T. L., H. Pages, and M. L. Smith. 2018. "beachmat: A Bioconductor C++ API for accessing high-throughput biological data from a variety of R matrix types." PLoS Comput. Biol. 14 (5): e1006135.
  • Lun, A. T. L., A. C. Richard, and J. C. Marioni. 2017. "Testing for differential abundance in mass cytometry data." Nat. Methods 14 (7): 707–9.
  • Lun, A. T., and G. K. Smyth. 2016. "csaw: a Bioconductor package for differential binding analysis of ChIP-seq data using sliding windows." Nucleic Acids Res. 44 (5): e45.
  • Lun, A. T., K. Bach, and J. C. Marioni. 2016. "Pooling across cells to normalize single-cell RNA sequencing data with many zero counts." Genome Biol. 17 (April): 75.
  • Lun, A., S. Riesenfeld, T. Andrews, T. P. Dao, T. Gomes, participants in the 1st Human Cell Atlas Jamboree, and J. Marioni. 2019. "EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data." Genome Biol. 20 (1): 63.
  • Macosko, E. Z., A. Basu, R. Satija, J. Nemesh, K. Shekhar, M. Goldman, I. Tirosh, et al. 2015. "Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets." Cell 161 (5): 1202–14.
  • Martens, J. H., and H. G. Stunnenberg. 2013. "BLUEPRINT: mapping human blood cell epigenomes." Haematologica 98 (10): 1487–9.
  • McCarthy, D. J., and G. K. Smyth. 2009. "Testing significance relative to a fold-change threshold is a TREAT." Bioinformatics 25 (6): 765–71.
  • McCarthy, D. J., K. R. Campbell, A. T. Lun, and Q. F. Wills. 2017. "Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R." Bioinformatics 33 (8): 1179–86.
  • McInnes, Leland, John Healy, and James Melville. 2018. "UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction." arXiv E-Prints, February, arXiv:1802.03426.
  • Mereu, Elisabetta, Atefeh Lafzi, Catia Moutinho, Christoph Ziegenhain, Davis J. MacCarthy, Adrian Alvarez, Eduard Batlle, et al. 2019. "Benchmarking Single-Cell Rna Sequencing Protocols for Cell Atlas Projects." bioRxiv. doi:10.1101/630087.
  • Messmer, T., F. von Meyenn, A. Savino, F. Santos, H. Mohammed, A. T. L. Lun, J. C. Marioni, and W. Reik. 2019. "Transcriptional heterogeneity in naive and primed human pluripotent stem cells at single-cell resolution." Cell Rep 26 (4): 815–24.
  • Muraro, M. J., G. Dharmadhikari, D. Grun, N. Groen, T. Dielen, E. Jansen, L. van Gurp, et al. 2016. "A Single-Cell Transcriptome Atlas of the Human Pancreas." Cell Syst 3 (4): 385–94.
  • Nazari, F., A. Parham, and A. F. Maleki. 2015. "GAPDH, -actin and -microglobulin, as three common reference genes, are not reliable for gene expression studies in equine adipose- and marrow-derived mesenchymal stem cells." J Anim Sci Technol 57: 18.
  • Nestorowa, S., F. K. Hamey, B. Pijuan Sala, E. Diamanti, M. Shepherd, E. Laurenti, N. K. Wilson, D. G. Kent, and B. Gottgens. 2016. "A single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation." Blood 128 (8): 20–31.
  • Pauken, K. E., and E. J. Wherry. 2015. "Overcoming T cell exhaustion in infection and cancer." Trends Immunol. 36 (4): 265–76.
  • Phipson, B., and G. K. Smyth. 2010. "Permutation P-values should never be zero: calculating exact P-values when permutations are randomly drawn." Stat. Appl. Genet. Mol. Biol. 9: Article 39.
  • Phipson, B., S. Lee, I. J. Majewski, W. S. Alexander, and G. K. Smyth. 2016. "Robust Hyperparameter Estimation Protects Against Hypervariable Genes and Improves Power to Detect Differential Expression." Ann. Appl. Stat. 10 (2): 946–63.
  • Picelli, S., O. R. Faridani, A. K. Bjorklund, G. Winberg, S. Sagasser, and R. Sandberg. 2014. "Full-length RNA-seq from single cells using Smart-seq2." Nat Protoc 9 (1): 171–81.
  • Pijuan-Sala, B., J. A. Griffiths, C. Guibentif, T. W. Hiscock, W. Jawaid, F. J. Calero-Nieto, C. Mulas, et al. 2019. "A Single-Cell Molecular Map of Mouse Gastrulation and Early Organogenesis." Nature 566 (7745): 490–95.
  • Pollen, A. A., T. J. Nowakowski, J. Shuga, X. Wang, A. A. Leyrat, J. H. Lui, N. Li, et al. 2014. "Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex." Nat. Biotechnol. 32 (10): 1053–8.
  • Richard, A. C., A. T. L. Lun, W. W. Y. Lau, B. Gottgens, J. C. Marioni, and G. M. Griffiths. 2018. "T cell cytolytic capacity is independent of initial stimulation strength." Nat. Immunol. 19 (8): 849–58.
  • Ritchie, M. E., B. Phipson, D. Wu, Y. Hu, C. W. Law, W. Shi, and G. K. Smyth. 2015. "limma powers differential expression analyses for RNA-sequencing and microarray studies." Nucleic Acids Res. 43 (7): e47.
  • Robinson, M. D., and A. Oshlack. 2010. "A scaling normalization method for differential expression analysis of RNA-seq data." Genome Biol. 11 (3): R25.
  • Robinson, M. D., D. J. McCarthy, and G. K. Smyth. 2010. "edgeR: a Bioconductor package for differential expression analysis of digital gene expression data." Bioinformatics 26 (1): 139–40.
  • Rosati, E., C. M. Dowds, E. Liaskou, E. K. K. Henriksen, T. H. Karlsen, and A. Franke. 2017. "Overview of methodologies for T-cell receptor repertoire analysis." BMC Biotechnol. 17 (1): 61.
  • Scialdone, A., K. N. Natarajan, L. R. Saraiva, V. Proserpio, S. A. Teichmann, O. Stegle, J. C. Marioni, and F. Buettner. 2015. "Computational assignment of cell-cycle stage from single-cell transcriptome data." Methods 85 (September): 54–61.
  • Scialdone, A., Y. Tanaka, W. Jawaid, V. Moignard, N. K. Wilson, I. C. Macaulay, J. C. Marioni, and B. Gottgens. 2016. "Resolving early mesoderm diversification through single-cell expression profiling." Nature 535 (7611): 289–93.
  • Segerstolpe, A., A. Palasantza, P. Eliasson, E. M. Andersson, A. C. Andreasson, X. Sun, S. Picelli, et al. 2016. "Single-Cell Transcriptome Profiling of Human Pancreatic Islets in Health and Type 2 Diabetes." Cell Metab. 24 (4): 593–607.
  • Shao, C., and T. Höfer. 2017. "Robust classification of single-cell transcriptome data by nonnegative matrix factorization." Bioinformatics 33 (2): 235–42.
  • Shugay, M., D. V. Bagaev, I. V. Zvyagin, R. M. Vroomans, J. C. Crawford, G. Dolton, E. A. Komech, et al. 2018. "VDJdb: a curated database of T-cell receptor sequences with known antigen specificity." Nucleic Acids Res. 46 (D1): D419–D427.
  • Simes, R. J. 1986. "An Improved Bonferroni Procedure for Multiple Tests of Significance." Biometrika 73 (3): 751–54.
  • Soneson, C., and M. D. Robinson. 2018. "Bias, robustness and scalability in single-cell differential expression analysis." Nat. Methods 15 (4): 255–61.
  • Stegle, O., S. A. Teichmann, and J. C. Marioni. 2015. "Computational and analytical challenges in single-cell transcriptomics." Nat. Rev. Genet. 16 (3): 133–45.
  • Stoeckius, M., C. Hafemeister, W. Stephenson, B. Houck-Loomis, P. K. Chattopadhyay, H. Swerdlow, R. Satija, and P. Smibert. 2017. "Simultaneous epitope and transcriptome measurement in single cells." Nat. Methods 14 (9): 865–68.
  • Stoeckius, M., S. Zheng, B. Houck-Loomis, S. Hao, B. Z. Yeung, W. M. Mauck, P. Smibert, and R. Satija. 2018. "Cell Hashing with barcoded antibodies enables multiplexing and doublet detection for single cell genomics." Genome Biol. 19 (1): 224.
  • Stoeckius, M., S. Zheng, B. Houck-Loomis, S. Hao, B. Yeung, P. Smibert, and R. Satija. 2017. "Cell 'Hashing' with Barcoded Antibodies Enables Multiplexing and Doublet Detection for Single Cell Genomics." bioRxiv. Cold Spring Harbor Laboratory. doi:10.1101/237693.
  • Street, K., D. Risso, R. B. Fletcher, D. Das, J. Ngai, N. Yosef, E. Purdom, and S. Dudoit. 2018. "Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics." BMC Genomics 19 (1): 477.
  • Tasic, B., V. Menon, T. N. Nguyen, T. K. Kim, T. Jarsky, Z. Yao, B. Levi, et al. 2016. "Adult mouse cortical cell taxonomy revealed by single cell transcriptomics." Nat. Neurosci. 19 (2): 335–46.
  • The ENCODE Project Consortium. 2012. "An integrated encyclopedia of DNA elements in the human genome." Nature 489 (7414): 57–74.
  • Tian, L., X. Dong, S. Freytag, K. A. Le Cao, S. Su, A. JalalAbadi, D. Amann-Zalcenstein, et al. 2019. "Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments." Nat. Methods 16 (6): 479–87.
  • Tung, P. Y., J. D. Blischak, C. J. Hsiao, D. A. Knowles, J. E. Burnett, J. K. Pritchard, and Y. Gilad. 2017. "Batch effects and the effective design of single-cell gene expression studies." Sci. Rep. 7 (January): 39921.
  • Van der Maaten, L., and G. Hinton. 2008. "Visualizing Data Using T-SNE." J. Mach. Learn. Res. 9 (2579-2605): 85.
  • Von Luxburg, U. 2010. "Clustering Stability: An Overview." Foundations and Trends in Machine Learning 2 (3): 235–74.
  • Wajapeyee, N., S. Z. Wang, R. W. Serra, P. D. Solomon, A. Nagarajan, X. Zhu, and M. R. Green. 2010. "Senescence induction in human fibroblasts and hematopoietic progenitors by leukemogenic fusion proteins." Blood 115 (24): 5057–60.
  • Wang, X. 2012. "A Fast Exact K-Nearest Neighbors Algorithm for High Dimensional Search Using K-Means Clustering and Triangle Inequality." Proc Int Jt Conf Neural Netw 43 (6): 2351–8.
  • Wolf, F. Alexander, Fiona Hamey, Mireya Plass, Jordi Solana, Joakim S. Dahlin, Berthold Gottgens, Nikolaus Rajewsky, Lukas Simon, and Fabian J. Theis. 2017. "Graph Abstraction Reconciles Clustering with Trajectory Inference Through a Topology Preserving Map of Single Cells." bioRxiv. Cold Spring Harbor Laboratory. doi:10.1101/208819.
  • Xu, C., and Z. Su. 2015. "Identification of cell types from single-cell transcriptomes using a novel clustering method." Bioinformatics 31 (12): 1974–80.
  • Young, M. D., and S. Behjati. 2018. "SoupX Removes Ambient RNA Contamination from Droplet Based Single Cell RNA Sequencing Data." bioRxiv.
  • Zeisel, A., A. B. Munoz-Manchado, S. Codeluppi, P. Lonnerberg, G. La Manno, A. Jureus, S. Marques, et al. 2015. "Brain structure. Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq." Science 347 (6226): 1138–42.
  • Zeng, H., E. H. Shen, J. G. Hohmann, S. W. Oh, A. Bernard, J. J. Royall, K. J. Glattfelder, et al. 2012. "Large-scale cellular-resolution gene profiling in human neocortex reveals species-specific molecular signatures." Cell 149 (2): 483–96.
  • Zheng, G. X., J. M. Terry, P. Belgrader, P. Ryvkin, Z. W. Bent, R. Wilson, S. B. Ziraldo, et al. 2017. "Massively parallel digital transcriptional profiling of single cells." Nat Commun 8 (January): 14049.
  • Ziegenhain, C., B. Vieth, S. Parekh, B. Reinius, A. Guillaumet-Adkins, M. Smets, H. Leonhardt, H. Heyn, I. Hellmann, and W. Enard. 2017. "Comparative Analysis of Single-Cell RNA Sequencing Methods." Mol. Cell 65 (4): 631–43.