An era of single-cell genomics consortia

Ando, Yoshinari; Kwon, Andrew Tae-Jun; Shin, Jay W.

doi:10.1038/s12276-020-0409-x

Download PDF

Review Article
Open access
Published: 15 September 2020

An era of single-cell genomics consortia

Experimental & Molecular Medicine volume 52, pages 1409–1418 (2020)Cite this article

6590 Accesses
11 Citations
10 Altmetric
Metrics details

Subjects

Abstract

The human body consists of 37 trillion single cells represented by over 50 organs that are stitched together to make us who we are, yet we still have very little understanding about the basic units of our body: what cell types and states make up our organs both compositionally and spatially. Previous efforts to profile a wide range of human cell types have been attempted by the FANTOM and GTEx consortia. Now, with the advancement in genomic technologies, profiling the human body at single-cell resolution is possible and will generate an unprecedented wealth of data that will accelerate basic and clinical research with tangible applications to future medicine. To date, several major organs have been profiled, but the challenges lie in ways to integrate single-cell genomics data in a meaningful way. In recent years, several consortia have begun to introduce harmonization and equity in data collection and analysis. Herein, we introduce existing and nascent single-cell genomics consortia, and present benefits to necessitate single-cell genomic consortia in a regional environment to achieve the universal human cell reference dataset.

Single-cell genomics meets human genetics

Article 21 April 2023

Fast, sensitive and accurate integration of single-cell data with Harmony

Article 18 November 2019

Impact of the Human Cell Atlas on medicine

Article 08 December 2022

Introduction

RIKEN has spearheaded international consortium efforts to establish an atlas of human promoters and enhancers through the FANTOM project. Gene expression profiling of 400+ human cell types using CAGE revealed gene-regulatory modules that define cell types and states^1,2. This comprehensive landscape resource has revealed pervasive transcription of coding and noncoding RNA in the human genome, and precise understanding of how and where genes are activated³. However, the FANTOM data were derived from bulk samples, ignoring the cellular heterogeneity that exists in both tissues and the cell culture system. Creating an atlas that maps promoters and enhancers across millions of single cells in the human body will not only reveal regulatory regions of our genome, but also gene-regulatory programs that control cell fates and pathology of genetic diseases.

Similarly, the genotype-tissue expression (GTEx) project has generated a large genomic dataset, including over 10,000 bulk RNA-seq samples representing 54 different tissues (30 organs) acquired from 948 individuals with genotype information^{4,5,6,7,8,9,10}. This rich dataset allows for linking genetic variants at gene expression levels through expression quantitative trait loci analysis (eQTL). Despite its efforts to collect a variety of tissues from a relatively large cohort of individuals, the expression profiles are based on bulk, lacking cellular heterogeneity. To circumvent this, the GTEx project has recently released a unique strategy to infer cellular heterogeneity based on gene signatures from different cell types known to be present in a given tissue. The method relies on the Tabula Muris dataset¹¹ to deconvolute the cellular composition over 6,000 additional GTEx samples corresponding to 28 tissues, and reveals tissue-specific eQTLs colocalizing with GWAS variants that were not detected in bulk, but only discovered through deconvolution strategy¹². GTEx has built an extensive and mature infrastructure to obtain fresh tissues from relatively large cohorts. It is a matter of time before the consortium combines new technologies such as single-cell RNA-seq with archival and new tissues for single-cell eQTL RNA-seq analysis¹³.

Thanks to recent technological advances, we can now profile large numbers of dissociated cells, and study the RNA transcripts, proteins, and chromatin profiles of 10–100 k individual cells at a reasonable cost (consensus approach). Moreover, we can characterize DNA sequences for reconstruction of cell lineages¹⁴, and combine these to relate different gene features to cellular identities. We can also profile multiple classes of RNA, including noncoding RNAs, enhancer RNAs¹⁵, and multiplex transcripts and proteins in situ to map cells and their molecules to their positions in histological sections (spatial approach)^16,17,18,19. Applying both consensus and spatial strategies across tissues and populations, together with advanced database infrastructure and computational tools, should allow us to define “what is normal” in cells, and provide a universal reference map of the human body.

In recent years, numerous reports demonstrating the power of single-cell genomics are prevailing, where topics include cellular ontology^20,21 and functional conservation across species²²; cell fate and lineage determinants^23,24; dynamic changes in cell states such as the cell cycle²⁵ and transient responses²⁶; molecular mechanisms that control intra- and intercellular regulatory networks^27,28; fundamental research in disease studies and pathology^29,30,31,32. In parallel, tens and hundreds of thousands of single-cell genomics data across various human tissues are leading to the discovery of new cell types and states, fundamentally changing the picture of human anatomy in multiple ways (summary of large-scale single-cell genomics data across human tissue in Table 1). Integrating our knowledge that we gained through single-cell genomics shows tremendous potential for translational discoveries and applications, and impacting diagnostic and clinical practices.

Table 1 Single-cell profiling of major human organs and tissues.

Full size table

Cells in our body can now be explained by several features, including their shape, location in a tissue, gene expression, and function in a high-throughput manner. However, we have not comprehensively determined how these features are associated with each other, and what constitutes “normal” with respect to the health status of an individual. As a result, our knowledge of the cellular makeup and relations of the human body and disease is still limited. Therefore, we need a comprehensive reference database through an integrative, systematic effort, and many teams of scientists working together to produce data that are not only consistent, high quality, and interoperable, but also driven by biology and medicine.

In the last few years, several single-cell genomics consortia have been created to address the issue of systematic data integration and harmonization. Consortia such as the Human Cell Atlas aims to bring together domain experts to consolidate single-cell data in a central portal. In parallel, several tissue-centric consortia, such as the BRAIN initiatives, aim to dive deeper into the complex nature of individual organ systems. Here, we summarize the ever-growing single-cell genomics consortia and describe their missions. We further showcase benefits from generating single-cell data in a regional and coherent manner through the formation of single-cell consortia.

Single-cell genomics consortia

In light of the enormous complexity of the human body and the rapidly evolving technology landscape, in October 2016, more than 150 international scientists met in London to launch the planning process for an ambitious new initiative: the Human Cell Atlas (HCA), an international collaboration to create comprehensive reference maps of all human cells^33,34,35,36. The HCA consortium aims to build this ambitious yet essential resource in phases, starting with cells in tissues and eventually organs and systems, with the aim of constructing an increasingly detailed, valuable, and comprehensive atlas with guiding principles for the community that includes open data sharing high-quality data, equity, ethical considerations, flexibility, international, and technology and computational innovations³⁷.

At the time of writing, hundreds of thousand single-cell data across major organ tissues, including colon, liver, immune, and developmental tissues, already populated the database³⁸. The recent meeting in Barcelona, Spain, laid out the roadmap toward building the first draft of the atlas, where the draft will clearly define not only cell types but also cell states, identify the molecular program in both dissociated and spatial contexts, and project the trajectory and relation to time. The first draft of the HCA aims to profile the molecular and spatial characteristics of cells from major tissues and systems from healthy donors, with geographic, age, and ethnic diversity in mind.

The unique nature of the HCA is forming a large global coalition of scientists that builds upon grassroot communities to work toward a common goal. The support of seed funds from philanthropic foundations and government bodies jumpstarted the project, connecting scientists all around the globe. However, with membership reaching over 1600 and over 100 research institutes represented, arriving at a single solution to create the atlas can be logistically challenging to coordinate. At the same time, the relatively flexible, community-based nature of HCA attracts motivated scientists to freely explore the field and collaboratively construct platforms that will set new standards for building an international reference for human cells.

Concurrently, several government and philanthropic-led programs are in full swing that are incentivized through defined funding structures. In 2003, a Swedish-based program called the Human Protein Atlas aimed to map all human proteins in cells, tissues, and organs using integration of various omics technologies, including antibody-based imaging and RNA-seq. Their current Tissue Atlas includes 44 normal human tissue types with 15,313 genes represented in the protein data with available antibodies³⁹. Concurrent programs include The Cell Atlas, which represents 64 cell lines with subcellular details⁴⁰, The Pathology Atlas representing 17 different cancer types⁴¹, The Brain Atlas of the human, pig, and mouse, The Blood Atlas including the secretome, and The Metabolic Atlas with over 120 curated molecular pathways. Their latest efforts to consolidate data with FANTOM5 and GTEx RNA gene expression data aim to reach a multi-omics integrated portal system^42,43. In the European Union (EU), the LifeTime initiative was recently introduced to fundamentally impact basic science across multiple fields, including developmental biology, regeneration, and stem cell biology through single-cell genomic technologies⁴⁴. With strong emphasis on disease and collaboration with industry partners, the initiative aims to synthesize novel solutions based on single-cell genomics technologies to improve human health and reduce the economic burdens of the aging population.

In the United States, the National Institutes of Health (NIH) have recently reported to support the Human Biomolecular Atlas Program (HuBMAP)⁴⁵ for 7 years. The consortium aims to develop a widely accessible framework for mapping the human body at single-cell resolution, with a strong focus on spatial molecular mapping. Unlike GTEx, HuBMAP focuses on generating single-cell data using samples from a more limited number of individuals while investing in a robust common coordinated framework to make data more integratable and communicable across various consortia.

Tissue-centric consortia

A broad profiling of the human body across many tissues will certainly be necessary to understand the holistic anatomy of our body. At the same time, the intricacies of individual organ systems and their associated diseases are unending. Several disease- or tissue-centric consortia are starting to adapt single-cell genomics to dive deeper into resolving the map of each organ system. In December 2016, the US Congress authorized $1.8 billion in funding for the Cancer Moonshot over 7 years, covering a wide range of areas, including patient engagement, drug resistance, prevention, and early detection of hereditary cancer⁴⁶. One of the flagship projects includes the human tumor cell atlas, a collaborative project to build three-dimensional atlases of cellular, morphological, and molecular features of human cancer over time. The consortium is organized with a central data coordination center working with the human tumor atlas focusing on advanced cancers, and the pre-cancer atlas (PCA) focusing on conditions that are likely to become cancer. Comparative datasets from healthy counterparts, such as the human cell atlas, may become essential to interpret the risks and severity of diseases such as cancer.

As one of the BRAIN initiative’s priority areas, the consortium aims to characterize all cell types in the nervous system at single-cell resolution⁴⁷, and to develop tools to record, mark, and manipulate neurons in the living brain. Comparing human and nonhuman primates, the group previously revealed global, regional, and cell-type-specific species expression differences in rare subpallial-derived interneurons expressing dopamine biosynthesis genes in humans⁴⁸. More recently, the group performed single-cell RNA-seq on 40,000 cells to create a high-resolution single-cell gene expression atlas of the developing human cortex⁴⁹, permitting inference of gene-regulatory networks involved in neurogenesis, evolution, and neuropsychiatric diseases. Similarly, the Allen Brain Atlas, led by the Allen Brain Institute, for many years, has driven large-scale mapping projects in the brain⁵⁰. Seeking to combine genomics with neuroanatomy by creating gene expression maps for the mouse and human brain, they recently used single-cell SMART-seq analysis to profile 50,000+ cells across the human and mouse cortex. In their recent work, they identified a highly diverse set of excitatory and inhibitory neurons that are mostly sparse, and showed high conservation in cellular programs. At the same time, the authors reported stark differences in cellular proportions, laminar distributions, gene expression, and morphology between humans and mice⁵¹.

LungMAP: The Molecular Atlas of Lung Development Program^52,53 is a NIH-funded consortium focusing on the human lung that serves as a research resource and public education tool. The consortium of four research centers, a data-coordinating center, and a human tissue repository integrates imaging, transcriptomics, and proteomics in a comprehensive data resource called BREATH. The group recently published comprehensive anatomic ontologies for lung development, comparing alveolar formation and maturation within mouse and human lung⁵⁴. Cellular ontology is an important step toward standardizing and expanding the current terminology of fetal and adult lungs as a resource for broader single-cell genomics consortia.

Other tissue-centric consortia include the Kidney Precision Medicine Program (KPMP) that aims to create a kidney tissue atlas⁵⁵, the Immunological Genome Project (ImmGen), where they recently published a matched epigenome and transcriptome analysis in 86 primary cell types spanning the mouse immune system, establishing an atlas of 512,595 active cis-regulatory elements^56,57, and the GenitoUrinary Development Molecular Anatomy Project (GUDMAP), where they focus on spatial imaging and gene expression profiling of the kidney, lower urinary tract and nociceptors (pain receptors), and the associated cell types in pain processing of the urinary and pelvic regions in mice and more recently in human samples^58,59. The list of single-cell genomics consortia is growing and is summarized in Table 2.

Table 2 List of existing and nascent single-cell genomics consortiums.

Full size table

Building a single-cell genomics consortium

Despite international efforts to integrate single-cell genomics data, such as the HCA, establishing a consortium in a local environment will benefit science in multiple ways: (1) research and clinical networks within the respective nations spark a new level of scientific collaboration that builds toward clinical and translational research; (2) physical proximity suggests easier access to samples that often leads to manageable coordination toward standardization, tissue procurement, and minimizing batch effects; (3) data from local cohorts generate appropriate genetic and environmental backgrounds with key emphases on diseases that are prevalent in the region; (4) empowers local scientists toward genomics technology and computational innovations; and (5) directly addresses local regulations and policies around ethics and data sharing. Here, we exemplify, non-exhaustively, our efforts in Japan toward building a regional single-cell genomics consortium.

Systematic workflows

The single-cell genomics consortium by nature unites scientists and clinicians from different disciplines to spur cross-disciplinary creativity while providing the necessary structure to guide the effort. To better standardize and coordinate efforts from sample collection and data production to analysis, we need to establish a systematic workflow to coordinate with clinicians and researchers across Japan, involving sample-processing standard operating protocols (SOPs), quality control (QC) metrics, central databases, analytical pipelines, and ethics and data policies (Fig. 1).

**Fig. 1: A model of single cell medical network (SC_MED) consortium.**

The single-cell genomics consortium in Japan aims to generate single-cell datasets, mostly with standardized 5′-RNA-seq technology, derived from both healthy and disease samples that will incubate within the consortium, while data generated from healthy samples will be shared with the global HCA community as early as possible. We will also incorporate HCA data, both cellular and spatial, as well as key analytical pipelines, and apply them to address biology-driven questions posed by individual biological collaborators. In parallel, we will integrate 5′-based single-cell data from multiple sample providers to explore gene regulation, focusing on promoter and enhancer activities, on a global scale, leading to the cis-regulatory atlas.

To ensure high-quality single-cell data production, the consortium created a central data center in RIKEN and a team of sample coordinators that closely interacts with individual sample providers to optimize protocols and QC for library production and sequencing. Most human specimens will come from clinical biopsies and surgical resections of living patients, and occasionally from healthy living donors, deceased transplant organ donors, and rapid autopsy from deceased donors. Therefore, maximizing sample quality early in sample collection by minimizing the time between biopsy/resection and preservation is critical. Although several reviews comparing dissociation methods have been reported^60,61,62, the “gold standard” SOP for tissue dissociation, unfortunately, is not available at this stage. When access to fresh samples is not possible, cryopreservation after cellular dissociation may maintain higher quality compared with direct cryopreservation of the whole biospecimen. Nonetheless, cryopreservation of adjacent sections or tissues will benefit by gathering pathological information and storage for future technologies, and implementing complementary methods, such as multiplexed spatial analysis, should reflect cellular compositions found through dissociation methods. To further minimize possible technical and biological variations, the consortium can provide SOPs with clear instructions, and requires comprehensive metadata (donor information, site, and time). The Human Cell Atlas relies on a central repository for SOPs (protocols.io⁶³) and systematic collection of metadata. Constant exchange of protocols and metadata with the open-source community will move toward standardization in the long run.

Performing cell sorting to enrich the desired cell type is possible, although conventional fluorescence-activated cell sorters (FACS) contain insidious chemicals and induce physical stress to cells that may alter gene expression profiles. The latest single-cell genomics platforms can profile a relatively large number of cells (~3000–5000 cells) in a single run, allowing for unbiased sampling of the cell population without FACS. When the desired cell population is rare, performing negative selection by means of bead-conjugated antibodies targeting unwanted cell populations (e.g., dead cells and CD45+ immune cells) will significantly enrich for target cells, and at the same time, minimize cellular stress. Concurrently, DNA-barcoded antibodies can be used to target specific epitopes and profile the transcriptome and target protein⁶⁴.

The consortium requires robust QC metrics that are critical to the success of downstream processes. A high proportion and appropriate number of viable cells will increase the chance of generating a high-quality dataset. Additional metrics to avoid sample mislabeling, patient data swapping, and employing robust computational QCs are implemented to ease data integration and lead to a more biologically meaningful interpretation of single-cell genomics data.

Data integration and genomic analysis

Compared with bulk data analysis, single-cell genomics data bring unique challenges in their analysis in two aspects: high dimensionality due to the sheer increase in the number of observations made, and high variability from the inherent sparsity of the data stemming from both biological variations and limited sensitivity of the current methods. Furthermore, the massive amount of data that are generated from single-cell analyses brings additional challenges in data access, management, and infrastructure. As such, we established a robust database framework to handle vigorous activities that are specifically tailored to address individual collaborators while maintaining standardization through single-cell genomics platforms, dissociation protocols, centralized databases, and experimental designs. The general outline of computational tools is described below; however, to narrow the gap between computational scientists and sample collaborators, the consortium continues to develop and implement graphic user interfaces that can be easily implemented by collaborating members.

Estimation of the gene expression levels from scRNAseq data requires careful quality control steps to remove unwanted noise from cell debris or free-floating RNA. The raw expression data that pass this QC step need to be normalized using single-cell-specific approaches, such as the use of spike-ins and modeling of cell-specific factors, as the global scaling approach used in bulk data analysis is no longer suitable [CellRanger⁶⁵ and SCATER⁶⁶]. A number of expression-level imputation approaches have been developed as well to estimate the expression values that might have been missed owing to dropout events [MAGIC⁶⁷, scIMPUTE⁶⁸, and SCRABBLE⁶⁹]. In addition, the data need to be corrected for other confounding factors such as batch effects and cell cycles [SEURAT⁷⁰, fastMNN⁷¹, SCLVM⁷², and CCREMOVER⁷³].

Once the data processing is complete, the next step is to assign identities to individual cells, which are usually from mixed populations. This step generally involves dimensional reduction and clustering of the expression data to group the cells with similar transcription profiles [PCA⁷⁴, TSNE⁷⁵, and UMAP⁷⁶]. While traditional clustering methods, such as hierarchical clustering, can be used, a number of single-cell-specific methods have been developed, and there are benchmarks [SC3⁷⁷ and DUO⁷⁸]. For those cells undergoing continuous differentiation or stimulations, trajectory inference techniques have been used to assign them onto a continuous path of changes in order to establish a temporal ordering of the cells, which is referred to as pseudotime [DIFFMAP⁷⁹, MONOCLE⁸⁰, and RNAVELOCITY⁸¹]. Finally, when discussing cell identity, one confounding factor that is especially relevant to single-cell genomics consortia is how one differentiates between cell type (stable features of a cell’s identity) and cell state (transient aspects of a cell’s status). How to firmly establish these concepts using data-driven and generalizable approaches is a discussion that is necessary within the single-cell consortium.

Once the cell types have been established, we can proceed to identify the gene signatures that are specific to each type, and make inferences about the biology behind them. The most common technique is to perform differential expression analysis among different populations of cell types or states. Due to the technical challenges imposed by the high dispersion and dropout events inherent in scRNAseq data, numerous efforts have been made to develop single-cell-specific techniques that address these issues [MAST⁸² and SCDE⁸³]. Other approaches involve the inference of gene-regulatory networks, using both existing methods developed for bulk data and newly developed single-cell-specific methods [WGCNA⁸⁴ and SCODE⁸⁵].

Finally, the ongoing efforts to generate single-cell genomics data that can be spatially resolved have brought some important advances recently, giving us a chance to not only identify the cell types but also their spatial locations in the original tissue the cells were sampled from [STAHL⁸⁶, SEQFISH¹⁸, and SLIDESEQ¹⁹]. This has important implications for single-cell genomic consortia, as it will allow us to investigate the interplay between gene-regulatory networks and physical locations of different cell types that interact with one another. For readers who would like more in-depth reviews of the current methodologies employed in this field, there are a number of comprehensive review papers and handbooks that have been published in recent years^87,88,89,90.

Perspective

Despite the generation of a growing amount of single-cell data, single-cell consortia from the US/EU can inadvertently lead to biased representation of single-cell genomics data, further exacerbating the skewness of genetic representation that was pandemic during the genomic era^91,92. For instance, a concern for lack of Asian genomes in the reference datasets is rising (e.g., 1.3% of GTEX is Asian, where 59.6% of Asians make up the world’s population⁹³). Poor representation in reference data can lead to misinterpretation of research and clinical results⁹⁴. While greater efforts are being made to represent regional/ethnic diversity in global consortia including HCA, uplifting regional research groups to lead data production is necessary, and creation of a local consortium can be one of the steps to achieve meaningful human cell reference for all. It will be imperative to work with regional research–clinical communities together with funding agencies to initiate a dialogue toward better standardization and harmonization of single-cell genomics data while maintaining a constructive relationship with global single-cell genomics communities to engage and represent all of us on the global scale.

References

Forrest, A. R. et al. A promoter-level mammalian expression atlas. Nature 507, 462–470 (2014).
Article CAS PubMed Google Scholar
Andersson, R. et al. An atlas of active enhancers across human cell types and tissues. Nature 507, 455–461 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hon, C. C. et al. An atlas of human long non-coding RNAs with accurate 5' ends. Nature 543, 199–204 (2017).
Article CAS PubMed PubMed Central Google Scholar
GTEX Consortium. The genotype-tissue expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Article CAS Google Scholar
Carithers, L. J. et al. A novel approach to high-quality postmortem tissue procurement: the GTEx Project. Biopreserv. Biobank 13, 311–319 (2015).
Article PubMed PubMed Central Google Scholar
GTEX Consortium. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Article PubMed Central CAS Google Scholar
eGTEX Project. Enhancing GTEx by bridging the gaps between genotype, gene expression, and disease. Nat. Genet. 49, 1664–1670 (2017).
Article CAS Google Scholar
Battle, A. et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
Article PubMed Google Scholar
Aguet, F. et al. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Preprint at https://www.biorxiv.org/content/10.1101/787903v1, https://doi.org/10.1101/787903 (2019).
Kim-Hellmuth, S. et al. Cell type specific genetic regulation of gene expression across human tissues. Preprint at https://www.biorxiv.org/content/10.1101/806117v2, https://doi.org/10.1101/806117 (2019).
The Tabula Muris Consortium. et al. Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris. Nature 562, 367–372 (2018).
Article CAS Google Scholar
Donovan, M. K. R., D’Antonio-Chronowska, A., D’Antonio, M. & Frazer, K. A. Cellular deconvolution of GTEx tissues powers discovery of disease and cell-type associated regulatory variants. Nat. Commun. 11, 955 (2020).
Article CAS PubMed PubMed Central Google Scholar
van der Wijst, M. G. P. et al. Single-cell RNA sequencing identifies celltype-specific cis-eQTLs and co-expression QTLs. Nat. Genet. 50, 493–497 (2018).
Article PubMed PubMed Central CAS Google Scholar
Kester, L. & van Oudenaarden, A. Single-cell transcriptomics meets lineage tracing. Cell Stem Cell 23, 166–179 (2018).
Article CAS PubMed Google Scholar
Kouno, T. et al. C1 CAGE detects transcription start sites and enhancer activity at single-cell resolution. Nat. Commun. 10, 360 (2019).
Article CAS PubMed PubMed Central Google Scholar
Chen, K. H., Boettiger, A. N., Moffitt, J. R., Wang, S. & Zhuang, X. RNA imaging. Spatially resolved, highly multiplexed RNA profiling in single cells. Science 348, aaa6090 (2015).
Article PubMed PubMed Central CAS Google Scholar
Moffitt, J. R. et al. Molecular, spatial, and functional single-cell profiling of the hypothalamic preoptic region. Science 362, eaau5324 (2018).
Article PubMed PubMed Central CAS Google Scholar
Eng, C. L. et al. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH. Nature 568, 235–239 (2019).
Article CAS PubMed PubMed Central Google Scholar
Rodriques, S. G. et al. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science 363, 1463–1467 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hou, R., Denisenko, E. & Forrest, A. R. R. scMatch: a single-cell gene expression profile annotation tool using reference datasets. Bioinformatics 35, 4688–4695 (2019).
Article CAS PubMed PubMed Central Google Scholar
Abdelaal, T. et al. A comparison of automatic cell identification methods for single-cell RNA sequencing data. Genome Biol. 20, 194 (2019).
Article PubMed PubMed Central CAS Google Scholar
Kanton, S. et al. Organoid single-cell genomic atlas uncovers human-specific features of brain development. Nature 574, 418–422 (2019).
Article CAS PubMed Google Scholar
Chen, H. et al. Single-cell trajectories reconstruction, exploration and mapping of omics data with STREAM. Nat. Commun. 10, 1903 (2019).
Article PubMed PubMed Central CAS Google Scholar
Zhou, Y. et al. Single-cell transcriptomic analyses of cell fate transitions during human cardiac reprogramming. Cell Stem Cell 25, 149–164 (2019).
Article CAS PubMed PubMed Central Google Scholar
Liu, Z. et al. Reconstructing cell cycle pseudo time-series via single-cell transcriptome data. Nat. Commun. 8, 22 (2017).
Article PubMed PubMed Central CAS Google Scholar
Shin, D., Lee, W., Lee, J. H. & Bang, D. Multiplexed single-cell RNA-seq via transient barcoding for simultaneous expression profiling of various drug perturbations. Sci. Adv. 5, eaav2249 (2019).
Article CAS PubMed PubMed Central Google Scholar
Vento-Tormo, R. et al. Single-cell reconstruction of the early maternal-fetal interface in humans. Nature 563, 347–353 (2018).
Article CAS PubMed Google Scholar
Kumar, M. P. et al. Analysis of single-cell RNA-Seq identifies cell-cell communication associated with tumor characteristics. Cell Rep. 25, 1458–1468 (2018).
Article CAS PubMed PubMed Central Google Scholar
Navin, N. E. The first five years of single-cell cancer genomics and beyond. Genome Res. 25, 1499–1507 (2015).
Article CAS PubMed PubMed Central Google Scholar
Suvà, M. L. & Tirosh, I. Single-cell rna sequencing in cancer: lessons learned and emerging challenges. Mol. Cell 75, 7–12 (2019).
Article PubMed CAS Google Scholar
Mathys, H. et al. Single-cell transcriptomic analysis of Alzheimer's disease. Nature 570, 332–337 (2019).
Article CAS PubMed PubMed Central Google Scholar
Grubman, A. et al. A single-cell atlas of entorhinal cortex from individuals with Alzheimer’s disease reveals cell-type-specific gene expression regulation. Nat. Neurosci. 22, 2087–2097 (2019).
Article CAS PubMed Google Scholar
Rozenblatt-Rosen, O., Stubbington, M. J. T., Regev, A. & Teichmann, S. A. The Human Cell Atlas: from vision to reality. Nature 550, 451–453 (2017).
Article CAS PubMed Google Scholar
Regev, A. et al. The Human Cell Atlas. Elife 6, e27041 (2017).
Article PubMed PubMed Central Google Scholar
Regev, A. et al. The Human Cell Atlas White Paper. Preprint at https://arxiv.org/abs/1810.05192 (2018).
Human Cell Atlas. https://www.humancellatlas.org.
Human Cell Atlas. https://www.humancellatlas.org/learn-more/human-cell-atlas/.
Human Cell Atlas Data Portal. https://data.humancellatlas.org.
Uhlén, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Article PubMed CAS Google Scholar
Thul, P. J. et al. A subcellular map of the human proteome. Science 356, eaal3321 (2017).
Article PubMed CAS Google Scholar
Uhlen, M. et al. A pathology atlas of the human cancer transcriptome. Science 357, eaan2507 (2017).
Article PubMed CAS Google Scholar
FANTOM5. https://fantom.gsc.riken.jp/5/.
GTEx Portal. https://gtexportal.org/home/.
The LifeTime Initiative. https://lifetime-fetflagship.eu.
HuBMAP Consortium. The human body at cellular resolution: the NIH Human Biomolecular Atlas Program. Nature 574, 187–192 (2019).
Article CAS Google Scholar
Human Tumor Atlases. https://www.cancer.gov/research/key-initiatives/moonshot-cancer-initiative/implementation/human-tumor-atlas.
The Brain Initiative. https://braininitiative.nih.gov/.
Sousa, A. M. M. et al. Molecular and cellular reorganization of neural circuits in the human lineage. Science 358, 1027–1032 (2017).
Article CAS PubMed PubMed Central Google Scholar
Polioudakis, D. et al. A single-cell transcriptomic atlas of human neocortical development during mid-gestation. Neuron 103, 785–801 (2019).
Article CAS PubMed PubMed Central Google Scholar
Allen Brain Map. https://portal.brain-map.org/.
Hodge, R. D. et al. Conserved cell types with divergent features in human versus mouse cortex. Nature 573, 61–68 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ardini-Poleske, M. E. et al. LungMAP: the molecular atlas of lung development program. Am. J. Physiol. Lung Cell. Mol. Physiol. 313, L733–L740 (2017).
Article PubMed PubMed Central CAS Google Scholar
LungMAP. https://lungmap.net/.
Pan, H., Deutsch, G. H. & Wert, S. E., Ontology Subcommittee & NHLBI Molecular Atlas of Lung Development Program Consortium. Comprehensive anatomic ontologies for lung development: a comparison of alveolar formation and maturation within mouse and human lung. J. Biomed. Semant. 10, 18 (2019).
Article Google Scholar
Kidney Precision Medicine Project. https://www.niddk.nih.gov/research-funding/research-programs/kidney-precision-medicine-project-kpmp.
Yoshida, H. et al. The cis-regulatory atlas of the mouse immune system. Cell 176, 897–912 (2019).
Article CAS PubMed PubMed Central Google Scholar
The Immunological Genome Project. http://www.immgen.org/.
McMahon, A. P. et al. GUDMAP: the genitourinary developmental molecular anatomy project. J. Am. Soc. Nephrol. 19, 667–671 (2008).
Article PubMed Google Scholar
Harding, S. D. et al. The GUDMAP database—an online resource for genitourinary research. Development 138, 2845–2853 (2011).
Article CAS PubMed PubMed Central Google Scholar
Vieira Braga, F. A. & Miragaia, R. J. Tissue handling and dissociation for single-cell RNA-seq. Methods Mol. Biol. 1979, 9–21 (2019).
Article PubMed CAS Google Scholar
O’Flanagan, C. H. et al. Dissociation of solid tumor tissues with cold active protease for single-cell RNA-seq minimizes conserved collagenase-associated stress responses. Genome Biol. 20, 210 (2019).
Article PubMed PubMed Central CAS Google Scholar
Denisenko, E. et al. Systematic assessment of tissue dissociation and storage biases in single-cell and single-nucleus RNA-seq workflows. Preprint at https://www.biorxiv.org/content/10.1101/832444v2, https://doi.org/10.1101/832444 (2019).
Human Cell Atlas Method Development Community. https://www.protocols.io/groups/hca.
Stoeckius, M. et al. Simultaneous epitope and transcriptome measurement in single cells. Nat. Methods 14, 865–868 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zheng, G. X. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049 (2017).
Article CAS PubMed PubMed Central Google Scholar
McCarthy, D. J., Campbell, K. R., Lun, A. T. & Wills, Q. F. Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R. Bioinformatics 33, 1179–1186 (2017).
CAS PubMed PubMed Central Google Scholar
van Dijk, D. et al. Recovering gene interactions from single-cell data using data diffusion. Cell 174, 716–729 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, W. V. & Li, J. J. An accurate and robust imputation method scImpute for single-cell RNA-seq data. Nat. Commun. 9, 997 (2018).
Article PubMed PubMed Central CAS Google Scholar
Peng, T., Zhu, Q., Yin, P. & Tan, K. SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data. Genome Biol. 20, 88 (2019).
Article PubMed PubMed Central Google Scholar
Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420 (2018).
Article CAS PubMed PubMed Central Google Scholar
Haghverdi, L., Lun, A. T. L., Morgan, M. D. & Marioni, J. C. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat. Biotechnol. 36, 421–427 (2018).
Article CAS PubMed PubMed Central Google Scholar
Buettner, F. et al. Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells. Nat. Biotechnol. 33, 155–160 (2015).
Article CAS PubMed Google Scholar
Barron, M. & Li, J. Identifying and removing the cell-cycle effect from single-cell RNA-Sequencing data. Sci. Rep. 6, 33892 (2016).
Article CAS PubMed PubMed Central Google Scholar
Meng, C. et al. Dimension reduction techniques for the integrative analysis of multi-omics data. Brief. Bioinform. 17, 628–641 (2016).
Article CAS PubMed PubMed Central Google Scholar
van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Google Scholar
McInnes, L., Healy, J. & Melville, J. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. Preprint at https://arxiv.org/abs/1802.03426 (2018).
Kiselev, V. Y. et al. SC3: consensus clustering of single-cell RNA-seq data. Nat. Methods 14, 483–486 (2017).
Article CAS PubMed PubMed Central Google Scholar
Duò, A., Robinson, M. D. & Soneson, C. A systematic performance evaluation of clustering methods for single-cell RNA-seq data. F1000Res 7, 1141 (2018).
Article PubMed CAS Google Scholar
Haghverdi, L., Buettner, F. & Theis, F. J. Diffusion maps for high-dimensional single-cell analysis of differentiation data. Bioinformatics 31, 2989–2998 (2015).
Article CAS PubMed Google Scholar
Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 32, 381–386 (2014).
Article CAS PubMed PubMed Central Google Scholar
La Manno, G. et al. RNA velocity of single cells. Nature 560, 494–498 (2018).
Article PubMed PubMed Central CAS Google Scholar
Finak, G. et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 16, 278 (2015).
Article PubMed PubMed Central CAS Google Scholar
Kharchenko, P. V., Silberstein, L. & Scadden, D. T. Bayesian approach to single-cell differential expression analysis. Nat. Methods 11, 740–742 (2014).
Article CAS PubMed PubMed Central Google Scholar
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 9, 559 (2008).
Article CAS Google Scholar
Matsumoto, H. et al. SCODE: an efficient regulatory network inference algorithm from single-cell RNA-Seq during differentiation. Bioinformatics 33, 2314–2321 (2017).
Article PubMed PubMed Central CAS Google Scholar
Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
Article PubMed CAS Google Scholar
Hon, C. C., Shin, J. W., Carninci, P. & Stubbington, M. J. T. The human cell atlas: technical approaches and challenges. Brief Funct. Genom. 17, 283–294 (2018).
Article CAS Google Scholar
Amezquita, R. A. et al. Orchestrating single-cell analysis with bioconductor. Nat. Methods 17, 137–145 (2020).
Article CAS PubMed Google Scholar
Vieth, B., Parekh, S., Ziegenhain, C., Enard, W. & Hellmann, I. A systematic evaluation of single cell RNA-seq analysis pipelines. Nat. Commun. 10, 4667 (2019).
Article PubMed PubMed Central CAS Google Scholar
Luecken, M. D. & Theis, F. J. Current best practices in single-cell RNA-seq analysis: a tutorial. Mol. Syst. Biol. 15, e8746 (2019).
Article PubMed PubMed Central Google Scholar
Devaney, S. All of Us. Nature 576, S14–S17 (2019).
Article Google Scholar
Lappalainen, T., Scott, A. J., Brandt, M. & Hall, I. M. Genomic analysis in the age of human genome sequencing. Cell 177, 70–84 (2019).
Article CAS PubMed PubMed Central Google Scholar
GTEx Analysis Release V8. https://gtexportal.org/home/tissueSummaryPage.
GenomeAsia100K. https://genomeasia100k.org.
Villani, A. C. et al. Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors. Science 356, eaah4573 (2017).
Article PubMed PubMed Central CAS Google Scholar
Buenrostro, J. D. et al. Integrated single-cell analysis maps the continuous regulatory landscape of human hematopoietic differentiation. Cell 173, 1535–1548 (2018).
Article CAS PubMed PubMed Central Google Scholar
Papalexi, E. & Satija, R. Single-cell RNA sequencing to explore immune cell heterogeneity. Nat. Rev. Immunol. 18, 35–45 (2018).
Article CAS PubMed Google Scholar
Lake, B. B. et al. Integrative single-cell analysis of transcriptional and epigenetic states in the human adult brain. Nat. Biotechnol. 36, 70–80 (2018).
Article CAS PubMed Google Scholar
Velmeshev, D. et al. Single-cell genomics identifies cell type-specific molecular changes in autism. Science 364, 685–689 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lake, B. B. et al. A single-nucleus RNA-sequencing pipeline to decipher the molecular anatomy and pathophysiology of human kidneys. Nat. Commun. 10, 2832 (2019).
Article PubMed PubMed Central CAS Google Scholar
Arazi, A. et al. The immune cell landscape in kidneys of patients with lupus nephritis. Nat. Immunol. 20, 902–914 (2019).
Article CAS PubMed PubMed Central Google Scholar
Stewart, B. J. et al. Spatiotemporal immune zonation of the human kidney. Science 365, 1461–1466 (2019).
Article CAS PubMed PubMed Central Google Scholar
Schiller, H. B. et al. The Human Lung Cell Atlas: a high-resolution reference map of the human lung in health and disease. Am. J. Respir. Cell Mol. Biol. 61, 31–41 (2019).
Article CAS PubMed PubMed Central Google Scholar
Vieira Braga, F. A. et al. A cellular census of human lungs identifies novel cell states in health and in asthma. Nat. Med 25, 1153–1163 (2019).
Article CAS PubMed Google Scholar
Reyfman, P. A. et al. Single-cell transcriptomic analysis of human lung provides insights into the pathobiology of pulmonary fibrosis. Am. J. Respir. Crit. Care Med. 199, 1517–1536 (2019).
Article CAS PubMed PubMed Central Google Scholar
Plasschaert, L. W. et al. A single-cell atlas of the airway epithelium reveals the CFTR-rich pulmonary ionocyte. Nature 560, 377–381 (2018).
Article CAS PubMed PubMed Central Google Scholar
Aizarani, N. et al. A human liver cell atlas reveals heterogeneity and epithelial progenitors. Nature 572, 199–204 (2019).
Article CAS PubMed PubMed Central Google Scholar
Muraro, M. J. et al. A single-cell transcriptome atlas of the human pancreas. Cell Syst. 3, 385–394 (2016).
Article CAS PubMed PubMed Central Google Scholar
Smillie, C. S. et al. Intra- and inter-cellular rewiring of the human colon during ulcerative colitis. Cell 178, 714–730 (2019).
Article CAS PubMed PubMed Central Google Scholar
Martin, J. C. et al. Single-cell analysis of crohn's disease lesions identifies a pathogenic cellular module associated with resistance to anti-TNF therapy. Cell 178, 1493–1508 (2019).
Article CAS PubMed PubMed Central Google Scholar
See, K. et al. Single cardiomyocyte nuclear transcriptomes reveal a lincRNA-regulated de-differentiation and cell cycle stress-response in vivo. Nat. Commun. 8, 225 (2017).
Article PubMed PubMed Central CAS Google Scholar
Menon, R. et al. Single-cell analysis of progenitor cell dynamics and lineage specification in the human fetal kidney. Development 145, dev164038 (2018).
Article PubMed PubMed Central CAS Google Scholar
Popescu, D. M. et al. Decoding human fetal liver haematopoiesis. Nature 574, 365–371 (2019).
Article CAS PubMed PubMed Central Google Scholar
Suryawanshi, H. et al. Cell atlas of the fetal human heart and implications for autoimmune-mediated congenital heart block. Cardiovasc Res. https://doi.org/10.1093/cvr/cvz257 (in press, 2019).
Taylor, D. M. et al. The pediatric cell atlas: defining the growth phase of human development at single-cell resolution. Dev. Cell 49, 10–29 (2019).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, Q. H. et al. Profiling human breast epithelial cells using single cell RNA sequencing identifies cell diversity. Nat. Commun. 9, 2028 (2018).
Article PubMed PubMed Central CAS Google Scholar
Guo, J. et al. The adult human testis transcriptional cell atlas. Cell Res. 28, 1141–1157 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lukowski, S. W. et al. A single-cell transcriptome atlas of the adult human retina. EMBO J. 38, e100811 (2019).
Article PubMed PubMed Central CAS Google Scholar
Menon, M. et al. Single-cell transcriptomic atlas of the human retina identifies cell types associated with age-related macular degeneration. Nat. Commun. 10, 4902 (2019).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

To the coordinating members of the single-cell genomics consortium in Japan, specifically Piero Carninci, Chung Chau Hon, Miho Itoh, Massaki Furuno, and Takeya Kasukawa. Funding support was received from the Ministry of Education Science and Technology (MEXT) to RIKEN IMS.

Author information

Authors and Affiliations

RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-Cho, Tsurumi-Ku, Yokohama, 230-0045, Japan
Yoshinari Ando, Andrew Tae-Jun Kwon & Jay W. Shin

Authors

Yoshinari Ando
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Tae-Jun Kwon
View author publications
You can also search for this author in PubMed Google Scholar
Jay W. Shin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jay W. Shin.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ando, Y., Kwon, A.TJ. & Shin, J.W. An era of single-cell genomics consortia. Exp Mol Med 52, 1409–1418 (2020). https://doi.org/10.1038/s12276-020-0409-x

Download citation

Received: 13 December 2019
Revised: 24 January 2020
Accepted: 10 February 2020
Published: 15 September 2020
Issue Date: September 2020
DOI: https://doi.org/10.1038/s12276-020-0409-x

This article is cited by

Single-cell analysis of salt-induced hypertensive mouse aortae reveals cellular heterogeneity and state changes
- Ka Zhang
- Hao Kan
- Xin Ma
Experimental & Molecular Medicine (2021)
Machine learning methods to model multicellular complexity and tissue specificity
- Rachel S. G. Sealfon
- Aaron K. Wong
- Olga G. Troyanskaya
Nature Reviews Materials (2021)
Single-cell genomics technology: perspectives
- Tae Hee Hong
- Woong-Yang Park
Experimental & Molecular Medicine (2020)