CAPITAL: a major advance in single-cell RNA data analysis

Researchers from Osaka University have developed a computational tool called CAPITAL that can carry out accurate comparative analysis of complex single-cell sequencing datasets.
CAPITAL: a major advance in single-cell RNA data analysis
Like

New developments in high-throughput biological studies mean that the genes that are active in just a single cell can now be determined. However, analyzing the complex datasets that result can be challenging. Now, a team at Osaka University has developed CAPITAL, a new computational tool for comparing complex datasets from single cells (Fig. 1).

Overview of CAPITAL: an algorithm for comparing pseudotime trajectories with branches
Fig. 1 Overview of CAPITAL: an algorithm for comparing pseudotime trajectories with branches.

RNA sequencing provides information on the subset of the entire population of genes that are actively being expressed, or are “switched on.” As technology has advanced, it has become possible to sequence the RNA population of just a single cell. This can provide a great deal of information on the specific changes in gene expression involved when a large population of mixed cells undergoes dynamic, transitional processes, such as differentiation or cell death, because each individual cell can be specifically analyzed rather than all the different cell types being pooled.

CAPITAL is specifically designed to compare complex datasets from single cells undergoing transitional processes. These analyses are carried out by defining a “pseudotime trajectory,” which places the cells along a hypothetical path that reflects their progress through the transitional process. These trajectories are not always straightforward and linear; they can become very complex and branching. In the past, only linear trajectories could be aligned for comparison [1, 2], but the team’s innovation means that complex branching trajectories can now be accurately and automatically aligned and compared.

After developing the algorithm used for CAPITAL, which implements a method known as tree alignment [3], they tested it on both synthetic datasets and authentic datasets from bone marrow cells [4-6]. The results demonstrated that CAPITAL is statistically more accurate and robust than the computational algorithms that existed previously [7-9], showing major advances over these methods (Fig. 2).

CAPITAL is statistically better than data integration methods in trajectory conservation on 2,278 pairs of synthetic datasets with multiple branches
Fig. 2 CAPITAL is statistically better than data integration methods in trajectory conservation on 2,278 pairs of synthetic datasets with multiple branches.

Trajectory comparison is a powerful analysis that can, as an example, identify the gene expression dynamics between different species to provide information on evolutionary processes. “We showed in this study that CAPITAL can reveal the existence of different molecular patterns between humans and mice even when some of the expression patterns are similar and appear to be conserved”, says lead author Reiichi Sugihara (Fig. 3). “This will allow the identification of novel regulators that determine cell fates”. This technology is not limited to just this type of data, as senior author Yuki Kato explains: “Our novel computational tool can be applied to a wide range of high-throughput datasets, including pseudotemporal, spatial, and epigenetic data”.

Expression dynamics of gene CSF1 along pseudotime from hematopoietic stem cell to erythrocyte between human and mouse from bone marrow cells data
Fig. 3 Expression dynamics of gene CSF1 along pseudotime from hematopoietic stem cell to erythrocyte between human and mouse from bone marrow cells data.

This powerful new technique will allow the global comparison of single-cell trajectories, which may lead to the identification of novel disease-associated genes that could not be identified by earlier comparative methods. Thus, CAPITAL represents a significant advance in the field of single-cell biology.

References

  1. Alpert, A., Moore, L. S., Dubovik, T. & Shen-Orr, S. S. Alignment of single-cell trajectories to compare cellular expression dynamics. Nat. Methods 15, 267–270 (2018).
  2. Cacchiarelli, D. et al. Aligning single-cell developmental and reprogramming trajectories identifies molecular determinants of myogenic reprogramming outcome. Cell Syst. 7, 258–268 (2018).
  3. Jiang, T., Wang, L. & Zhang, K. Alignment of trees—an alternative to tree edit. Theor. Comput. Sci. 143, 137–148 (1995).
  4. Setty, M. et al. Characterization of cell fate probabilities in single-cell data with Palantir. Nat. Biotechnol. 37, 451–460 (2019).
  5. Velten, L. et al. Human haematopoietic stem cell lineage commitment is a continuous process. Nat. Cell Biol. 19, 271–281 (2017).
  6. Paul, F. et al. Transcriptional heterogeneity and lineage commitment in myeloid progenitors. Cell 163, 1663–1677 (2015).
  7. Lopez, R., Regier, J., Cole, M. B., Jordan, M. I. & Yosef, N. Deep generative modeling for single-cell transcriptomics. Nat. Methods 15, 1053–1058 (2018).
  8. Hie, B., Bryson, B. & Berger, B. Efficient integration of heterogeneous single-cell transcriptomes using Scanorama. Nat. Biotechnol. 37, 685–691 (2019).
  9. Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587 (2021).

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Subscribe to the Topic

Biotechnology
Life Sciences > Biological Sciences > Biotechnology

Related Collections

With collections, you can get published faster and increase your visibility.

Applied Sciences

This collection highlights research and commentary in applied science. The range of topics is large, spanning all scientific disciplines, with the unifying factor being the goal to turn scientific knowledge into positive benefits for society.

Publishing Model: Open Access

Deadline: Ongoing