2. scRNA-Seq—A Tool to Study Host–Pathogen Interactions and Biosignatures in TB
Pulmonary TB, caused by infection with
Mtb, is primarily identified as a lung disease wherein the bacilli are able to evade the host immune system in ~5–7% of individuals and cause extensive inflammation and pathology
[24][49]. In 90% of the infected individuals, the infection is controlled by the host immune response, and the mycobacterial replication is contained within granulomatous structures
[25][50]. This results in LTBI that can either last for the lifetime of the individual or it can reactivate in an immunosuppressed condition, such as HIV co-infection
[26][51]. The interaction of the immune cells and complex intracellular bacterial pathogens, such as
Mtb, produces multidimensional interactions. Transcriptional profiling at the single-cell level within the dynamic granulomatous structure provides an opportunity to understand the infection as a consolidated process between the host and pathogen (
Figure 1). Indeed, heterogeneity is integral to the interactions between intracellular bacteria and host immune cells such as macrophages
[27][52]. Additionally, there is evidence that variation in the host and pathogen transcriptional programs causes differential infection outcomes
[28][29][53,54]. Further, each granuloma within an individual differs substantially in its composition, inflammatory profile, size, and bacterial ecology
[10]. Cellular compositions and cell-to-cell signaling networks have been identified using high-throughput scRNA-seq within TB lung granulomas in the cynomolgus macaque model of TB
[30][55]. Granulomas with persistent bacteria were characterized as having TH2 immunity-based signaling among mast, endothelial, fibroblast, and plasma cells. Granulomas demonstrating bacterial control appeared enriched in cytotoxic T cells signaling via TH1, TH17 immunity
[30][55]. In a similar approach to studying heterogeneity in the cellular composition of leprosy granulomas, scRNA-seq was utilized to perform Uniform Manifold Approximation and Projection (UMAP) dimensionality reduction and cell clustering
[31][56]. This was followed by differential expression analysis to find the cluster markers that defined signature genes. This ultimately resulted in the identification of 66 key antimicrobial genes expressed by T cells, macrophages, keratinocytes, and fibroblasts that have been reported to participate in the immune response to mycobacterial infections
[31][56]. By using scRNA-seq in combination with other technologies, both these studies were able to demonstrate the participation of the non-traditional immune cells in a spatial model of mycobacterial infection. Future studies must further elucidate these observations to outline the mechanisms involved in the HIV-1-mediated disruption of granulomas in LTBI.
Figure 1. Applications of scRNA-seq in TB pathogenesis and diagnostics. (1) Lung or bronchoalveolar lavage fluid is collected from infected patients. (2) Single-cell suspension prepared from the sample is subjected to (3) barcoded library construction. (4) The cDNA libraries undergo next-generation sequencing that can be analyzed using different visualization tools. Data generated using scRNA-seq techniques can be used to study cellular heterogeneity in granulomas and identify biomarkers for vaccines and therapeutics in diagnostics and precision medicine. The figure was made with help of Biorender.
Despite advances in understanding the host–pathogen interactions in TB, the lack of biomarkers to distinguish active TB from LTBI remains a major challenge for TB control. Individual immune cell subsets have been delineated using scRNA-seq on peripheral blood mononuclear cells (PBMCs) from healthy controls (HC), LTBI, and TB patients using an unbiased, surface-marker-free approach
[32][57]. Identifying the nature and function of these circulating cell subsets provided a useful framework for examining TB disease progression. It also identified new cell type markers in PBMC as well as an altered T-cell subsets distribution in TB. A CD3-CD7+GZMB+ subset was found to be significantly decreased in patients with TB, compared to HC and LTBI controls, suggesting that this could serve as a distinguishing marker between TB and LTBI
[32][57]. In another study, Xu et al. combined expression profiling by array and scRNA-seq to profile three key immune-related hub genes: ADM, IFIT3, and SERPING1, with the potential to discriminate TB from LTBI and HC
[33][26]. In addition to identifying potential biomarkers to predict TB, NGS has also contributed to the characterization of certain circulating cell subsets involved in immune dysregulation in HIV-1 and TB co-infection
[9]. Comparison of the transcriptome of PBMCs from patients with HIV-1 infection alone and those with HIV-1-TB co-infection revealed a higher proportion of inflammatory CD14+CD16+ monocyte subset in HIV/TB cohort. Thus, peripheral monocyte subsets might serve as a discriminating biomarker for the diagnosis of HIV/TB co-infection. Additionally, a biosignature comprising
RAB20 and
INSL3 genes in the peripheral blood was identified using scRNA-seq technology to accurately classify TB status among advanced HIV patients in two geographically distinct cohorts
[34][58]. Yet, another study successfully identified that intermediate and classical monocytes contribute to the blood immune signatures of ATB in humans using scRNA-seq profiling
[35][59].
Though scRNA-seq has proven to be instrumental in addressing cell-to-cell heterogeneity, it is imperative to consider this variability as a paradoxical phenomenon. The discovery of a pool of potential candidate biomarkers comes with a deluge of false positives. Interpreting the candidate genes requires careful functional considerations that can add another layer of complex statistical analyses. It is also important to consider that TB manifests itself at the proteome and whole organ levels, not just the transcriptome level. Hence, future NGS studies must aim to consolidate the biomarkers from the periphery and local
[36][60] immune responses to present a comprehensive biosignature of TB progression in the setting of HIV co-infection.
3. Predicting Disease Progression in TB and HIV Using scRNA-Seq
Infections are dynamic processes in which the outcomes are dictated by the complex interactions of the pathogen and the multiple biological factors of the host. scRNA-seq allows a deeper understanding of the human cellular landscape by studying cellular behavior at a higher resolution. The early diagnosis of LTBI and screening of those at a higher risk of HIV co-infection is imperative for TB prevention programs
[37][105]. A simultaneous epitope and transcriptome measurement, using Cellular Indexing of Transcriptomes and Epitopes (CITE-seq), identified a T-cell state associated with TB progression that responded to ex vivo
Mtb peptide stimulation
[38][106]. The study was based on the fact that TB progression was largely influenced by host memory T cells, which had been shown previously. However, prior studies have failed to comprehensively describe the landscape of the progression-associated memory T cells in TB
[39][40][107,108]. Most of the significant differences resided in a rare multimodally defined T
H17 subset (C-12). The C-12’s abundance in patients that had recovered from TB indicated either a prior TB disease or susceptibility to TB disease progression. Additionally, mutations in the RAR-related orphan receptor C (RORC), that are highly expressed in C-12, were shown to increase susceptibility to TB
[38][41][106,109]. The study provides a resource to enable further studies of the different memory T cell phenotypes that could serve as biomarkers for progression to TB in the setting of HIV co-infection.
In a different approach by Ben-Moshe et al., scRNA-seq of human peripheral blood cells infected with
Salmonella was used to develop a deconvoluted algorithm to study three cohorts of TB patients at different stages of the disease
[42][110]. The trained algorithm predicted the power of the monocyte-infection-induced signature in classifying the LTBI individuals at a higher risk of developing active TB. The study highlights the role of global infection dynamics, such as the monocyte-infection-induced state in
Salmonella and TB. The approach can be applied to additional co-morbid pathogens, such as HIV, and at additional time points in order to train the deconvolution further to differentiate the infection-induced states and infectious outcomes. Early interleukin-1 (IL-1) signaling enrichment using scRNA-seq has been implicated as an important pathway in response to
Mtb stimulation in human peripheral blood cells
[43][111]. The study provided a detailed analysis of the significance of cell-type specificity compared to pathogen-specificity and its role in influencing gene expression in TB. Concomitantly, elevated levels of monocytic IL-1b upon TLR-2 and TLR-4 stimulation have been shown to be associated with reduced odds of TB recurrence in ART-treated patients
[44][112].
Innovative single-cell technologies indicate efforts to shift from traditional therapy in TB to precision medicine and host-directed therapeutics. scRNA-seq allows the simultaneous analysis of the host and pathogen transcriptomics on a wide array of samples ranging from primary cell cultures to animal tissue biopsies or blood from the human host. Future studies could focus on utilizing single-cell technology in combination with spatial transcriptomics to understand the impact of gut microbiota, especially in the context of HIV co-infection in humans.
HIV acquisition and the clinical outcomes of HIV-1 infection are highly variable among individuals. Genetic factors, socio-economic backgrounds, and complex interactions between the virus and the host determine them. Besides the genetic variation of major host factors, such as human leukocyte antigen receptor (HLA) and HIV co-receptor CCR5, polymorphisms at several loci affect the outcomes of HIV infection as well as the pharmacokinetics of anti-HIV drugs (reviewed in reference
[45][113]). Nevertheless, even among the most highly exposed individuals, a fraction remains HIV-negative without any known genetic association
[46][47][114,115]. Similarly, important differences in the natural course of HIV infection (such as time from infection to AIDS diagnosis and opportunistic infections or malignancies) were only partially explained by known variables such as genetic polymorphism, age, and co-morbidities
[48][116]. Single-cell transcriptome studies, performed in patient-derived cells and in vitro infected primary cells and cell lines, have significantly contributed to understanding the host and virus variability that may alter the overall clinical course of infection (
Table 1;
Figure 2).
Figure 2. Multi-omics approach to understanding heterogeneity in HIV infection. (1) and (2) Polymorphonuclear cells, collected from HIV-infected untreated and HIV-infected cART-treated individuals, are subjected to multi-omics (3) including transcriptomics, proteomics, genomics, metabolomics, and epigenomics. The data generated can enable (4) an understanding of the transcriptionally active regions in productive infection and (5) help identify sites of therapeutic interventions. The figure was made with help of Biorender.
CD4+ T cells are the main target of HIV infection. However, CD4+ T cells are not equally permissive to infection, varying between individuals and across cells isolated from the same individual
[49][50][51][52][29,117,118,119]. Combining scRNA-seq and cell surface marker expression using flow cytometry, Rato et al. defined the ‘HIV-permissive cell’. Their data showed that the transcriptional heterogeneity at the single-cell level was contributed to mainly by the varying degrees of response to T-cell receptor (TCR) stimulation, allowing them to partition cell populations into high and low HIV-permissive subsets
[53][120]. The scRNA-seq has been used to determine the cellular transcriptome changes induced by active HIV replication in macrophages
[54][121], the multicellular immune responses during hyperacute HIV infection
[55][30], the poor restoration of T cell function after ART in HIV-infected patients
[56][57][122,123], immune exhaustion in chronic HIV infection
[58][124], immune reconstitution failure in highly active antiretroviral therapy (HAART)-treated patients
[59][125], and the collective effects of drug abuse and SIV infection in microglia and brain macrophages from SIV-infected rhesus monkeys
[60][126]. All these contribute to our understanding of HIV progression by itself and make educated predictions of outcomes in the setting of
Mtb co-infection.
Table 1.
HIV acquisition and the clinical outcomes of HIV-1 infection.
Wang et al.
[58][124] investigated the role of HIV-1 infection on the role of immune cell exhaustion at the transcriptomic level in HIV disease progression using scRNA-seq. Three unique clusters of exhausted CD4+ and CD8+ T cells and interferon-responsive CD8+ T cells were identified in the PBMCs from HIV-infected donors compared to healthy donors. KLRG1+ expressing exhausted CD8+ T cell population was also identified as a critical disease progression marker that could serve to be an immunotherapy target for chronic HIV infection treatment. scRNA-seq has facilitated the identification of rare immune cell subsets that drive immune activation and neuronal damage during HIV infection in the central nervous system
[70][128]. Using scRNA-seq on cerebrospinal fluid and blood from adults with and without HIV, Farhadian et al. identified a rare subset of myeloid cells associated with neurodegenerative-disease-associated microglia
[70][128]. The authors were able to outline a potential mechanistic link between neuronal injury in HIV and the associated progression of neurodegenerative conditions.
Kazer et al. performed scRNA-seq on PBMCs of four untreated individuals before, and longitudinally during, acute HIV infection to discover the gene responses that vary by time and cell subset in HIV infection
[55][30]. They successfully identified gene expression responses that were concealed in bulk analysis: proinflammatory T cell differentiation, prolonged major histocompatibility complex II upregulation, and persistent natural killer (NK) cell cytolytic killing. The study provided a unified framework to study coordinated dynamic cellular responses in the early stages of HIV. This has significant implications in
Mtb/HIV co-infection since LTBI reactivation is often associated with immune dysregulation in the very early phase of HIV co-infection.
Overall, single-cell technologies have contributed significantly to our understanding of cell subset diversity and individual cell heterogeneity in both TB and HIV. It has facilitated the identification of novel cell subsets and their molecular and functional characteristics during the infection process that can open the door for therapeutic and vaccine development strategies.
4. Implications of scRNA-Seq in Animal Models of TB and HIV
The nonhuman primate (NHP) model remains the most accurate animal model that recapitulates the entire spectrum of TB as seen in humans
[71][72][129,130]. Rhesus macaque (Macaca mulatta) is the most widely used model amongst all the NHP species. This is due to their social adaptability and genetic diversity, which makes them adept at investigating intra-specific variation
[73][131]. The model is highly tractable and can be utilized to study latent TB and LTBI reactivation and can be co-infected with Simian Immunodeficiency Virus (SIV)
[13][18][19][74][13,18,19,132]. Cataloging the diverse cellular architecture of the primate lung in LTBI and active TB is critical in understanding human disease progression. The multi-omics approach provides an atlas that serves as an open resource for identifying novel targets for disease interventions. scRNA-seq on dematricized single-cells from lungs of active-pulmonary-TB-infected rhesus macaques identified an influx of plasmacytoid dendritic cells (pDCs), the interferon (IFN)-responsive macrophage population, and activated T-cell responses
[75][133]. On the other hand, LTBI lungs were characterized by a CD27+ natural killer cell subset. The study outlined the presence of unique immune populations associated with either TB control or dissemination, which could be targeted for therapeutics or vaccines in the future. Several studies have also attempted to generate an NHP cell atlas using scRNA-seq to provide a catalog of features that can be used to study human disease
[76][77][134,135]. These studies have mapped the receptor and co-receptors for viruses causing human infectious diseases, then intersected the data with human genetic disease orthologues for translational associations.
A novel approach combined bacterial fitness fluorescent reporter strains with scRNA-seq to dissect the functional heterogeneity of
Mtb-infected alveolar and interstitial macrophages in a mouse model
[78][136]. The approach showed that, while the local lung macrophage population was epigenetically constrained in response to the infection, an inter-species comparison confirmed the conserved nature of the macrophage subsets between mice and humans. Such a multimodal scRNA-seq approach has important implications for understanding the regulation of host cells before and during TB/HIV co-infection. Akter et al. employed scRNA-seq on
Mtb-infected murine lungs to identify the enrichment of a type I Interferon (IFN) signature in the lymphocytes and a higher expression of Ly6A on the surface of activated lymphocytes
[36][60]. The results are in concordance with the human TB signatures
[79][137] and identify Ly6A as a critical marker of T-cell activation in TB. The dynamic changes in T-cell subsets during TB infection, measured using scRNA-seq, revealed a novel signature for exhausted CD4+ (
H1FX,
ZFP36L2,
VIM,
PPP1R15A) and CD8+ T cells (
ITM2C)
[80][138]. T-cell exhaustion has important implications in immune deficiency, exacerbation of acute TB, and CD8+ T cell function. The identification of different gene signatures and pathways associated with exhausted T-cell subsets can potentially facilitate a comprehensive understanding of the role of these subsets in
Mtb pathogenesis in the setting of HIV co-infection.
Similar to TB, the diverse and complex pathogenesis of HIV-1 remains a challenge in developing efficacious vaccines and cure strategies. The absence of treatment results in a highly variable disease progression with considerable heterogeneity in the phenotype of infected immune cells and their functions
[81][139]. The heterogeneous nature of the HIV-infected cells also makes it difficult to target and treat the latent reservoirs
[62][28]. The single-cell approach allows the study of the diverse mechanisms associated with HIV latency, including transcriptional repression and post-transcriptional blocks
[82][140]. The effects of chronic HIV infection, including the persistence of viral reservoirs and neuropathogenesis, can be mimicked using Simian Immunodeficiency Virus (SIV) in an infected rhesus macaque model. The transcriptome of brain myeloid cells in cART-treated versus untreated macaques was examined using scRNA-seq. The results demonstrated that the virus induced changes in the brain microglia and that cART restored the homeostatic state of microglia like the uninfected control
[83][141]. The findings provide a critical perspective on the role of microglia/macrophages as key targets of the SIV infection of the central nervous system. This finding has direct implications in elucidating the underlying mechanisms of alteration of the clinical signs and delays in the diagnosis of neurotuberculosis, the most devastating extrapulmonary form of TB in the setting of HIV infection. The differences in the neonatal versus adult responses to the HIV-1 envelope were demonstrated using a macaque model
[84][142]. Single-cell transcriptomic analyses showed that neonatal macaques had a decreased immunosuppression transcriptome signature compared to adult macaques following HIV envelope immunization. Additionally, the neonates had a high transcriptome profile of
BCL2 in T cells and decreased
IL10RA transcripts in T, B, and NK cells post-HIV envelope vaccination compared to adult macaques. These findings are especially critical to advance our understanding of the transcriptomic changes in neonates who are exposed to HIV and have TB.
Though the NHP model of SIV has unequivocally illustrated the mechanisms of HIV pathogenesis, there are significant genetic and biological differences between lentiviruses and some HIV-specific interventions that cannot be tested in nonhuman hosts. Humanized mice represent an attractive alternative animal model that recapitulates key aspects of human HIV biology and pathogenesis
[85][86][143,144]. Single-cell transcriptome analysis of human CD45+CD3-CD19- cells isolated from the spleens of humanized mice demonstrated a significant upregulation of the genes involved in type I IFN inflammatory pathways in the innate immune subsets
[87][145]. In another study by Aso et al., the characterization of HIV-1-producing cells in a humanized mouse model using scRNA-seq demonstrated the heterogeneity of HIV-1-infected cells, including
CXCL13high cells that provide clues for the development of an HIV-1 cure
[88][146]. Humanized mice also support dual infections of
Mtb and HIV in the lung, CNS, and other organs
[89][90][91][147,148,149]. Co-infection in humanized mice reproduces several important aspects of microbial synergy, such as greater viral and bacterial burden and exacerbated inflammation. Applying single-cell RNA sequencing technology in this model would further enable the profiling of the complex virus, bacterial, and host dynamics during infection (
Figure 3). It is a powerful tool that helps improve our understanding of the intercellular communication networks and the host–bacteria/virus interactions in immunologically relevant animal models.
Figure 3. scRNA-seq in preclinical animal models to study TB/HIV co-infection. The nonhuman primate and humanized mouse model can be utilized to gain transcriptomic insights into several aspects of TB and HIV co-infection in humans, including the impact of treatment, diagnostics, inflammation, and immune activation. The figure was made with help of Biorender.