Tuberculosis (TB) is an infectious disease caused by Mycobacterium tuberculosis (Mtb), with 10.4 million new cases per year reported in the human population. Recent studies on the Mtb transcriptome have revealed the abundance of noncoding RNAs expressed at various phases of mycobacteria growth, in culture, in infected mammalian cells, and in patients. Among these noncoding RNAs are both small RNAs (sRNAs) between 50 and 350 nts in length and smaller RNAs (sncRNA) < 50 nts.
1. Introduction
Mycobacterium tuberculosis (Mtb) remains one of the leading infectious causes of human mortality, supplanted only in 2020 by the COVID-19 pandemic triggered by the SARS-CoV-2 virus. Mtb evolved from an ancestral smooth tubercule bacillus (e.g.,
M. canettii,
M. pseudotuberculosis), acquiring virulence elements to attain its preferred pathogenicity towards humans
[1]. The acquisition of these virulence elements coincided with Mtb undergoing a genomic downsizing relative to the 100 different smooth tubercule bacilli species characterized
[1][2][1,2]. Despite this downsizing, a core genome is evident among the pathogenic strains of mycobacteria. Several decades of research efforts have been devoted to understanding how the ~4000 protein-coding elements evident in the Mtb genome contribute to growth, survival, and pathogenic processes
[3][4][5][6][7][3,4,5,6,7]. Recent technical advances in deciphering the complex nature of Mtb and related mycobacterial genomes, including improved large-scale RNA-sequencing strategies, have revealed an abundance of small RNAs (sRNA). First, described as ranging in size from 50 to 350 nucleotides (nts)
[8][9][10][11][12][8,9,10,11,12], these small RNAs now include some as small as 18 nts
[13]. The sRNAs, originally selected with sequences > 100 nts in length, were found to represent ~11% of the intergenic transcripts (IGRs) identified from the exponential phase cultures. In addition to the sRNAs, IGRs include 5′ and 3′ UTRs, tRNAs, and antisense RNAs. Based on the normalized read counts for sense, antisense, and intergenic noncoding RNAs, the antisense and intergenic noncoding RNAs made up roughly 25% of the transcripts mapping outside of ribosomal RNA genes
[10]. The sRNAs are detected in both exponential and/or stationary phase cultures, in infected eukaryotic cells, and in patients with tuberculosis (TB), suggesting key roles in all aspects of mycobacterial growth and survival
[9][10][11][12][13][14][9,10,11,12,13,14].
2. Functional Roles of Mycobacterial sRNAs and sncRNAs
A key step in identifying putative biological roles for the sRNAs relates to what stages in a mycobacterial growth cycle they are expressed
[15][41]. Additional insights have come from the environmental conditions that affect sRNA expression. Among the conditions are oxidative stress, nutrient deprivation, DNA damage, antibiotic exposure, and/or acidic environments, the latter occurring in the phagolysosome formed in macrophages and dendritic cells. Putative functional roles for the numerous sRNAs need also to consider the stability of the sRNA, affected by both the relative GC content and secondary RNA structures. Examples of several better characterized sRNAs are B11/6C, MTS1338/DrrS, Ms1, MTS0097/Mcr11, ncRv11846/MrsI, Mcr7, and sncRNA-1 (
Figure 12,
Table 1).
Figure 12. Functional contributions of seve ral Mtb-encoded sRNAs including B11/6C (A), MTS1338/DrrS (B), MTS0997/Mcr11 (C), ncRv11846/MrsI (D), sncRNA-1 (E), Ms1 (F), and Mcr7 (G). Indirect interactions are shown with a dashed line. (A) B11/6C is induced by undefined factors and positively regulates the expression genes (panD, dnaB) coupled to growth. (B) DrrS+ is induced by the DosR regulon upon nitric oxide stress. Then, it undergoes post transcriptional processing to yield MTS1338/DrrS. MTS1338/DrrS promotes the expression of three operons (rv0079-rv0081, rv0082-rv0087, and rv1620c-rv1622c), which cause defects in Mtb growth and promote persistence. The mechanism of this MTS1338/DrrS mediated regulation has not been characterized. (C) The expression of MTS0997/Mcr11 is regulated by AbmR, an ATP-bound transcription factor. After transcription, MTS0997/Mcr11 undergoes processing at the 3′ end and then regulates the expression of genes (lipB, fadA3, and accD5) involved in fatty acid production in a site-specific manner. This is negatively regulated by fatty acids. (D) In iron-restricted environments, the iron-responsive transcription factor IdeR induces the expression of ncRv11846/MrsI, which in return hinders the translation of nonessential iron storing proteins (hypF, bfrA, and fprA). This increases the level of free iron that can be used for essential functions. (E) sncRNA-1 is induced in infected macrophages and gets processed to yield 25 nts RNA. The processed sncRNA-1 enhances the expression of rv0242c and rv1094, two genes involved in oleic acid production. This regulatory network then promotes Mtb growth and survival inside macrophages. (F) Ms1 sequesters RNA polymerase (RNAP) at the stationary phase. Upon entrance to the outgrowth phase, Ms1 is degraded by PNPase, and some other RNases not yet identified, which release RNAP to promote global transcription. (G) PhoP induces the expression of Mcr7, which abrogates the translation of tatC. tatC encodes TatC, which is involved in the protein secretion pathway.
Table 1. List of sRNA identified by Arnvig et al., DiChiara et al., Gerrick et al., and Coskun et al.
Name |
Northern or PCR Size |
Location |
Surrounding Genes |
Expression |
B11/6C (Candidate_1603) [8][9] |
93 |
4099386-4099478 (−) |
rv3660c- rv3661 |
H2O2 and pH = 5 |
B55 (Candidate_84) [8][9] |
61 |
704187-704247 (+) |
rv0609A- rv0610c |
H2O2 and Mitomycin C |
C8 (Mcr6, candidate_1621) [8][9][11] |
58, 70, 128 |
4168154-4168281 (−) |
rv3722c- rv3723 |
TBD a |
F6 (Mcr14, candidate_29) [8][9][16] |
38, 58, 102 |
293604-293705 (+) |
fadA2-fadE5 |
H2O2 and pH = 5 |
G2 (Candidate_1269) [8][9] |
67, 214, 229 |
1914962-1915190 (−) |
tyrS-IprJ |
TBD |
ASdes (candidate_121) [8][9][17] |
48, 63, 68, 83, 94, 109, 149, 169, 195 |
918264-918458 (+) |
within desA1 |
TBD |
ASpks [9] |
78, 89, 91, 102, 129, 142, 162 |
2299745-2299886 (+) |
within pks12 |
H202 |
AS1726 [9] |
61, 77, 85, 110, 213 |
1952291-1952503 (−) |
within Rv1726 |
TBD |
AS1890 [9] |
63, 109, 191, 238 |
2139419-2139656 (+) |
within Rv1890 |
TBD |
MTS2823 or Ms1 [10][18][19] |
250, 300 |
4100669-4100968 (+) |
rv3661- rv3662c |
in vivo |
MTS1338/DrrS [10][20][21] |
108, 109, ~160, 273 |
1960667-1960783 (+) |
rv1733c- rv1735c |
NO, stationary phase, in vivo |
MTS0997/Mcr11 (Candidate_1693) [8][10][11][14][22] |
115 |
1413094-1413224 (−) |
rv1264- rv1265 |
in vivo, stationary phase, low pH, or hypoxia |
Mcr1 [11] |
>300 |
2029043-2029087 (TBD) |
ppe26-ppe27 |
TBD |
Mcr2 [11] |
120 |
1108857-1108824 (TBD) |
rv0967- rv0968 |
TBD |
Mcr3 (candidate_190) [8][11] |
118 |
1471619-1471742 (+) |
murA-rrs |
TBD |
Mcr4 (candidate_1314) [8][11] |
200–250 |
2137148-2137103 (TBD) |
fbpB-rv1887 |
TBD |
Mcr5 [11] |
80 |
2437823-2437866 (−) |
within rv2175c |
TBD |
Mcr7 [11][23] |
350–400 |
2692172-2692521 (+) |
rv2395-pe_PGRS41 |
TBD |
Mcr8 (candidate_1935) [8][11] |
200 |
4073966-4073908 (TBD) |
rv3661–rv3662c |
TBD |
Mcr9 (candidate_1502) [8][11] |
66–82 |
3317634-3317517 (TBD) |
ilvB1-cfp6 |
TBD |
Mcr10 [11] |
120 |
1283693-1283815 (+) |
within rv1157c |
TBD |
Mcr12 [11] |
118 |
1228436-1228381 (TBD) |
rv1072- rv1073 |
TBD |
Mcr13 [11] |
311 |
4315154-4315215 (TBD) |
rv3866- rv3867 |
TBD |
Mcr15 [11] |
>300 |
1535417-1535716 (−) |
rv1363c- rv1364c |
TBD |
Mcr16 [11] |
100 |
2517032-2517134 (−) |
within fabD |
TBD |
Mcr17 [11] |
82–90 |
2905457-2905402 (TBD) |
within rv2613c |
TBD |
Mcr18 [11] |
82 |
3466287-3466332 (TBD) |
within nuoC |
TBD |
Mcr19 [11] |
66–82 |
575033-575069 (+) |
within rv0485 |
TBD |
ncRv11846/MrsI [12] |
100 |
2096766-2096867 (+) |
blal-rv1847 |
iron starvation, oxidative stress, and membrane stress |
sncRNA-1 [13] |
25 |
4352927-4352951 |
esxA-rv3876 |
inside macrophages |
sncRNA-6 [13] |
21 |
786003-786083 |
rv0685- rv0686 |
inside macrophages |
sncRNA-8 [13] |
24 |
1471701-1471724 |
murA-rrs |
inside macrophages |