You're using an outdated browser. Please upgrade to a modern browser for the best experience.
Garlic Virus E Genome
Edit

Garlic (Allium sativum L.) plants exhibiting mosaics, deformation, and yellow stripes symptoms were identified in Meerut City, Uttar Pradesh, India. To investigate the viruses in the garlic samples, the method of high-throughput sequencing (HTS) was used. Complete genome of the garlic virus E (GarV-E) isolate (NCBI accession No. MW925710) was retrieved. The virus complete genome comprises 8450 nucleotides (nts), excluding the poly (A) tail at the 3′ terminus, with 5′ and 3′ untranslated regions (UTRs) of 99 and 384 nts, respectively, and ORFs encoding replicase with a conserved motif for RNA-dependent RNA polymerase (RdRP), TGB1, TGB2, TGB3, serine-rich protein, coat protein, and nucleic acid binding protein (NABP). The sequence homology shared 83.49–90.40% and 87.48–92.87% with those of GarV-E isolates available in NCBI at the nucleotide and amino acid levels, respectively. Phylogenetic analysis showed a close relationship of this isolate from India (MW925710) with GarV-E isolate YH (AJ292230) from Zhejiang, China. The presence of GarV-E was also confirmed by RT-PCR.

garlic high-throughput sequencing RT-PCR phylogenetic analysis

1. Introduction

Garlic (Allium sativum L.; Family: Amaryllidaceae) is an aromatic bulbous crop native to central Asia and is consumed worldwide as food in addition to traditional remedies for various diseases [1]. It is highly prone to viral infection, which has adversely reduced bulb weight [2]. Garlic crops are often infected by multiple viruses belonging to several genera that are known as the “garlic virus complex” [2]. Many viruses infecting garlic have been identified in India, including potyvirus (onion yellow dwarf virus; OYDV, leek yellow stripe virus; LYSV), carlavirus (garlic common latent virus; GarCLV, shallot latent virus; SLV), tospovirus (iris yellow spot virus; IYSV), allexivirus (garlic virus A; GarV-A, garlic virus B; GarV-B, garlic virus C; GarV-C, garlic virus D; GarV-D and garlic virus X; GarV-X) [3][4][5][6][7][8][9][10][11][12]. Garlic virus E (GarV-E) belongs to the single-stranded, positive-strand RNA virus of the genus Allexivirirus and family Alphaflexiviridae. It was previously reported in garlic from China, Poland, Australia, the USA, and Japan [13][14][15][16]. These viruses are reported to be transmitted by an insect mite vector [17], vegetative propagation, and mechanically [18]. The disease symptoms include leaf mosaic, deformation, and yellow stripes, which reduce yield and deteriorate the quality of the crop. Because of vegetative propagation, these viruses can accrete in the bulb and can be transmitted to successive generations. Hence, the eradication of these viruses becomes onerous. Considering the importance of garlic, identification, characterization of the virus associated with the disease, and an appropriate management strategy are required [19].
To date, in the public domain, only four complete genomic sequences of GarV-E isolates have been reported from China [20][21]. Several studies have reported genetic differences based on coat protein (CP) sequences within Allexivirus species [21], and there are currently 15 partial CP/NABP sequences of GarV-E submitted globally available in the NCBI database [16][22].

2. Sequence Analysis

To reveal viruses that might be associated with the symptoms, the RNA of pooled symptomatic clove and leaf samples was sequenced using the Illumina HiSeq 2000 platform. The size of the Illumina sequencing data generated was approximately 43 million 125 bp paired-end reads in the two libraries. After trimming 34,873,264 bp (average length 124.54 bp) and 31,494,032 bp (average length 124.57 bp), raw sequence reads were obtained. A total of 133,971 and 108,668 contigs were generated from clove and leaf samples of garlic, respectively. All contigs were subjected to a BLASTn search against the nr database, which revealed whole-genome sequences of GarV-E apart from other garlic viruses, including potyvirus (onion yellow dwarf virus; OYDV, leek yellow stripe virus; LYSV), carlavirus (garlic common latent virus; GarCLV, shallot latent virus; SLV), and allexivirus (garlic virus A; GarV-A, garlic virus B; GarV-B, garlic virus C; GarV-C, garlic virus D; GarV-D and garlic virus X; GarV-X). Sequence taxonomic profiling was visualized using a Krona graph, and sequence reads belonging to Allexivirus, GarV-A (14%), GarV-B (5%), GarV-D (36%), GarV-E (18%), and GarV-X (27%), were obtained. The sequence mapping, BLAST analysis, and Kraken approach generated the complementary datasets, which were supported with 100% convergence. Reference-based mapping of the data revealed that the viral reads mapped to GarV-A, GarV-D, GarV-E, GarV-X, OYDV, LYSV, and GarCLV in both of the samples. The obtained data revealed that 100,184 (0.33%) reads from clove and 510,948 (1.95%) reads from leaf sample mapped with the GarV-E genome.

3. Genome Annotation and Analysis of Garlic Virus E

BLASTn program-based analysis showed that the GarV-E contig comprises the complete genome sequence of 8450 bp ssRNA. The 5′ UTR and 3′ UTR sequences were not included in the study, and the genome sequence obtained in the study was deposited to NCBI with accession number MW925710. In addition to exploring the amino acid sequence in all possible open reading frames (ORFs), it was viable to detect the characteristic domains along with conserved motifs specific to the genus Allexivirus [13][23][24][25]. The ORF Finder and smart BLAST tool revealed that ORF1 encodes replicase (4671 nt; 1556 aa) with a conserved motif SG×3T×3NT×22GDD, which is the proposed active site of the RNA-dependent RNA polymerase (RdRP), was found at amino acid positions 1317–1353, ORF2 a TGB1 (735 nt; 244 aa), ORF3 a TGB2 (309 nt; 102 aa), ORF4 a TGB3 (225 nt; 74 aa), ORF5 a serine-rich protein (234 nt; 77 aa), ORF6 a coat protein (759 nt; 252 aa), and ORF7 an NABP (348; 127 residues) (Figure 1). The pairwise sequence comparison at the level of nucleotides and deduced amino acids of all seven ORFs revealed 72.8–98.3% and 80.5–98.7% identities, respectively, with the Chinese isolates (AJ292230, MN059326, MN059327, and MN059328) (Table 1).
Figure 1. Genomic organization of GarV-E (MW925710) showing seven predicted open reading frames and their corresponding products: replicase, TGB1, TGB2, TGB3, serine-rich protein, viral coat protein (CP), and nucleic acid binding protein (NABP).
Table 1. Pairwise percent sequence identity of Indian GarV-E Isolate (MW925710) at the nucleotide (nts) level and its deduced amino acid (aa) sequence of ORFs with other complete genomes of Allexiviruses (for which complete genome sequences are available).
  Replicase TGB 1 TGB 2 TGB3 Serine-Rich Protein CP NABP
AJ292230_GarV-E_China 90.2
(92.87)
86.9
(89.9)
90.7
(92.2)
85.7
(88.2)
88.4
(88.3)
90.6
(92.8)
93.4
(95.1)
MN059326_GarV-E_China 82.8
(87.48)
98.3
(98.7)
84.9
(84.4)
72.8
(80.5)
77.7
(81.8)
84.0
(87.6)
86.9
(90.5)
MN059327_GarV-E_China 82.5
(87.60)
94.9
(97.5)
86.5
(86.4)
72.8
(80.5)
78.2
(81.8)
83.1
(86.5)
86.1
(89.7)
MN059328_GarV-E_China 82.5
(87.54)
94.6
(97.5)
86.8
(83.4)
75.3
(81.2)
78.6
(83.1)
83.3
(87.3)
85.3
(87.1)
MK518067_GarV-D_India 80.6
(82.09)
57.3
(61.3)
76.6
(82.5)
66.6
(72.8)
75.2
(72.7)
74.8
(77.1)
71.8
(74.9)
MN059391_GarV-D_China 79.8
(81.40)
57.7
(61.3)
76.6
(82.5)
66.2
(72.8)
75.2
(72.7)
74.9
(77.1)
73.4
(74.3)
KF550407_GarV-D_Argentina 76.7
(80.6)
57.3
(60.0)
73.7
(79.6)
61.7
(66.2)
69.2
(68.8)
71.1
(75.5)
72.8
(74.6)
KF555653_GarV-D_Australia 74.8
(77.9)
58.1
(60.3)
73.7
(80.5)
57.7
(61.4)
68.3
(68.8)
70.7
(76.7)
73.8
(74.1)
KF632716_GarV-A_Spain 59.0
(61.5)
56.1
(57.6)
73.0
(77.6)
61.7
(66.8)
67.9
(64.9)
67.4
(72)
69.3
(74.5)
MN059255_GarV-A_China 58.9
(62.7)
56.3
(57.6)
73.0
(77.6)
61.7
(65.5)
66.6
(63.6)
67.3
(71.6)
68.8
(73.7)
MN059151_GarV-B_China 65.8
(68.3)
53.1
(52.8)
58.9
(48.0)
37.1
(47.5)
54.2
(57.1)
54.2
(57.3)
63.8
(70.6)
MK503771_GarV-X_India 65.4
(67.9)
53.2
(55.9)
56.0
(53.8)
40.0
(51.0)
50.4
(49.3)
56.2
(56.9)
64.4
(69.0)
MN059424_GarV-X_China 65.7
(68.2)
53.2
(55.1)
57.6
(51.9)
39.6
(43.5)
51.7
(51.9)
57.3
(57.3)
64.9
(74.7)
MN059170_GarV-B_China 66.0
(68.2)
52.4
(52.6)
58.6
(49.0)
39.3
(43.0)
53.8
(49.3)
55.8
(57.6)
56.3
(62.8)
JX310755_ShV-X_Russia 55.7
(58.5)
60.7
(63.6)
61.5
(58.2)
41.6
(44.1)
58.1
(55.8)
57.7
(60.0)
68.8
(72.4)

4. Sequence Similarity and Phylogenetic Analysis

BLASTn [26] searches of the NCBI databases showed that the complete genome of GarV-E isolate India (MW925710) shared 83.49–90.40% nucleotide sequence identities with previously reported isolates (AJ292230, MN059326, MN059327, and MN059328). Moreover, the Indian isolate was more closely related to isolate YH (AJ292230) (90.40%) from Zhejiang, China. A similar result was obtained using an NJ-based phylogenetic tree of the complete genome sequence of GarV-E with other complete genome sequences of GarV-E and other Allexivirus species from different regions of the world.
In this entry, ingroups were selected from the same species from different countries based on the closely related complete genome, and complete CP sequences to the respective viruses and outgroups were selected from the Allexivirus genus virus containing enough homologous sites to the respective ingroup virus species to assess the evolutionary relationship. The phylogenetic tree revealed that GarV-E Indian isolates (accession no. MW925710) grouped in the same clade as other GarV-E isolates reported from other countries (Figure 2). Similar phylogenetic tree results were obtained at amino acid (aa) level. The pairwise sequence identities (%) of the GarV-E complete genome sequence (MW925710) shared nucleotide (nt) identity at 79.80–90.10% and amino acid (aa) identity at 79.90–89.1% with other GarV-E isolates reported globally (Table 2).
Figure 2. Phylogenetic analysis of GarV-E isolates in the complete genome amino acid sequence using Neighbor joining algorithm. The evolutionary distances were computed using p-distance method with 1000 bootstrap replicates. The scale bar indicates the number of substitutions per site.
Table 2. Comparisons of nucleotide sequence (nts) and amino acid (aa) identity of pairwise combinations of complete genome sequences of garlic virus E (accession no. MW925710) with other complete genome sequences of Allexiviruses. (for which complete genome sequences are available).
Seq-> nts/aa MW925710_GarV-E_India AJ292230_GarV-E_China MN059326_GarV-E_China MN059327_GarV-E_China MN059328_GarV-E_China MK518067_GarV-D_India MN059391_GarV-D_China KF550407_GarV-D_Argentina KF555653_GarV-D_Australia KF632716_GarV-A_Spain MN059255_GarV-A_China MN059151_GarV-B_China MN059170_GarV-B_China MK503771_GarV-X_India MN059424_GarV-X_China JX310755_ShV-X_Russia
MW925710_GarV-E_India ID 89.1 80.1 79.9 82.9 71.6 71.6 70.4 70.3 57.4 57.1 59.6 60.2 59.5 60.1 52.7
AJ292230_GarV-E_China 90.1 ID 84.9 84.9 81.2 61.7 61.8 61.2 62.1 52.4 52.2 52.5 52.7 52.5 53.0 47.9
MN059326_GarV-E_China 83.7 87.9 ID 95.6 91 75 75.1 74.4 75 63 62.9 64.3 64.3 63.9 64.6 57.7
MN059327_GarV-E_China 83.1 88.1 94.5 ID 91.7 74.6 74.7 74.4 75.3 63 62.9 64.2 64.2 64.1 64.8 57.8
MN059328_GarV-E_China 79.8 83.6 89.7 91.5 ID 91.7 74.6 74.7 74.4 75.3 63 62.9 64.2 64.2 64.1 64.8
MK518067_GarV-D_India 75.1 68.9 68.7 68.3 65.2 ID 99.6 90.4 89.6 63.6 62.9 63.4 63.7 62.9 63.6 57.4
MN059391_GarV-D_China 74.6 68.8 68.7 68.2 65.1 98.6 ID 90.3 89.5 63.8 63 63.5 63.8 63 63.7 57.5
KF550407_GarV-D_Argentina 71.8 69.3 69.2 69.1 66.3 85.4 84.5 ID 89.2 63.8 63 63.2 63.5 63.1 63.7 58
KF555653_GarV-D_Australia 75 69.4 69.0 68.8 65.8 83.2 83.7 86.4 ID 63.4 62.8 63.5 63.5 62.8 63.3 57.9
KF632716_GarV-A_Spain 65 61.7 61.3 61.5 58.7 61.7 61.5 62.1 6.2 ID 78.7 57.3 57.5 57.3 57.8 64.4
MN059255_GarV-A_China 65 61.6 61.2 61.4 58.6 61.6 61.4 62.0 61.8 98.3 ID 56.9 56.9 56.6 57 63.1
MN059151_GarV-B_China 59.8 61.1 61.2 61.1 59.1 61.0 61.1 69 61.1 57.3 57.3 ID 90.8 75.8 75.2 54.2
MN059170_GarV-B_China 59.7 61.0 61.1 61.2 59.2 65 66 66 67 56.9 57.1 86.4 ID 74.7 75.3 54
MK503771_GarV-X_India 59.6 67 68 67 58.8 61.2 61.3 61.3 61.0 56.4 56.5 77 70 ID 90.6 54.3
MN059424_GarV-X_China 59.9 61.1 69 68 59.0 67 61.0 68 61.2 56.7 56.8 73 69.9 99 ID 54.3
JX310755_ShV-X_Russia 55.7 56.5 56.3 56.6 54.1 56.7 56.8 56.4 56.4 63.4 63.5 54.3 54.1 54.1 54.2 ID
The results of the phylogenetic tree constructed with nucleotide sequences of the complete coat protein (CP) gene available in NCBI for the genus Allexivirus were consistent with the results obtained for the complete genome. Previously, in many of the assessments between members of diverse species of Allexivirus, the percent nt and aa sequence identities of the CP gene showed values greater than those suggested by ICTV. This was also identified by [21], who proposed that GarV-A may be combined with other viruses, such as GarV-D and GarV-E, providing more than 73% nt identity among the CP genes to become acceptable for GarV species characterization. Similarly, GarV-B may be combined with GarV-X [21]. The comparison of the CP gene of GarV-E Meerut India (MW925710) with similar sequences shared nucleotide (nt) identity at 83.3–90.6% and amino acid (aa) identity at 86.5–92.8% with other GarV-E isolates reported globally (Table 1).
Out of 16 samples, seven, including samples used for HTS, were found to be positive for virus infection using RT-PCR with an amplicon of ~750 bp. The sequences obtained were deposited to NCBI with accession numbers MW925695, OK064618, OK064619, OK064620, and OK064621. BLASTn analysis of the partial CP/NABP gene of GarV-E India (MW925695) shared 83.63–92.96% nucleotide identity with other isolates available in NCBI. The pairwise sequence identity comparison of the partial CP/NABP gene of GarV-E India (MW925695) with similar sequences shared nucleotide (nt) identity at 83.5–92.9% and amino acid (aa) at 82.5–96.7%. Moreover, the Indian isolate shared a high sequence identity with E-JF-2 isolate (LC097189) (90.40% nt and 96.7% aa) isolated from Fukuoka, Japan. To better understand the genetic variability of the GarV-E isolates, researchers selected 18 partial CP/NABP coding region sequences of GarV-E and other Allexivirus from different geographical locations to construct a phylogenetic tree. In the phylogenetic tree, the GarV-E partial CP/NABP India isolate (accession no. MW925695) was grouped in the same clade as other GarV-E isolates reported from other countries.
NJ-based phylogenetic analysis of the complete genome of the virus (Figure 2), complete coat protein region, and partial CP/NABP coding region showed consistent clustering of isolates. They all suggest a close relationship between the Indian GarV-E isolate and other GarV-E isolates, supported by high posterior probability values. Recombination appears to be rare in single-stranded, negative-sense RNA viruses, although for those with segmented genomes, such as influenza A, a genetic exchange can still occur through reassortment [27]. Researchers did not find any strong signatures of recombination by RDP4 in individual alignments of the Indian GarV-E isolate (data not presented).

References

  1. Camargo Filho, W.P.; Camargo, F.P. A quick review of the production and commercialization of the main vegetables in Brazil and the world from 1970 to 2015. Hortic. Bras. 2017, 35, 160–166.
  2. Da Silva, L.A.; Oliveira, A.S.; Melo, F.L.; Ardisson-Araújo, D.M.; Resende, F.V.; Resende, R.O.; Ribeiro, B.M. A new virus found in garlic virus complex is a member of possible novel genus of the family Betaflexiviridae (order Tymovirales). PeerJ 2019, 7, e6285.
  3. Ghosh, D.K.; Ahlawat, Y.S. Filamentous viruses associated with mosaic disease of garlic in India. Indian Phytopathol. 1997, 50, 266–276.
  4. Gawande, S.J.; Gurav, V.S.; Ingle, A.A.; Gopal, J. First report of Garlic virus A in garlic from India. Plant Dis. 2015, 99, 1288.
  5. Gawande, S.J.; Khar, A.; Lawande, K.E. First report of Iris yellow spot virus on garlic in India. Plant Dis. 2010, 94, 1066.
  6. Gupta, N.; Prabha, K.; Islam, S.; Baranwal, V.K. First report of Leek yellow stripe virus in garlic from India. J. Plant Pathol. 2013, 95 (Suppl. 4), 4–75.
  7. Majumder, S.; Baranwal, V.K. First report of Garlic common latent virus in garlic from India. Plant Dis. 2009, 93, 106.
  8. Roylawar, P.B.; Khandagale, K.S.; Randive, P.M.; Atre, G.E.; Gawande, S.J.; Singh, M. First Report of Garlic Virus B Infecting Garlic in India. Plant Dis. 2021, 105, 1232.
  9. Roylawar, P.B.; Khandagale, K.S.; Gawai, T.; Gawande, S.J.; Singh, M. First Report of Garlic Virus C Infecting Garlic in India. Plant Dis. 2019, 103, 1439.
  10. Singh, J.; Singh, M.K.; Ranjan, K.; Kumar, A.; Kumar, P.; Sirohi, A.; Baranwal, V.K. First complete genome sequence of garlic virus X infecting Allium sativum-G282 from India. Genomics 2020, 112, 1861–1865.
  11. Singh, J.; Truong, T.N.; An, D.; Prajapati, M.R.; Manav, A.; Quoc, N.B.; Baranwal, V.K. Complete genome sequence and genetic organization of a Garlic virus D infecting garlic (Allium sativum) from northern India. Acta Virol. 2020, 64, 427–432.
  12. Prajapati, M.R.; Manav, A.; Singh, J.; Singh, M.K.; Ranjan, K.; Kumar, A.; Kumar, P.; Kumar, R.; Baranwal, V.K. Identification of Garlic virus A infecting Allium sativum L. through next-generation sequencing technology. J. Hortic. Sci. Biotechnol. 2021, 1–10.
  13. Chen, J.; Adams, M.J. Molecular characterization of a complex mixture of viruses in garlic with mosaic symptoms in China. Arch. Virol. 2001, 146, 1841–1853.
  14. Chodorska, M.; Paduch-Cichal, E.; Szyndel, M.S.; Kalinowska, E. First report of Garlic virus D, E, and X on garlic in Poland. J. Plant Pathol. 2013, 95 (Suppl. 4), 4–70.
  15. Bereda, M.; Paduch-Cichal, E. Allexiviruses–pathogens of garlic plantsAllexiwirusy–patogeny czosnku pospolitego. Prog. Plant Prot. 2016, 56, 302–311.
  16. Nurulita, S.; Geering, A.D.; Crew, K.S.; Harper, S.; Thomas, J.E. First report of garlic virus E in Australia. Australas. Plant Dis. Notes 2020, 15, 32.
  17. Kang, S.G.; Koo, B.J.; Lee, E.T.; Chang, M.U. Allexivirus transmitted by eriophyid mites in garlic plants. J. Microbiol. Biotechnol. 2007, 17, 1833–1840.
  18. Katis, N.I.; Maliogka, V.I.; Dovas, C.I. Viruses of the genus Allium in the Mediterranean region. Adv. Virus Res. 2012, 84, 163–208.
  19. Yadav, V.; Majumder, S. The first complete genome sequence of garlic common latent virus occurring in India. Virus Dis. 2019, 30, 311–314.
  20. Chen, J.; Chen, J. Genome organization and phylogenetic tree analysis of Garlic virus E, a new member of genus Allexivirus. Chin. Sci. Bull. 2002, 47, 33–37.
  21. Chen, J.; Zheng, H.Y.; Antoniw, J.F.; Adams, M.J.; Chen, J.P.; Lin, L. Detection and classification of allexiviruses from garlic in China. Arch. Virol. 2004, 149, 435–445.
  22. Yoshida, N.; Shimura, H.; Masuta, C. Allexiviruses may have acquired inserted sequences between the CP and CRP genes to change the translation reinitiation strategy of CRP. Arch. Virol. 2018, 163, 1419–1427.
  23. Lezzhov, A.A.; Gushchin, V.A.; Lazareva, E.A.; Vishnichenko, V.K.; Morozov, S.Y.; Solovyev, A.G. Translation of the shallot virus X TGB3 gene depends on non-AUG initiation and leaky scanning. J. Gen. Virol. 2015, 96, 3159–3164.
  24. Lukhovitskaya, N.I.; Vetukuri, R.R.; Sama, I.; Thaduri, S.; Solovyev, A.G.; Savenkov, E.I. A viral transcription factor exhibits antiviral RNA silencing suppression activity independent of its nuclear localization. J. Gen. Virol. 2014, 95, 2831–2837.
  25. Morozov, S.Y.; Solovyev, A.G. Phylogenetic relationship of some “accessory” helicases of plant positive-stranded RNA viruses: Toward understanding the evolution of triple gene block. Front. Microbiol. 2015, 6, 508.
  26. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410.
  27. Simon-Loriere, E.; Holmes, E.C. Why do RNA viruses recombine? Nat. Rev. Microbiol. 2011, 9, 617–626.
More
Upload a video for this entry
Information
Subjects: Virology
Contributor MDPI registered users' name will be linked to their SciProfiles pages. To register with us, please refer to https://encyclopedia.pub/register :
View Times: 885
Revisions: 4 times (View History)
Update Date: 14 Mar 2022
Academic Video Service