Soybean Resistance to Soybean Cyst Nematode: Comparison
Please note this is a comparison between versions V1 by Bahram Samanfar and V5 by Bruce Ren.

Plant pathogens greatly impact food security of the ever-growing human population. Breeding resistant crops is one of the most sustainable strategies to overcome the negative effects of these biotic stressors. In order to efficiently breed for resistant plants, the specific plant–pathogen interactions should be understood. Soybean is a short-day legume that is a staple in human food and animal feed due to its high nutritional content. Soybean cyst nematode (SCN) is a major soybean stressor infecting soybean worldwide including in China, Brazil, Argentina, USA and Canada. There are many Quantitative Trait Loci (QTLs) conferring resistance to SCN that have been identified; however, only two are widely used: rhg1 and Rhg4. Overuse of cultivars containing these QTLs/genes can lead to SCN resistance breakdown, necessitating the use of additional strategies.

• soybean
• soybean cyst nematode (SCN)
• disease control
• pathogen management
• omics

## 1. Introduction

#### 1.1. Soybean

Soybean [Glycine max (L.) Merr.] cultivation occupies more than 6% of the world’s arable land with an ever-increasing production area. Soybean seeds contain around 31–44% protein and 19–26% oil, making soybean an excellent staple for both human food and animal feed [1]. Using soybean for crop rotation is also an important tool in sustainable agriculture due to its nitrogen fixation capability [2].
Based on analysis of the cultivar Williams 82, soybean has a genome size of ~1.1 GB contained in 20 chromosomes, with ~89,500 protein coding transcripts annotated for ~55,600 gene loci [3]. The soybean genome is believed to have undergone two major duplication events 59 and 13 million years ago, as well as many chromosomal rearrangements, and rounds of diploidization, all contributing to diversification of the genome [4]. This makes understanding the soybean genome complicated, given that ~75% of genes are present as paralogs [4].
Across the diversity of areas that soybean is cultivated, the plant must deal with various abiotic stressors relating to excess water, drought, iron and other mineral deficiencies, daylength, hail, wind and cold weather conditions [5]. For example, soybean is a short-day plant grown in latitudes 35°S to 50°N and is subjected to photoperiod sensitivities; thus, challenges emerge when trying to expand its cultivation past those latitudes [6]. Soybean also deals with several biotic stressors. Among these is soybean cyst nematode Heterodera glycines Ichinohe, (SCN). SCN is one of the most devastating pathogens to soybean and is widely present in many areas around the world and is continuing to spread to regions of soybean production in North America [7]. Hence, it is important to study this pathogen and identify resistance genes/QTLs in soybean for use in effective management of the disease.

#### 1.2. Soybean Cyst Nematode (SCN)

SCN is a plant parasitic nematode that causes major soybean yield loss (over $1.5 billion annually in the United States) [7]. It has a fully sequenced genome size of ~158 MB comprising of 9 chromosomes and about ~22,400 annotated gene models [8]. Once SCN is present in the soil, eradication is nearly impossible because some eggs contained within the nematode cysts can remain alive for up to ten years and the infective juveniles can be released from the cysts whenever conditions become favorable [9]. Depending on the environmental conditions, the life cycle of SCN can be completed over a 4-week period. The life stages are: egg stage, juvenile stages (J1-J4), and adult stages (female or male). The first two juvenile stages occur within the egg. Once the J1 is formed it molts within the eggshell into the J2. Triggered by environmental factors including the presence of a host plant root, the J2 will hatch from the egg and enter the root. The infective J2 stage enters the root of the host plant using their stylet and by secreting cell-wall-degrading enzymes (e.g., cellulases). The J2 then induces formation of a specialized metabolically active feeding site made up of multinucleate (syncytium) vascular tissue of the roots [9]. In an incompatible interaction (unsuccessful infection), the syncytium is still formed but degrades over time and is overcome by the surrounding cells, whereas in a compatible interaction the syncytium is maintained and expands [10]. The J2 become immobilized and continues to feed at the syncytium and then molt to the J3, J4 and then eventually into the adult stage. Male adults leave the soybean roots after several days of maturing, and no longer harm the soybean plant, while the females continue to feed and increase in size. Damage to soybean plants is largely due to the female feeding. The adult female swells and pushes through the root surface with only the head left in the root. She then releases a pheromone to attract males for mating. Mated females deposit some eggs within a gelatinous matrix at its posterior end, ready to hatch and infect more soybean within the same year. As many as 500 viable eggs remain within the female body that then encysts and dies. These eggs within the cyst can remain viable for years in the soil, until a new host is present and conditions are favorable for renewed infections [9]. Symptoms of soybean infected with SCN include chlorosis of the leaves, darker and less developed roots, and stunting which leads to significant yield loss varying between 5–80%. Nodulation with nitrogen-fixing bacteria can also be reduced [9]. In general, there are significant economic losses caused by SCN. In a study from 1996–2016 across 28 states, SCN caused the greatest total dollar loss (~$800 USD/hectare) with a peak loss of just under \$1600 USD/hectare in 2012. Additionally, over 30% of yield loss that occurs in SCN infested fields is without any noticeable aboveground symptoms [11]. A second study of about 15 years in more than 25,000 experimental plots at 122 location years highlighted that the number of virulent SCN populations reproducing on PI 88,788 grew from 2001–2015 due to the overuse of resistant cultivars. However, no effects were seen on Peking-derived varieties within the same study [12]. The study emphasized the critical need for novel sources of resistance as the usefulness of current resistance is continually diminishing [12]. Images of soybean roots infected with SCN are shown in Figure 1a,b in addition to a comparison of an SCN cyst vs. a nodule on roots in Figure 1b and the greenhouse facility for SCN phenotyping in Figure 1c.
Figure 1. Soybean cyst nematode Heterodera glycines Ichinohe (SCN) infection on soybean roots and phenotyping facility. (a) SCN females shown on soybean roots; (b) A comparison between a nodule = n within the soybean root vs. a female nematode = c, highlighted with red arrows; (c) An SCN phenotyping facility at Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu Research and Development Centre.

#### 1.3. Host–Pathogen Interactions

Plants have several lines of defense against pathogens. These include mechanical defences provided by, for example, the cuticle and cell wall. Plants also exhibit a first line of active plant defense by which pathogen-associated molecular patterns (PAMPs) induce pattern triggered immunity (PTI) [13]. Unfortunately, pathogens can supress various PTI components with effector proteins which they deliver into the plant. Plants also have a second actively induced immune system which is stronger, referred to as effector triggered immunity (ETI) [14]. ETI is much more specific, as it involves the recognition of these effectors/avirulence (Avr) genes by specific resistance (R) genes within the plant. It is known that PTI and ETI both take part in the innate immune response in plants, and in recent years growing evidence proposes that intricate interactions occur between pattern-recognition receptors in the PTI pathway and nucleotide-binding domain leucine-rich repeat containing receptors in the ETI pathway along with common signalling components which are shared by both [15]. Further research is required and the components that make them up are still largely unknown [16].
To date, only two soybean loci have been utilized on a large scale for SCN resistance: Rhg1 and Rhg4. Specifically, SCN resistance is conferred by the recessive form of Rhg1, rhg1 and the dominant form of Rhg4. The Rhg1 locus was mapped to chromosome 18, with the rhg1 gene itself displaying incomplete dominance. Rhg4 was mapped to chromosome 8 [17][18][19][20][17,18,19,20]. rhg1 is made up of a 31 kb multi-gene segment coding for three different proteins all involved in resistance [21]. The first is an α-SNAP protein (GmSNAP18), the second is a wound-inducible domain protein (WI12) (GmWI12) and the third is an amino acid transporter (AAT) (GmAAT) [21][22][21,22]. Rhg-1 has two resistant alleles: rhg1-a (Peking-type) resistance with low copy number (3 or less copies of GmSNAP18, GmAAT, and GmWI12 in one genomic segment) and rhg1-b (PI 88788-type) resistance with a high copy number (4 or more copies of GmSNAP18, GmAAT and GmWI12 in one genomic segment) [23]. The rhg1-a allele carries a retrotransposon in the α-SNAP protein, while the α-SNAP protein in rhg1-b does not. This causes the rhg1-a “Peking-type” varieties to require Rhg4 for complete resistance, while rhg1-b “PI 88788-type” varieties do not. Rhg4 codes for a cytosolic serine hydroxymethyltransferase (SHMT) protein, which is responsible for resistance against SCN [20]. Having only two soybean resistance loci (rhg1 and Rhg4) to SCN will not be sustainable for much longer, and resistance breakdown with more aggressive SCN populations are inevitable. Stages of SCN infection in fields are seen in Figure 2a,b, while later stages of infection are shown in Figure 2c.
Figure 2. Soybean field in Agriculture and Agri-Food Canada, Ottawa Research and Development Centre (AAFC-ORDC) post SCN infection at different stages. (a) Soybean during early infection still appear relatively healthy; (b) leaf chlorosis and yellowing then becomes visible; (c) soybean plants become yellow and die during later stages of infection.

## 2. What Is New at the rhg1 and Rhg4 Loci?

The interaction of rhg1 and Rhg4 for resistance against SCN was clearly established in the past [24][26]. Since then, new findings have expanded the researcheours understanding of these systems.

#### 2.1. α- SNAP

As a component of rhg1, the soybean gene GmSNAP18 codes for a soluble N-ethylmelaimide sensitive factor (NSF) attachment protein (α-SNAP) for which multiple haplotypes exist, each conferring a different type of resistance [22]. Using a positional cloning technique, along with region-specific extraction sequencing (RES-Seq) in resistant and susceptible lines, it was shown that Haplotype I (rhg1-a) carried the Peking-type resistance while Haplotype II (rhg1-b) carried PI 88788-type resistance and Haplotype III carried the susceptible version of GmSNAP18 (rhg1-s). The transcript levels of GmSNAP18 were 2.1 times higher in the rhg1-a resistant cultivars than the susceptible rhg1-s cultivars. They were also 8.3 times higher in the rhg1-b resistant cultivars than rhg1-s under uninfected conditions and even higher during infection. The rhg1-a allele also carries a retrotransposon in the α-SNAP protein while the α-SNAP protein in rhg1-b does not [23]. In mammalian genomes, the α-SNAP protein works with NSF which together act to mediate trafficking, disassembling and reusing of other important proteins associated with vesicle docking and fusion. NSF proteins are always encoded because null mutations are lethal in animals. Interestingly, soybean cultivars carrying the SCN-resistant rhg1 haplotypes encode an unusual α-SNAP protein, which does not bind well with NSF, disrupting vesicle trafficking and leading to the death of the cell. However, a gene encoding a novel form of NSF protein, found on chromosome 7, had a unique N-domain that mitigated both toxicity and poor NSF binding of rhg1 α-SNAPs during SCN resistance [25][27]. It was shown that resistant rhg1 soybean contained the unique NSFChr07 (termed NSFRAN07 for “Rhg1-associated NSF on chromosome 07”) while the susceptible ones contained the wild-type NSFChr07.
The molecular mechanism of soybean’s resistance to SCN was further explored by a group that identified two syntaxins of the t-SNARE (SNAP REceptor) family that interact with the α-SNAP protein [26][28]. The authors used yeast-two-hybrid assays in addition to knockout methods to confirm the role of two syntaxin1 genes, Syn12 and Syn16, in SCN resistance [26][28]. The importance of syntaxin and the SNARE regulon was also explored through a homologue of the defense regulon found in Arabidopsis thaliana containing syntaxin PENETRATION1, an ATP-binding cassette and a secreted glucosidase [27][29]. Previous studies showed callose as being present during the defense process in plants against different pathosystems through a process involving vesicle membrane proteins and syntaxins [28][30]. The authors suggested that since myosin and SNARE components function in defense against SCN, then callose synthesis may also play a role. The results of their experiments in both overexpressing and knocking out callose genes confirmed the role of callose in defense. This study allowed an expansion of the already known central defense role and vesicle trafficking, adding callose synthase to factors responsible for defense against SCN. Another group also identified a SNARE protein interacting with α-SNAP (GmSYP31A) [29][31]. Transgenic hairy root soybean plants overexpressing GmSYP31A in susceptible Williams 82 led to increased resistance to SCN while RNAi silencing of GmSYP31A led to SCN susceptibility in resistant lines. Further analysis utilizing green fluorescent protein (GFP) revealed endoplasmic reticulum-Golgi trafficking and exocytosis defects with overexpressed GmSYP31. It was suggested that the interaction of the secretory protein GmSYP31A and the voltage-dependent anion channel played a role in the vesicle trafficking pathway as well as in mitochondrial-mediated cell death, which led to SCN resistance. This area of research was also explored by other researchers who demonstrated that the Conversed Oligomeric Golgi (COG) complex plays a role in retrograde trafficking of many proteins, including syntaxins, which interact with the NSF α-SNAP protein conferring SCN resistance [30][32]. Overexpression of 14 out of the 16 COG genes in susceptible soybean cultivar showed SCN suppression by 50% or more. Additionally, altered expression levels of the COG genes had an impact on transcript abundance of syntaxin 31, and its involvement in SCN resistance [30][32].
Membrane trafficking modifications in resistant reactions caused by α-SNAP-syntaxin and subsequent interactions with the COG complex also influence exocytosis. A paper was published identifying 61 exocyst genes, some of which were differentially expressed in the syncytium during the defense response to SCN [31][33]. The authors then further dove into 9 recently identified MAPK genes involved in resistance and their involvement with exocyst genes [32][34]. This study demonstrated the importance of the tethering stage of vesicle transport and its role in defense against SCN, and also demonstrated that exocyst genes are controlled by MAPK genes. The importance of MAPKs in signal transduction highlighted interactions with several defense genes, including a homolog of a pathogenesis-related 1 gene (PR1-6) that is induced by GmMAPK4-1, which may explain how the MAPK4-1 gene functions in defense [32][34]. RNA-seq analysis of transgenic soybean lines overexpressing the nine MAPKs involved in SCN defense led to the identification of several differentially expressed genes implicated in the resistance reaction [33][35]. From those, 71 were found to have transcripts in SCN syncytia in soybean roots, of which 45 had no expression prior to SCN infection. Eight proteins also had secretion signals, including glycosyl hydrolases, endomembrane protein, galactose mutarotase-like, pathogenesis-related thaumatin, FASCICLIN-like arabinogalactan protein and peroxidase. Functional validations confirmed the roles of some of these genes in defense. Xyloglucan endotransglycosylase/hydrolase (XTH) was also found to be highly expressed in syncytia and reduced infection when artificially overexpressed in a susceptible cultivar [34][36]. The protein functions in cutting and rejoining xyloglucan (XyG) chains to allow cell expansion. Further analysis into the mechanism of how this protein functions in resistance is necessary; however, the authors identified that increasing XTH42 leads to a decrease in XyG chain length, while the decrease in XTH43 transcripts increase XyG chain length, which is believed to have a role on the cell wall and its ability to expand and form a syncytium.

#### 2.2. WI12

Recent findings within another component of rhg1, the wound-inducible domain protein (WI12) (GmWI12), suggested that the WI12Rhg1 protein interacts with DELLA proteins [35][37]. DELLA proteins are negative regulators of the gibberellic acid (GA) signalling pathway associated with the plant’s immune response and its survival [36][37][38,39]. The authors found that WI12 knockout roots reduced DELLA18 expression levels and that the two proteins directly interact based on yeast and plant experiments. A double knockout of DELLA18 and its homolog DELLA11 significantly increased the number of female nematodes on Peking roots. Finally, the authors also highlight the involvement of plant hormones GA, Jasmonic Acid (JA) and Salicylic Acid (SA), controlled by DELLA, in SCN resistance.

#### 2.3. SHMT

The GmSHMT08 gene at Rhg4 encodes a serine hydroxymethyltransferase (SHMT). The Peking-type lines (rhg1-a) are fully dependent on a specific allele of SHMT at Rhg4 for SCN resistance [38][40]. A series of GmSHMT08 mutants obtained by forward genetics screening confirmed that Peking-type is mechanistically different from PI 88788-type resistance [38][40]. It is now established that the Peking-type, rhg1-a allele (low copy number) has higher resistance due to its interaction with Rhg4 while the PI 88788-type rhg1-b allele (high copy number) does not benefit from the presence of Rhg4 [39][41]. This was confirmed by deep re-sequencing of 106 soybean accessions which were also challenged with five different SCN HG types [40][42]. At least 5.6 rhg1 copies were required for PI88788-type resistance which was independent of the Rhg4 haplotype. However, due to the presence of a retrotransposon within the α-SNAP protein and copy number dropping below 5.6 (1.9–3.5), the Rhg4 haplotype was necessary for resistance in Peking cultivars. SHMT catalyzes the conversion of L-serine to glycine and tetrahydrofolate to 5,10-methylenetetrahydrofolate. The resistant Rhg4 allele differs from the susceptible by two polymorphisms [20]. The SHMT structure was compared by applying homology modelling between susceptible and resistant lines without observing major structural changes, although a slight rotation in the small domain of the susceptible enzyme was noted [41][43]. Near the entrance of the THF-binding site, this structure includes a loop which in the resistant line looks disordered. The effects of this disordered loop were tested and appeared to severely impair binding affinity for folate. This step is important because folate is essential for SCN’s development since the nematode is unable to synthesize it. These results indicated that SCN resistance in relation to Rhg4 might be related to impairment in folate binding. However, the direct interaction of SHMT and α-SNAP was also proposed [42][44]. The products of both GmSHMT08 and GmSNAP18 were localized in the cytosol, supporting the hypothesis that the proteins could physically interact [42][43][44,45]. This interaction is thought to be facilitated by the presence of GmPR08-Bet VI, a pathogenesis-related protein (PR-10) which is known to bind to hormones, lipids and antibiotics which are bulky hydrophobic compounds [44][46]. Overexpression of this protein resulted in a 65% reduction in the number of cysts compared to control treatments. Bimolecular fluorescence complementation assays confirmed physical interactions between GmSHMT08 and GmPr08-Bet VI, which were enhanced when GmSNAP18 was also present. Interaction between rhg1 and Rhg4 in SCN resistance could therefore be the result of a multiprotein complex composed of GmSHMT08/GmSNAP18/GmPR08-Bet VI [45][47].

## 3. Wild Soybean as a Resistance Reservoir

Soybean’s wild relative, G. soja, has been studied mainly to understand soybean domestication, but its high genetic diversity is known to contain desirable traits for crop improvement, including SCN resistance [46][70]. GWAS was conducted on 1032 Glycine soja accessions in order to have a better understanding of wild soybean resistance against SCN [47][71]. Ten SNPs significantly associated with resistance to SCN were found on chromosomes 2, 4, 9, 16 and 18, three of which were previously identified, but none of which were among the rhg1 or Rhg4 QTLs. These regions contained 83 gene models, and some were compatible with plant resistance against disease including: calcium-dependent phospholipid-binding protein, NB-ARC domains containing protein, LRR protein, cytochrome P450, and ethylene-responsive element binding factor. One specific gene, Glyma.18G102600, an NB-ARC domain containing protein, located in a strong linkage disequilibrium block on Chr. 18 seemed highly promising. A transcriptomics database of the response of resistant and susceptible G. soja accessions to SCN was also created [48][72]. Another GWAS on G. soja lines identified SNPs on chromosomes 18 and 19 as being significantly associated with resistance to SCN (HG 2.5.7), as well as identified 58 gene candidates [49][73]. From these, 16 were related to disease resistance, encoding LRR proteins, ring/U-box, receptor-like protein, and MYB transcription factor. Other authors compared transcript expression of the resistant G. soja line NRS100 to the well-known G. max Williams 82 (susceptible) and Peking (resistant). The resistant G. soja (NRS100) did not show any significant differential expression at SHMT, SNAP paralog or SNAP18 which are found in rhg1 and Rhg4. The proposed defense mechanism in NRS100 included reduced JA signalling which allowed SA signals to induce a defense response, along with increased polyamine metabolism triggering H2O2 regulation and induction of PR proteins which defend the integrity of the cell walls and hinder pathogen invasion [50][74]. Finally, a cross between G. max and G. soja along with chromosome segment substitution lines (CSSLs) for QTL mapping of SCN resistance was performed [51][75]. Thirty-three QTLs were detected on 18 different chromosomes with high significance in relation to SCN resistance. The CSSLs combining positive alleles were highly resistant to SCN in absence of rhg1 and Rhg4. These studies shed light on the importance of G. soja germplasm and new strategies for resistance breeding.
Feedback