Genomics is the cornerstone of breeding, and studies based on whole-genome sequencing and genome-wide association study have greatly driven forward genomics-assisted breeding in many crops. Lotus (Nelumbo nucifera), under the Nelumbonaceae family, is one of the relict plants possessing important scientific research and economic values. As a basal eudicot species, lotus plays an essential role in studying plant evolution and phylogeny. It is adapted to the aquatic environment, while its relatives are shrubs or trees living on land. Water lily lies at the phylogenetic position of the base angiosperm and has similar living conditions and flowers. However, its genomes are vastly different. Lotus has unique features such as water-repellent self-cleaning function, multi-seed production, and flower thermogenesis, which may relate to flower protogyny or provide a warm environment for pollination.
Lotus is one of the relict plants retaining the original morphology of its ancestors, as well as Ginkgo biloba, Liriodendron, and Metasequoia glyptostroboides. It belongs to the Nelumbo genus of the Nelumbonaceae family, which includes two species, namely Asian lotus (Nelumbo nucifera Gaertn.) and American lotus (Nelumbo lutea Pear.). The two species are named for their different geographical distributions. Asian lotus is mainly distributed in Asia and the north of Oceania, while American lotus is distributed in North America and South America. The plant morphology differs between them. Asian lotus is a tall plant, with oval leaves and seeds, and red or white flower colors, whereas American lotus is a short plant, nearly round and with dark green leaves, spherical seeds, and yellow flowers [16][1]. There is no strict reproductive isolation between them, and the life cycles are similar at about five months. Asian lotus is commonly called lotus and has more than 3000 years of cultivation history as a horticultural crop [17][2]. Lotus seeds and rhizomes have rich nutritional value and unique health-care function. Lotus seeds contain starch, proteins, amino acids, polysaccharides, polyphenols, alkaloids, and mineral elements. Lotus rhizome has a high vitamin C content. During the long period of domestication and artificial selection, about 4500 lotus cultivars have been obtained up till the present [18][3]. These cultivars have been planted to produce edible vegetables, snacks, beverages, restorative materials, and ornamental flowers, which impact human life and economic development. The lotus industry is also important for rural revitalization in the Yangtze River, Pearl River, and Huang Huai river basins. The cultivated lotus is generally divided into rhizome lotus, seed lotus, and flower lotus based on their different usage. The notable feature of the rhizome lotus is the enlarged rhizome but with few flowers. It can be divided into power and crisp type according to the taste of the rhizome. Different varieties were bred to meet the taste of the different regions of people or for further usage. The main breeding goal of rhizome lotus is to improve the yield and quality of the rhizome. Seed lotus is mainly for lotus seed production, with high yield, good quality, and disease resistance being the breeding goals. Flower lotus is preferred for ornamental use, and it has distinct flower colors and shapes. During long cultivation, ornamental lotus with different flower morphologies were obtained, including few-petaled, double-petaled, petaloidy, and thousand-petalled flowers. Red, pink, yellow, and white are the main flower colors. Currently, the breeding objective is mainly aimed at flower shape and color, yield or quality of lotus seed and rhizome, and wide adaptability.
Items | Year | Sequencing Technology | Final Assembly (Mb) | Contig N50 | Number of Genes | Repeat Sequences | Ref. | |
---|---|---|---|---|---|---|---|---|
China Antique v1.0 | 2013 | Illumina, 454 | 804 | 38.8 Kb | 26685 | 57% | [20] | [4] |
Taizi | 2013 | Illumina Hiseq2000 | 792 | 39.3 Kb | 40348 | 49.48% (TEs) | [21] | [5] |
China Antique v2.0 | 2020 | Pacbio Sequel, Illumina | 821.2 | 484.3 Kb | 32124 | 58.50% | [23] | [7] |
Taikonglian NO.3 | 2022 | Nanopore | 807 | 5.1 Mb | 28274 | 63.11% | [24] | [8] |
American lotus | 2022 | Pacbio RSII, Hi-C | 843 | 1.34 Mb | 31382 | 81.00% | [25] | [9] |
With the advent of a new sequencing platform, the genome of “CA” was re-assembled using 11.9 Gb long-read data from PacBio Sequel, and 94.2 Gb previously sequenced short-read data [23][7]. The new assembly of “CA” is 807.6 Mb with the contig N50 being 484.3 Kb, which has significantly increased the quality of the genome. The ratio of repetitive sequence (58.5%) was similar to the first version. Moreover, a cultivated lotus, “Taikonglian NO. 3”, was also assembled using the Oxford Nanopore sequencing platform (57.9 Gb raw data) with the contig N50 being 5.1 Mb, and eight chromosomes were anchored based on high-throughput chromatin conformation capture (Hi-C) data [24][8]. Another lotus species, American lotus, was recently assembled using PacBio RSII (74.6 Gb raw data) and Hi-C (50.32 Gb raw data), and the total length is 843 Mb while contig N50 is 1.34 Mb [25][9].
Flower color, shape, and flowering time are important traits that determine the ornamental value of lotus. There are three different colors in lotus, red and white in Asian lotus and yellow in American lotus. The red color in Asian lotus is determined by the contents of anthocyanin [40[13][14],41], which is controlled by key enzyme encoding genes, and their regulating transcription factors (TFs) such as MYB, basic-Helix-Loop-Helix (bHLH), WD40 in its biosynthetic pathway. Among all the enzyme encoding genes in this pathway, NnANS and NnUFGT seem to be the decisive two genes [42,43][15][16]. Several TFs including 5 MYB, 2 bHLH, and one WD-repeat genes, may be involved in the regulation of anthocyanin biosynthesis in lotus based on a transcriptome analysis [43][16]. Among them, a bHLH gene NnTT8 was verified to regulate anthocyanin biosynthesis [44][17], whereas the yellow color of American lotus is determined by carotenoid, and no anthocyanin was detected [25,45][9][18].
The rhizome is the main edible part of lotus. It is important to explore the mechanisms underlying rhizome formation and expansion in rhizome lotus breeding. Comparative transcriptomic and proteomic analyses focusing on rhizome development have been conducted to dig out the key genes and pathways critical for the crucial physiological process [52,53,54][19][20][21]. Furthermore, re-sequencing of the natural and genetic F2 populations has also identified several genetic regions and candidate genes that might be involved in lotus rhizome enlargement [55][22]. A systematic analysis was conducted on one candidate gene CONSTANS-LIKE 5 (COL5). Functional analysis in the potato system indicated that NnCOL5 might be positively associated with rhizome enlargement by regulating the expression of CO-FT genes and the GA signaling pathway [56][23]. In addition, one SNP was identified in another candidate gene NnADAP of AP2 subfamily, which is closely associated with rhizome enlargement phenotype and the soluble sugar content [57][24]. There is a big difference between temperate and tropical lotus, especially the rhizome’s morphology.
Lotus seeds are rich in nutrients and functional compounds such as alkaloids, flavonoids, and polyphenols [58,59][25][26]. They are consumed “as both food and medicine” [60][27]. It is essential to increase the yield and nutrition of lotus seed. The main factors determining lotus seed yield are the seed size and the number of lotus seeds per seedpod. Transcriptome analysis on the cotyledon of “CA” and “Jianxuan-17 (JX-17)” seeds at different developmental stages identified 8437 differentially expressed genes (DEGs). Many DEGs are involved in the brassinosteroid biosynthesis pathway, and further analysis predicted two AGPase genes as candidate genes affecting lotus seed yield [61][28]. It seems that phytohormones are involved in lotus seed development. A combination of metabolomic and proteomic methods revealed that 15 DAP (Day After Pollination) was a switch time point from the physiological active to the nutrition accumulation stage [62][29]. Starch is the primary nutritional component in mature lotus seed [63][30]. Its contents and the proportion of amylose and amylopectin could largely determine the nutritional value and taste of lotus cotyledon, respectively. ADP-glucose pyrophosphorylase (AGPase) plays an important role in regulating starch biosynthesis.
The new varieties of lotus with high yield, wide adaptability, and stress resistance play a vital role in improving the economic value of this important horticulture crop. The variations identification, functional gene cloning, and metabolites alterations among diverse germplasm resources were investigated in the past decades, driven by the progressively improved genome information which could facilitate breeding practices in lotus (Figure 1). However, a high-quality reference genome is the limiting factor that will affect the molecular breeding process. Improvement of the lotus reference genome will be a requisite in the future, directly affecting the accuracy of molecular markers and the efficiency of cloning functional genes. Gapless reference genomes and pan-genomes have become the new reference, based on which plentiful information of genomes such as open chromatin and more variant information can be explored. With the explosive growth of large-omics data, deep learning can be used to mine biological information and decipher gene regulation networks.