Applications of Artificial Intelligence in Smart-Crop Breeding: Comparison
Please note this is a comparison between Version 2 by Jessie Wu and Version 1 by Hafeez Khan.

Artificial intelligence (AI) has emerged as a revolutionary field, providing a great opportunity in shaping modern crop breeding, and is extensively used indoors for plant science. Advances in crop phenomics, enviromics, together with the other “omics” approaches are paving ways for elucidating the detailed complex biological mechanisms that motivate crop functions in response to environmental trepidations. These “omics” approaches have provided plant researchers with precise tools to evaluate the important agronomic traits for larger-sized germplasm at a reduced time interval in the early growth stages. The big data and the complex relationships within impede the understanding of the complex mechanisms behind genes driving the agronomic-trait formations. AI brings huge computational power and many new tools and strategies for future breeding. 

  • artificial intelligence (AI)
  • crop breeding
  • genomics

1. Artificial Intelligence Technologies Benefiting Crop Breeding

Artificial intelligence uses computers and technology to simulate the human mind’s problem-solving and decision-making skills [11][1]. Artificial intelligence (AI), often known as machine intelligence, is an area of computer science that focuses on developing and managing technology, which can learn to make decisions and carry out activities independently without the need for human effort [12][2]. AI is a broad term that encompasses a wide range of technologies; it is a catch-all word for any software or hardware component that helps with machine learning, computer vision, natural language comprehension, and natural language processing (NLP) [13][3]. Traditional complementary metal-oxide-semiconductor (CMOS) hardware and the same fundamental computational processes that drive traditional software are used in today’s AI [14][4]. AI is the most rapidly emerging technology in computer science in today’s digital world, and it creates intelligent computers that replicate the intellect of the human mind [15][5]. For instance, the deep neural network (DNN), artificial neural network (ANN), random forest (RF), and support vector machine (SVM) are a few examples of machine-learning algorithms, as well as advanced hi-tech equipment such as the internet of things (IoT) [16][6]. AI is a fascinating hi-tech system that provides an endless opportunity as far as its agricultural applications are considered; hence, this opens up new frontiers for digital breeding [17][7]. Future AI generations are projected to inspire new sorts of brain-inspired circuits and architectures capable of making data-driven judgments faster and more precisely than humans can [18][8]. Furthermore, artificial intelligence, big data, machine learning, and data analytics are all terms that appear often in current academic and corporate writings that deal with data [19][9].
Big data, machine learning, and AI are some of the terms used to characterize modern computer processes [20][10]. Big data is concerned with the use of huge data of diverse types and complex structures that cannot be handled well when analyzed through classical approaches [21][11]. In this context, the AI trains a computer to perform jobs that are beyond human efforts, especially by considering the time and labor involved, and which are typically involved in decision-making in a variety of situations [22][12]. Machine learning (ML) is a branch of AI in which computers discover relationships from massive training datasets. For environment and weather applications, a simple definition is: firstly, big data involves the collection of meteorological or Earth System-related measurements, as well as high spatial and temporal resolution Earth System model (ESM) outputs for analysis; secondly, ML is the refining or discovery of new linkages between locations, times, and quantities in the datasets (e.g., where sea surface temperature features aid the weather prediction for months over land regions); thirdly, AI is a means of providing automatic warnings and guidance to society in the event of oncoming weather extremes, based on the links discovered by machine learning [23][13]. The current ease for application of ML methods due to improved computing capabilities is aided in part by the unique usage of computer graphics processing units (GPUs), with GPU speed improving at a quicker rate than ordinary central processing units [24][14]. This is an innovative use of computer memory to make calculations both more efficient and considerably closer to the data storage location [23][13]. The main emphasis of employing AI in breeding is that it complements the work of the breeder by guaranteeing continuous farm monitoring. Indeed, with the automation of farms and the generalization of data, breeders may dedicate more time to higher-value jobs by spending less time in their buildings. AI saves time in data identification and processing, which is of considerable benefit. Breeders and technical advisors acquire confidence and reactivity, allowing them to act when it is most appropriate [25][15].
AI technology has been used to accelerate the process of breeding new plant varieties, such as high-throughput genomics and phenomics to advanced breeding [15,26,27,28][5][16][17][18]. Increasingly, ML methodology has been used in genomic prediction, genomic selection, and marker-assisted selection [27,28][17][18]. Many agricultural companies such as Monsanto and John Deere have already invested hundreds of millions of dollars to develop such technologies that can utilize extensive data on soil type, seed variety, and weather to help farmers reduce costs and enhance yields [29][19]. Many of the same data sources, such as weather forecasts and Google Maps, are used to fuel both of their businesses. In addition, they may access farm equipment data that are wirelessly sent to the cloud [30][20]. As part of a precision-farming experiment in Romania, companies like Nippon Electric Company, Limited (NEC; headquartered in Minato, Tokyo, Japan) and Dacom (headquarter in Santa Clara, USA) employed environmental sensors and huge data analytics tools to increase yields. The use of current technologies and information systems enhances the overall productivity of agriculture [31][21]. Due to the agricultural data sets’ complexity, novel architecture and frameworks, algorithms, as well as the analytics face several obstacles in extracting the value and hidden information from this data [32][22]. The recent research on AI tools, including ML, deep learning, and predictive analysis intended toward increasing the planning, learning, reasoning, thinking, and action-taking abilities [33][23]. Plant Breeders are developing systems to aid in a better understanding of plant behavior under a variety of climatic situations [34][24]. Summit, the world’s most powerful supercomputer, was recently unveiled with the potential to hold 27,000 GPUs, paving the way for a bright future. AI has the potential to be a game-changer in the near future for bringing an agricultural revolution and global food security [35][25].

2. Exploring the Potential of Artificial AIIntelligence in Gene Function Analysis

The rapid development of high-throughput technologies in biological sciences has resulted in the generation of massive data in recent decades. Disciplines that attempt to collect and analyze enormous volumes of biological data are often referred to as “omics”, which is used to indicate the total quantity of DNA contained in each cell of an organism, with an additional flavour of openness to big challenges [47][26]. “Omics” data has become too large and complicated to be analyzed visually or by using statistical correlations. This has incited the use of so-called Machine Intelligence or AI which manages large amounts of data that are insurmountable for human minds, while extracting information that goes beyond our current understanding of the system under investigation and, most importantly, improving automatically based on the training data [48][27]. AI is already being used extensively in plant genomics and also possesses more future applications for in-depth genome exploration. A number of ML tools and algorithms are available for different kinds of bioinformatics analysis, such as protein-coding gene identification, cis-regulatory element identification, gene expression, subcellular location, protein-protein interaction, gene ontology, metabolic pathways, phenotypes, and genomic prediction (as reviewed by Mahood et al. (2020) [49][28]). In the not-too-distant future, AI is likely to be used to address a variety of plant science genomics concerns.AI algorithms might potentially be used to address comparative genomic investigations or information transfer from a model plant to a crop of interest [50][29]. DeepBind [51][30] and DeepSEA [52][31] are two models that have been created in recent years to predict and analyze genetic features [26][16]. Various sorts of expressions or sequencing data analysis can be thought of, with the goal of predicting gene functions or the differential effects of gene expression on a trait [53][32]. Although a significant amount of genomic data was produced as a result of the fruitful breakthroughs of high-throughput sequencing technology, the enormous amount of data generated creates a huge problem for storage and examination of the data [26][16]. The AI technology of bioinformatics enables the measurement of simultaneous expressions of a large number of genes, or even each and every gene that is included in the genome under a wide range of situations [54,55][33][34]. All of this combines to give biologists a more “relevant” representation of their data and the ability to integrate it, which enables them to examine their genomic data, test and confirm their assumptions throughout the experimental cycle, and ultimately improve their research [56,57][35][36].

3. Linking of Crop Genome to Phenome with Artificial AIntelligence

Currently, modern breeding approaches are focused on linking the genotype with the crop phenotype accurately and precisely. In advanced breeding, linking the whole of the genome information to high-throughput phenotypes remains a massive challenge, and is impeding the optimal application of field phenotyping and omics [15][5]. Germplasm collection and mapping populations can efficiently differentiate the phenomics and genomics data through AI. Crop diversity, single nucleotide polymorphisms (SNPs) detection and selection, quantitative trait loci (QTL) analysis, genome-wide association study (GWAS) analysis, and genomic selection and sequences generate a large amount of data; AI can evaluate and link the phenomics and genomics data from these big data to improve the breeding approaches. AI related to a computation and training model can predict the gene functional analysis and high-throughput crop phenotyping and also predict the performance of yield and traits of the crop [46,47,50,58,59][26][29][37][38][39]. Therefore, the integration of AI with phenomics and genomics tools can allow for rapid gene identification associated with the crop phenotypes that eventually accelerate crop improvement programs. In Figure 1, wresearchers summarize how to apply AI technology to link high-throughput genomics and phenomics, which can result in the production of better breeding strategies.
Figure 1. Artificial Intelligence used as a powerful tool for the prediction of high-throughput crop phenotyping and gene functional analysis in modern crop breeding. The high-throughput phenotypic and genotypic data were collected from large crop germplasm and breeding populations. The massive comprehensive database could integrate various resources with AI technology, such as phenotypic diversity of crops, SNPs polymorphisms, QTL analysis, GWAS analysis, genomics selection, and genome sequence. AI technologies are applied to predict the crop phenotype with whole genome prediction, the novel breeding strategies are produced through AI related to computation and training models.
Research on crop genomics is not only understanding the molecular mechanisms of phenotypes but also using technical data and bioinformatics techniques to analyse and understand the molecular mechanisms behind phenotypes [60][40]. To date, AI is a fascinating approach to bringing out these tasks inevitably [61][41]. AI approaches provide the platform to analyze huge, various, and useless datasets such as the generation of genome sequencing/photo imaging over conventional analytical strategies [15,62][5][42]. Recently, the AI approach has been explicitly employed in varied research fields of phenomics and genomics, such as: analysing genome assembly and genome-specific algorithms [26][16]; broad-range data analysis to mitigate multiplex biological complications in metabolomics, proteomics, genomics, transcriptomics, as well as systematic biology [62,63][42][43]; interpretation of gene expression cascades [64,65][44][45]; identification of significant SNPs in polyploid plants [66][46]; high-throughput crop stress phenotyping [41,67][47][48]. Scientists have employed AI and its developed models to modulate the flow of information from generic DNA to genetic-based phenotypes, to investigate the potential variants in natural populations [49][28]. More specifically, for breeders, AI will assist the further investigation of genetic loci to facilitate the agricultural output by triggering the genome algorithms and allowing high-throughput crop phenotyping in quantitative traits for open-field and controlled environments [49,68][28][49]. Additionally, AI can be cohesively combined with bioinformatics and genome sequencing analysis to interpret various molecular repertories such as transcription factor binding sites [69][50], long non-coding RNAs (lncRNAs) [70][51], microRNA (miRNAs), epistatic modifications, coding genes, targeted polyadenylation sites [71][52], as well as cis-regulatory elements (CREs) [49,72][28][53]. Various crop databases insert a huge amount of heterogeneous-related phenotypic and genotypic data (big data) recently providing insight into potential resources for breeders to untangle novel trait-identified candidate genes [73][54]. Luckily, AI provides a novel benchmark summary for analytical and computational methods for the integrated analysis of such enormous datasets based on the big-data spectrum [49,73][28][54]. In addition, employing AI to conclude the interrelations between candidate genes and CREs is a novel approach for categorizing and identifying previously unknown genes for significant crop improvements [74][55]. Furthermore, AI strategies have more potential for interpretation of the crop yield, variation in climatic assessment, high-throughput crop stress phenotyping, climate temperature, ultraviolent (UV) radiation, wind, and hail [26,73,75][16][54][56]. The role of AI is becoming more and more important in obtaining, analyzing, integrating, and managing genomic and phenomic data to increase agricultural climate resilience [76,77][57][58]. Next generation sequencing (NGS)-based genotyping methods have helped to improve gene-mapping resolution and gene identification and NGS-based genotyping for GWAS analysis has been used in crop improvement [68][49]. For example, in soybean, these kinds of studies have been widely used to identify genetic loci and candidate genes for seed weight [78][59], seed protein and oil contents [79][60], pod dehiscence [80][61], nitrogen fixation [81][62], soybean plant height and primary branches [81][62], agronomic traits [82][63], disease resistance [83][64], and tocopherol concentration [84][65]. Bulk segregant analysis (BSA) and its modified methodologies are currently used in many crops [85,86,87,88,89,90][66][67][68][69][70][71]. The NGS-based BSA is becoming a popular approach to identifying candidate genes for various traits, such as the soybean mosaic virus [91][72], charcoal rot resistance [92][73], flowering time [93][74], phytophthora resistance [94][75], and powdery-mildew resistance [95][76]. Recently, the deep-learning algorithm for BSA (DeepBSA) has been developed for QTL mapping and functional gene cloning in maize [96][77].

References

  1. Jarrahi, M.H. Artificial intelligence and the future of work: Human-AI symbiosis in organizational decision making. Bus. Horiz. 2018, 61, 577–586.
  2. Dwivedi, Y.K.; Hughes, L.; Ismagilova, E.; Aarts, G.; Coombs, C.; Crick, T.; Duan, Y.; Dwivedi, R.; Edwards, J.; Eirug, A. Artificial Intelligence (AI): Multidisciplinary perspectives on emerging challenges, opportunities, and agenda for research, practice and policy. Int. J. Inf. Manag. 2021, 57, 101994.
  3. Li, Y.; Zhou, X.; Li, G. Bridging natural language and graphical user Interfaces. In Artificial Intelligence for Human Computer Interaction: A Modern Approach; Springer: Berlin/Heidelberg, Germany, 2021; pp. 463–493.
  4. Kok, J.N.; Boers, E.J.; Kosters, W.A.; Van der Putten, P.; Poel, M. Artificial intelligence: Definition, trends, techniques, and cases. Artif. Intell. 2009, 1, 270–299.
  5. Harfouche, A.L.; Jacobson, D.A.; Kainer, D.; Romero, J.C.; Harfouche, A.H.; Mugnozza, G.S.; Moshelion, M.; Tuskan, G.A.; Keurentjes, J.J.B.; Altman, A. Accelerating climate resilient plant breeding by applying next-generation artificial intelligence. Trends Biotechnol. 2019, 37, 1217–1235.
  6. Baduge, S.K.; Thilakarathna, S.; Perera, J.S.; Arashpour, M.; Sharafi, P.; Teodosio, B.; Shringi, A.; Mendis, P.J.A.i.C. Artificial intelligence and smart vision for building and construction 4.0: Machine and deep learning methods and applications. Autom. Constr. 2022, 141, 104440.
  7. Montesinos-López, O.A.; Montesinos-López, A.; Crossa, J.; Gianola, D.; Hernández-Suárez, C.M.; Martín-Vallejo, J. Multi-trait, multi-environment deep learning modeling for genomic-enabled prediction of plant traits. G3 Genes Genomes Genet. 2018, 8, 3829–3840.
  8. Xu, Y.; Liu, X.; Cao, X.; Huang, C.; Liu, E.; Qian, S.; Liu, X.; Wu, Y.; Dong, F.; Qiu, C.-W. Artificial intelligence: A powerful paradigm for scientific research. Innovation 2021, 2, 100179.
  9. Duan, Y.; Edwards, J.S.; Dwivedi, Y.K. Artificial intelligence for decision making in the era of Big Data–evolution, challenges and research agenda. Int. J. Inf. Manag. 2019, 48, 63–71.
  10. Cravero, A.; Sepúlveda, S. Use and adaptations of machine learning in big data—Applications in real cases in agriculture. Electronics 2021, 10, 552.
  11. Supriya, M.; Deepa, A. Machine learning approach on healthcare big data: A review. Big Data Inf. Anal. 2020, 5, 58–75.
  12. Berente, N.; Gu, B.; Recker, J.; Santhanam, R. Managing artificial intelligence. MIS Q. 2021, 45, 1433–1450.
  13. Huntingford, C.; Jeffers, E.S.; Bonsall, M.B.; Christensen, H.M.; Lees, T.; Yang, H. Machine learning and artificial intelligence to aid climate change research and preparedness. Environ. Res. Lett. 2019, 14, 124007.
  14. Raschka, S.; Patterson, J.; Nolet, C. Machine learning in python: Main developments and technology trends in data science, machine learning, and artificial intelligence. Information 2020, 11, 193.
  15. Talaviya, T.; Shah, D.; Patel, N.; Yagnik, H.; Shah, M. Implementation of artificial intelligence in agriculture for optimisation of irrigation and application of pesticides and herbicides. Artif. Intell. Agric. 2020, 4, 58–73.
  16. Esposito, S.; Carputo, D.; Cardi, T.; Tripodi, P. Applications and trends of machine learning in genomics and phenomics for next-generation breeding. Plants 2020, 9, 34.
  17. Reinoso-Peláez, E.L.; Gianola, D.; González-Recio, O. Genome-enabled prediction methods based on machine learning. In Genomic Prediction of Complex Traits; Methods in Molecular Biology; Ahmadi, N., Bartholomé, J., Eds.; Humana: New York, NY, USA, 2022; Volume 2467.
  18. Crossa, J.; Pérez-Rodríguez, P.; Cuevas, J.; Montesinos-López, O.; Jarquín, D.; De Los Campos, G.; Burgueño, J.; González-Camacho, J.M.; Pérez-Elizalde, S.; Beyene, Y. Genomic selection in plant breeding: Methods, models, and perspectives. Trends Plant Sci. 2017, 22, 961–975.
  19. Faulkner, A.; Cebul, K.; McHenry, G. Agriculture Gets Smart: The Rise of Data and Robotics; Cleantech Agriculture Report; Cleantech Group: San Francisco, CA, USA, 2014.
  20. Stergiou, C.; Psannis, K.E. Recent advances delivered by Mobile Cloud computing and internet of things for big data applications: A survey. Int. J. Netw. Manag. 2017, 27, e1930.
  21. El Bilali, H.; Allahyari, M.S. Transition towards sustainability in agriculture and food systems: Role of information and communication technologies. Inf. Process. Agric. 2018, 5, 456–464.
  22. Priya, R.; Ramesh, D.J.S.C.I. ML based sustainable precision agriculture: A future generation perspective. Sustain. Comput. Inform. 2020, 28, 100439.
  23. Shaw, J.; Rudzicz, F.; Jamieson, T.; Goldfarb, A. Artificial intelligence and the implementation challenge. J. Med. Internet Res. 2019, 21, e13659.
  24. Jeong, J.H.; Resop, J.P.; Mueller, N.D.; Fleisher, D.H.; Yun, K.; Butler, E.E.; Timlin, D.J.; Shim, K.M.; Gerber, J.S.; Reddy, V.R.; et al. Random Forests for Global and Regional Crop Yield Predictions. PLoS ONE 2016, 11, e0156571.
  25. Streich, J.; Romero, J.; Gazolla, J.G.F.M.; Kainer, D.; Cliff, A.; Prates, E.T.; Brown, J.B.; Khoury, S.; Tuskan, G.A.; Garvin, M. Can exascale computing and explainable artificial intelligence applied to plant biology deliver on the United Nations sustainable development goals? Curr. Opin. Biotechnol. 2020, 61, 217–225.
  26. Jung, J.; Maeda, M.; Chang, A.; Bhandari, M.; Ashapure, A.; Landivar-Bowles, J. The potential of remote sensing and artificial intelligence as tools to improve the resilience of agriculture production systems. Curr. Opin. Biotechnol. 2021, 70, 15–22.
  27. Grapov, D.; Fahrmann, J.; Wanichthanarak, K.; Khoomrung, S. Rise of deep learning for genomic, proteomic, and metabolomic data integration in precision medicine. OMICS 2018, 22, 630–636.
  28. Mahood, E.H.; Kruse, L.H.; Moghe, G.D. Machine learning: A powerful tool for gene function prediction in plants. Appl. Plant Sci. 2020, 8, e11376.
  29. Caudai, C.; Galizia, A.; Geraci, F.; Le Pera, L.; Morea, V.; Salerno, E.; Via, A.; Colombo, T. AI applications in functional genomics. Comput. Struct. Biotechnol. J. 2021, 19, 5762–5790.
  30. Alipanahi, B.; Delong, A.; Weirauch, M.T.; Frey, B.J. Predicting the sequence specificities of DNA-and RNA-binding proteins by deep learning. Nat. Biotechnol. 2015, 33, 831–838.
  31. Zhou, J.; Troyanskaya, O.G. Predicting effects of noncoding variants with deep learning–based sequence model. Nat. Methods 2015, 12, 931–934.
  32. Dasari, C.M.; Amilpur, S.; Bhukya, R. Exploring variable-length features (motifs) for predicting binding sites through interpretable deep neural networks. Eng. Appl. Artif. Intell. 2021, 106, 104485.
  33. Lemmon, E.M.; Lemmon, A.R. High-throughput genomic data in systematics and phylogenetics. Annu. Rev. Ecol. Evol. Syst. 2013, 44, 99–121.
  34. Huang, D.W.; Sherman, B.T.; Lempicki, R.A. Bioinformatics enrichment tools: Paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009, 37, 1–13.
  35. Xia, E.H.; Li, F.D.; Tong, W.; Li, P.H.; Wu, Q.; Zhao, H.J.; Ge, R.H.; Li, R.P.; Li, Y.Y.; Zhang, Z.Z. Tea Plant Information Archive: A comprehensive genomics and bioinformatics platform for tea plant. Plant Biotechnol. J. 2019, 17, 1938–1953.
  36. Hu, H.; Scheben, A.; Edwards, D. Advances in integrating genomics and bioinformatics in the plant breeding pipeline. Agriculture 2018, 8, 75.
  37. Mochida, K.; Koda, S.; Inoue, K.; Hirayama, T.; Tanaka, S.; Nishii, R.; Melgani, F. Computer vision-based phenotyping for improvement of plant productivity: A machine learning perspective. GigaScience 2019, 8, giy153.
  38. Batley, J.; Edwards, D. The application of genomics and bioinformatics to accelerate crop improvement in a changing climate. Curr. Opin. Plant Biol. 2016, 30, 78–81.
  39. Wang, H.; Cimen, E.; Singh, N.; Buckler, E. Deep learning for plant genomics and crop improvement. Curr. Opin. Plant Biol. 2020, 54, 34–41.
  40. Altman, A.; Fan, L.; Foyer, C.; Cowling, W.; Mittler, R.; Qaim, M.; Weber, A.P.; Reynolds, M.; Varshney, R.K.; Fernie, A. Past and future milestones of plant breeding. Trends Plant Sci. 2021, 26, 530–538.
  41. Williamson, H.F.; Brettschneider, J.; Caccamo, M.; Davey, R.P.; Goble, C.; Kersey, P.J.; May, S.; Morris, R.J.; Ostler, R.; Pridmore, T. Data management challenges for artificial intelligence in plant and agricultural research. F1000Research 2021, 10, 324.
  42. Libbrecht, M.W.; Noble, W.S. Machine learning applications in genetics and genomics. Nat. Rev. Genet. 2015, 16, 321–332.
  43. Xu, C.; Jackson, S.A. Machine Learning and Complex Biological Data; Springer: Berlin/Heidelberg, Germany, 2019.
  44. Kwon, M.S.; Lee, B.T.; Lee, S.Y.; Kim, H.U. Modeling regulatory networks using machine learning for systems metabolic engineering. Curr. Opin. Biotechnol. 2020, 65, 163–170.
  45. Ni, Y.; Aghamirzaie, D.; Elmarakeby, H.; Collakova, E.; Li, S.; Grene, R.; Heath, L.S. A machine learning approach to predict gene regulatory networks in seed development in Arabidopsis. Front. Plant Sci. 2016, 7, 1936.
  46. Korani, W.; Clevenger, J.P.; Chu, Y.; Ozias-Akins, P. Machine learning as an effective method for identifying true single nucleotide polymorphisms in polyploid plants. Plant Genome 2019, 12, 180023.
  47. Selvaraj, M.G.; Valderrama, M.; Guzman, D.; Valencia, M.; Ruiz, H.; Acharjee, A. Machine learning for high-throughput field phenotyping and image processing provides insight into the association of above and below-ground traits in cassava (Manihot esculenta Crantz). Plant Methods 2020, 16, 87.
  48. Zhao, J.; Bodner, G.; Rewald, B. Phenotyping: Using machine learning for improved pairwise genotype classification based on root traits. Front. Plant Sci. 2016, 7, 1864.
  49. Razzaq, A.; Kaur, P.; Akhter, N.; Wani, S.H.; Saleem, F. Next-Generation Breeding Strategies for Climate-Ready Crops. Front. Plant Sci. 2021, 12, 620420.
  50. Long, P.; Zhang, L.; Huang, B.; Chen, Q.; Liu, H. Integrating genome sequence and structural data for statistical learning to predict transcription factor binding sites. Nucleic Acids Res. 2020, 48, 12604–12617.
  51. Sun, L.; Liu, H.; Zhang, L.; Meng, J. lncRScan-SVM: A tool for predicting long non-coding RNAs using support vector machine. PLoS ONE 2015, 10, e0139654.
  52. Gao, X.; Zhang, J.; Wei, Z.; Hakonarson, H. DeepPolyA: A convolutional neural network approach for polyadenylation site prediction. IEEE Access 2018, 6, 24340–24349.
  53. Ernst, J.; Kellis, M. Chromatin-state discovery and genome annotation with ChromHMM. Nat. Protoc. 2017, 12, 2478–2492.
  54. Tong, H.; Nikoloski, Z. Machine learning approaches for crop improvement: Leveraging phenotypic and genotypic big data. J. Plant Physiol. 2021, 257, 153354.
  55. Li, Y.; Shi, W.; Wasserman, W.W. Genome-wide prediction of cis-regulatory regions using supervised deep learning methods. BMC Bioinform. 2018, 19, 202.
  56. Crane-Droesch, A. Machine learning methods for crop yield prediction and climate change impact assessment in agriculture. Environ. Res. Lett. 2018, 13, 114003.
  57. Marsh, J.I.; Hu, H.; Gill, M.; Batley, J.; Edwards, D. Crop breeding for a changing climate: Integrating phenomics and genomics with bioinformatics. Theor. Appl. Genet. 2021, 134, 1677–1690.
  58. Shen, Y.; Zhou, G.; Liang, C.; Tian, Z. Omics-based interdisciplinarity is accelerating plant breeding. Curr. Opin. Plant Biol. 2022, 66, 102167.
  59. Jing, Y.; Zhao, X.; Wang, J.; Teng, W.; Qiu, L.; Han, Y.; Li, W. Identification of the genomic region underlying seed weight per plant in soybean (Glycine max L. Merr.) via high-throughput single-nucleotide polymorphisms and a genome-wide association study. Front. Plant Sci. 2018, 9, 1392.
  60. Li, D.; Zhao, X.; Han, Y.; Li, W.; Xie, F. Genome-wide association mapping for seed protein and oil contents using a large panel of soybean accessions. Genomics 2019, 111, 90–95.
  61. Hu, D.; Kan, G.; Hu, W.; Li, Y.; Hao, D.; Li, X.; Yang, H.; Yang, Z.; He, X.; Huang, F.J.; et al. Identification of loci and candidate genes responsible for pod dehiscence in soybean via genome-wide association analysis across multiple environments. Front. Plant Sci. 2019, 10, 811.
  62. Torkamaneh, D.; Chalifour, F.-P.; Beauchamp, C.J.; Agrama, H.; Boahen, S.; Maaroufi, H.; Rajcan, I.; Belzile, F. Genome-wide association analyses reveal the genetic basis of biomass accumulation under symbiotic nitrogen fixation in African soybean. Theor. Appl. Genet. 2020, 133, 665–676.
  63. Sonah, H.; O’Donoughue, L.; Cober, E.; Rajcan, I.; Belzile, F. Identification of loci governing eight agronomic traits using a GBS-GWAS approach and validation by QTL mapping in soya bean. Plant Biotechnol. J. 2015, 13, 211–221.
  64. Zhao, X.; Han, Y.; Li, Y.; Liu, D.; Sun, M.; Zhao, Y.; Lv, C.; Li, D.; Yang, Z.; Huang, L. Loci and candidate gene identification for resistance to Sclerotinia sclerotiorum in soybean (Glycine max L. Merr.) via association and linkage maps. Plant J. 2015, 82, 245–255.
  65. Sui, M.; Jing, Y.; Li, H.; Zhan, Y.; Luo, J.; Teng, W.; Qiu, L.; Zheng, H.; Li, W.; Zhao, X. Identification of loci and candidate genes analyses for tocopherol concentration of soybean seed. Front. Plant Sci. 2020, 11, 539460.
  66. Giovannoni, J.J.; Wing, R.A.; Ganal, M.W.; Tanksley, S.D. Isolation of molecular markers from specific chromosomal intervals using DNA pools from existing mapping populations. Nucleic Acids Res. 1991, 19, 6553–6558.
  67. Michelmore, R.W.; Paran, I.; Kesseli, R.V. Identification of markers linked to disease-resistance genes by bulked segregant analysis: A rapid method to detect markers in specific genomic regions by using segregating populations. Proc. Natl. Acad. Sci. USA 1991, 88, 9828–9832.
  68. Yang, Z.; Huang, D.; Tang, W.; Zheng, Y.; Liang, K.; Cutler, A.J.; Wu, W. Mapping of quantitative trait loci underlying cold tolerance in Rice seedlings via high-throughput sequencing of pooled extremes. PLoS ONE 2013, 8, e68433.
  69. Takagi, H.; Abe, A.; Yoshida, K.; Kosugi, S.; Natsume, S.; Mitsuoka, C.; Uemura, A.; Utsushi, H.; Tamiru, M.; Takuno, S.; et al. QTL-seq: Rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. Plant J. 2013, 74, 174–183.
  70. Zhou, H.; Tang, K.; Li, G.; Liu, W.; Yu, H.; Yuan, X.; Yang, S.; Bhattacharyya, M.K.; Feng, X. A robust and rapid candidate gene mapping pipeline based on M2 populations. Front. Plant Sci. 2021, 12, 681816.
  71. Majeed, A.; Johar, P.; Raina, A.; Salgotra, R.K.; Feng, X.; Bhat, J.A. Harnessing the potential of bulk segregant analysis sequencing and its related approaches in crop breeding. Front. Genet. 2022, 13, 944501.
  72. Yang, Q.; Jin, H.; Yu, X.; Fu, X.; Zhi, H.; Yuan, F. Rapid identification of soybean resistance genes to soybean mosaic virus by SLAF-seq bulked segregant analysis. Plant Mol. Biol. Rep. 2020, 38, 666–675.
  73. da Silva, M.P.; Zaccaron, A.Z.; Bluhm, B.H.; Rupe, J.C.; Wood, L.; Mozzoni, L.A.; Mason, R.E.; Yingling, S.; Pereira, A. Bulked segregant analysis using next-generation sequencing for identification of genetic loci for charcoal rot resistance in soybean. Physiol. Mol. Plant Pathol. 2020, 109, 101440.
  74. Watanabe, S.; Tsukamoto, C.; Oshita, T.; Yamada, T.; Anai, T.; Kaga, A. Identification of quantitative trait loci for flowering time by a combination of restriction site–associated DNA sequencing and bulked segregant analysis in soybean. Breed. Sci. 2017, 17013.
  75. Cheng, Y.; Ma, Q.; Ren, H.; Xia, Q.; Song, E.; Tan, Z.; Li, S.; Zhang, G.; Nian, H. Fine mapping of a Phytophthora-resistance gene RpsWY in soybean (Glycine max L.) by high-throughput genome-wide sequencing. Theor. Appl. Genet. 2017, 130, 1041–1051.
  76. Jiang, B.; Li, M.; Cheng, Y.; Cai, Z.; Ma, Q.; Jiang, Z.; Ma, R.; Xia, Q.; Zhang, G.; Nian, H. Genetic mapping of powdery mildew resistance genes in soybean by high-throughput genome-wide sequencing. Theor. Appl. Genet. 2019, 132, 1833–1845.
  77. Li, Z.; Chen, X.; Shi, S.; Zhang, H.; Wang, X.; Chen, H.; Li, W.; Li, L. DeepBSA: A deep-learning algorithm improves bulked segregant analysis for dissecting complex traits. Molecular Plant 2022, 15, 1418–1427.
More
Video Production Service