Cereals are an important and major source of the human diet. They constitute more than two-thirds of the world’s food source and cover more than 56% of the world’s cultivatable land. These important sources of food are affected by a variety of damaging diseases, causing significant loss in annual production. In this regard, detection of diseases at an early stage and quantification of the severity has acquired the urgent attention of researchers worldwide. One emerging and popular approach for this task is the utilization of machine learning techniques.
1. Introduction
Advancements in the area of machine learning and computer vision in the past decade had had a profound effect on the utilization of machine learning techniques in different sectors
[1]. Machine learning approaches are being used from the medical
[2][3][4][5][6][7][8][9] to the security sector
[10]. Recently, many works
[11] have been undertaken on the application of machine learning in the agriculture sector for the detection of plant diseases, such as coffee
[12] and Enset
[13], Crop yield prediction
[14], quality and growth monitoring
[15][16], supply chain performance
[17], and water stress determination
[18].
Plants constitute 98% of the world’s diet, two-thirds of which are Cereals
[19]. The eight major kinds of cereal, wheat, maize, rice, barley, sorghum, oats, millets, and rye cover 56 percent of the world’s arable land. Wheat, maize, and rice account for 80% of global cereal production
[19]. Plant diseases are the major cause of global crop yield reduction, resulting in 10% loss of all the global food production
[20]. The major plant disease-causing pathogens are viruses, bacteria, Oomycetes, fungi, nematodes, and other parasitic plants
[20]. When infections occur to a large extent, losses to cereal crop production could reach as high as 50%
[21]. Many laboratory techniques are available for the identification and detection of plant pathogens
[20], but rapid and early detection is an important factor in the successful containment and control
[22].
2. Cereal Crops and Diseases
Cereal is a crop closely related to grass and that is cultivated for its seed and is consumed as food by humans
[23]. According to the Cereal Disease, Methodology Manual
[19], the eight major kinds of cereal, covering 56% percent of the world’s arable land are Wheat, Maize, Rice, Barley, Sorghum, Oats, Millet, and Rye.
2.1. Wheat
Wheat is the most dominant and important source of food for humans and livestock
[24]. It is the main ingredient in flour, which is used in the making of bread, biscuits, and pastry
[19]. Wheat is cultivated across all parts of the earth, from Russia in the northern hemisphere to Argentina in the south
[24]. Diseases pose a serious threat to the global production of wheat
[20]. Diseases on wheat are caused by a variety of pathogens. These are, Fungai, Viruses, Bacteria, Insects and Nematodes
[25]. Some of the commonly occurring wheat diseases are given in
Table 1.
Table 1. Some wheat disease types and causing pathogens
[25].
Pathogen
|
Disease
|
|
Leaf Rust (Brown Rust), Stem Rust (Black Rust),
|
|
Stripe Rust (Yellow Rust), Common Root Rot,
|
Fungus
|
Common and Dwarf Bunt (Stinking Smut),
|
|
Root Lesion Nematode |
|
2.2. Maize (Corn)
Maize is an important staple food crop that is grown all over the globe. It is the largest grown cereal per unit area, yielding 785 million tons annually
[26]. Besides being a source of food, maize, and its products are used as raw materials for many industrial applications. Maize is prone to many types of diseases caused by a variety of pathogens. Fungal pathogens are the major causes of maize disease, while bacterial and viral diseases are less common but pose a serious threat
[21][27]. Commonly occurring maize diseases are given in
Table 2.
Table 2. Some Maize disease types and causing pathogens
[25][27].
Table 3. Some Rice disease types and causing pathogens
[20][31].
Pathogen
|
Disease
|
Fungus
|
|
Fungus
| Gray leaf spot, Brown spot,
|
|
| Wheat Blast,
|
|
|
Virus
|
Barley stripe mosaic
|
Cereal tillering virus |
2.5. Sorghum
Sorghum is the fifth most important cereal crop after wheat, maize, rice, and barley
[35]. It is cultivated around the globe and used as a 0 source of food and energy, when used as a bio-fuel
[36]. Sorghum production is highly affected by fungal and viral diseases, at times causing around 28% loss in production
[37]. Some commonly occurring sorghum diseases are presented in
Table 5,
Table 6 and
Table 7.
Table 5. Some Sorghum disease types and causing pathogens
[20][37].
Some Rye disease types and causing pathogens
[20].
UAV system and photo-bike used for hyperspectral imaging of wheat farms
[43].
Summary of various wheat leaf disease datasets is presented in
Table 8.
Table 8. Performance comparison of selected studies on machine learning based wheat disease detection and corresponding datasets.
Citatation
|
Year
|
Data Type
|
# of Classes
|
Sample Size
|
Method
|
Accuracy %
|
Bao et al. [38]
|
Anthracnose, Leaf blight, Zonate leaf spot
|
| 2021
|
Image
|
3
|
360
|
SVM
|
93.3%
|
Tar spot, Charcoal rot
|
Sood et al. [45]
|
2020
|
Image
|
3
|
876
|
VGG16
|
99.07%
|
Rust, Gray leaf spot
|
Bacteria
|
Bacterial stripe
|
Fungus
|
Crown rust, Stem rust, Powdery mildew
|
Smut disease, Leaf blight
|
|
Mukhtar et al. [46]
|
] |
|
| 2019
|
Hyper-spectral
|
2
|
145
|
Linear Regression
|
0.75R2
|
Huang et al. [41]
|
2019
|
Hyper-spectral
|
2
|
89
|
SVM
|
85.7%
|
3.2. Machine Learning in Rice Disease Detection
Identification and classification of 12 types of rice leaf diseases using MobileNetV2 architecture and attention mechanism were proposed by Chen et al.
[52]. The MobileNetV2 architecture was pre-trained on the ImageNet dataset and fine-tuned by using the transfer learning approach on a smaller local dataset. The authors utilized Channel Attention Mechanism (CAM) to better learn the inter-channel relationships. For fine-tuning and testing their proposed model, the authors collected a total of 1100 images of healthy and disease rice leaves. These 660 were compiled from various sources on the internet and 440 were collected from the field. The proposed model achieved an average classification accuracy of 99.67%. Similarly, Wang et al.
[53] proposed a MobileNetv2 based approach for the classification of three types of rice leaf diseases by utilizing attention mechanism and Bayesian optimization. Model training and validation were performed on a public dataset of 2370 images belonging to three classes of rice disease and one healthy class. The authors achieved a classification accuracy of 94.65%.
Liang et al.
[54] proposed a convolutional neural network-based rice blast disease detection approach. The authors proposed two CNN architectures, the first network containing four convolutional layers, four max-pooling layers, and three fully connected layers, and ReLU after each layer (
Figure 3a) and a second network having the same convolutional layers and max-pooling layer structure as the first network, but with two additional fully connected layers as shown in (
Figure 3b). The two models were trained on a custom dataset of 5808 images of healthy and rice blast infected leaves. The dataset was collected on-site and is divided into 2906 positive (rice blast infected) and 2902 healthy images. The authors utilized 5-fold cross-validation and a selected the second model due to its inherent stability on small datasets and chieved an accuracy of 95.83%. The proposed approach was compared to hand-crafted approaches like Local Binary Patterns Histogram (LBPH), Haar-WT. The comparison result suggests that the proposed CNN method achieves superior feature extraction and classification results. A similar approach for the detection and classification of three classes of rice disease was proposed by Rahman et al.
[55]. The authors proposed a convolutional neural network trained on a dataset of 300 images containing three types of rice leaf disease (Brown spot, Leaf blight, and Hispa) and one healthy class. The model achieved a classification accuracy of 90%. This low classification accuracy is a result of the small dataset size the authors used and the lack of utilizing transfer learning. Ramesh et al.
[56] proposed a convolutional neural network approach for the detection of three classes of rice disease. The authors utilized HSV color space for the separation of background and foreground and the K-means algorithm for disease segmentation.
Figure 3. Deep Convolutional Neural Network architecture for the detection of rice blast
[54].
A random forest classifier for the detection and classification of three types of rice leaf disease was proposed by Saha and Ahsan
[57]. A local dataset compromising a total of 276 images of healthy and infected rice leaves was collected by the authors for testing and training their proposed algorithm. Feature extraction was implemented by using intensity moments. The proposed approach achieved a classification accuracy of 91.47%. A deep learning method for the detection of 15 different rice diseases was implemented by Chen et al.
[58]. The authors developed a deep learning architecture based on the fusion of existing DenseNet and Inception architectures. For testing the proposed model, the authors compiled a dataset consisting of 500 images belonging to 15 classes of rice disease. Their proposed model achieved a classification accuracy of 94.07%.
Summary of various rice leaf disease datasets is presented in
Table 9.
Table 9. Performance comparison of selected studies on machine learning based rice disease detection and corresponding datasets.
Citatation
|
Year
|
Data Type
|
# of Classes
|
Sample Size
|
Method
|
Accuracy %
|
Chen et al. [52]
|
2021
|
Image
|
12
|
1100
|
MobileNetV2
|
99.67%
|
|
7448 |
|
|
faster R-CNN |
|
| 96.21% |
|
3.3. Machine Learning in Maize Disease Detection
An Enhanced CNN for the detection of nine classes of maize leaf disease was proposed by Agarwal et al.
[63]. They proposed a convolutional neural network with receptive field enlargement to enhance the feature extraction performance of the CNN, which is required due to the complexity of maize leaf images. To accomplish this task, the authors collected a dataset of 500 images of maize leaves belonging to nine different classes of maize leaf disease at different stages. The performance of the proposed approach was compared to existing models like AlexNet and GoogleNet and provided an improved classification accuracy of 95.12%. Sibiya et al.
[64] developed a convolutional neural network for the detection of three different maize leaf diseases by using the Neuroph framework for the java programming language. The proposed approach gave a classification accuracy of 93.5%.
Barman et al.
[65] proposed a MobileNet architecture-based maize leaf disease detection that will be deployed on Android mobile devices. The authors utilized a transfer learning approach to fine-tune the pre-trained MobileNet architecture. For this task, they used a public dataset (PlantVillage) with a total of 3852 images of four different classes of maize leaf diseases. The proposed approach yielded an accuracy of 94.53%.
Hasan et al.
[66] proposed a hybrid network by combining a convolutional neural network and bi-directional LSTM for the detection of nine classes of maize leaf diseases. bi-LSTM was selected by the authors to better accelerate CNN’s classification accuracy and increase the co-relation among extracted features. Training of the model was performed on the PlantVillage dataset, which contains 2500 images of maize leaves affected by nine different types of diseases. They implemented various image augmentation techniques and increased the size of the dataset to 29,065 images. The proposed approach achieved a classification accuracy of 99.02%, exceeding existing deep learning methods.
Xu et al.
[67] proposed a multi-scale convolutional global pooling convolutional neural network based on the AlexNet and Inception architecture. The proposed model improves on the AlexNet architecture by replacing the last fully connected layer with a global pooling layer and adding a batch normalization layer. This is implemented to solve the low accuracy achieved and the large training data size required when utilizing transfer learning. Training and testing of the proposed model were performed on the PlantVillage dataset. The authors found that the proposed approach improves average precision by more than 2% when compared to AlexNet. A VGG16 deep learning architecture-based maize disease identification was proposed by Tian
[68]. In this work, a transfer learning approach was used to fine-tune the pre-trained VGG16 architecture on a dataset consisting of 7858 images of maize leaves affected by six types of diseases. The proposed method achieved a classification accuracy of 96.8%. Summary of various maize leaf disease datasets is presented in
Table 10.
Table 10. Performance comparison of selected studies on machine learning based maize disease detection and corresponding datasets.
Citatation
|
Year
|
Data Type
|
# of Classes
|
Sample Size
|
Method
|
Accuracy %
|
Agarwal et al. [52]
|
2021
|
Image
|
9
|
500
|
CNN
|
95.12%
|
Wang et al. [53]
|
Sibiya et al. [64]
|
2021
|
|
2019
Image
|
Image (PlantVillage)
3
|
9
2370
|
2500
MobileNetV2
|
94.65%
|
|
| CNN |
|
| 95.5% |
|
Liang et al. [54]
|
Barman et al. [65] |
2021
|
|
2019
Image
|
2021
Image
11
|
Image (PlantVillage)
1
440
|
9
5808
|
2500
MobileNet
|
92%
|
Kumar et.al [47]
|
2021
|
Virus
|
Streak disease
|
Table 6. Some Oats disease types and causing pathogens
|
Root rot, Crown rot, Snow mold
|
Bacteria
|
|
Tan Spot
|
|
Bacterial Stripe (Black Chaff),
|
Bacteria
|
Basal Glume Rot and Bacterial Leaf Blight,
|
|
Bacterial Spike Blight (Gummosis)
|
2.4. Barley
Barley is an important staple food cereal crop, although it is produced in much less quantity than wheat, maize, and rice
[19]. It is farmed in significant quantities in sub-Saharan countries like Ethiopia
[32], where barley adaptation to high altitude environments makes it an important source of food and beverages for millions of people
[33]. Barley is affected by over 80 different diseases caused by a variety of pathogens
[34]. Some of these are summarized in
Table 4.
Table 4. Some Barley disease types and causing pathogens
[19][20][34].
Pathogen
|
Disease
|
| Leaf brown spot, Rice blast, Sheath rot
|
Fungus
|
Stripe rust, Leaf rust, Stem rust |
Stripe Rust (Yellow Rust)
|
Common rust, Northern leaf blight
|
|
Powdery mildew, Downy mildew
|
Common rust, Smut,
|
| CNN |
|
| MobileNetV2 |
| 95.83%
|
|
| 93.5% |
Halo blight
|
Virus
|
|
Virus
|
Yellow dwarf
|
Fungus
|
Snow mold, Brown rust, Ergot
|
Eye spot, Sharp eyespot
|
Southern leaf blight, Smut
|
|
Northernl eaf blight, Southern leaf blight
|
Bacteria
|
Net blotch, Spot blotch, Stripe disease
|
Bacteria
|
Bacteria
|
Rahman et.al [55 |
Bacterial blight
|
]
|
Hasan et al. [66 |
Bacterial blightYellow dwarf
|
2021
|
]
| Image |
2020
|
Image
|
Image (PlantVillage)
1
3
|
|
9300
|
|
2500
CNN
|
LSTM
90%
|
| 99.02% |
|
Corn stunt disease
|
|
Virus
|
Rice tungro disease
|
Saha and Ahsan. [57] |
Stewart wilt
|
Barley Yellow Dwarf,
|
| 450 |
|
| CNN |
|
|
Xu et al. [67]
|
2021
|
2021
Image
|
Image (PlantVillage)
3
|
9
276
|
2500
CNN
|
TCI-ALEXN
91.47%
|
|
| 99.18% |
|
Yellow dwarf
|
Chen et al. [58]
|
2020
|
Tian [68]
|
|
2019Image |
|
|
15
|
500
|
DenseNet
|
94.07%
|
| Image (PlantVillage) |
|
9
|
2500
|
Bacterial stalk rot
|
Bacterial leaf strip
|
Virus
|
Leaf fleek
|
Virus
|
Barley Stripe Mosaic,
|
|
|
Wheat Streak Mosaic
|
Mosaic
|
|
Aphids, Stink Bugs,
|
Yellow dwarf
|
|
Cereail Leaf Beetle,
|
| 89.9% |
|
Tagel et al. [48]
|
2021
|
Image
|
3
|
1500
|
VGG19
|
99.38%
|
Mosaic
|
Hussain et al. [49]
|
2018
|
Image
|
4
|
8828
|
|
Insect
|
Thrips,
|
|
Hessian Fly, Wireworms,
|
|
Mites
|
Nematode
|
Seed Gall Nematode
|
Cereal Cyst Nematode
|
Root Knot Nematode
|
2.3. Rice
Rice is the second most-produced cereal crop in the world
[28]. It is the main source of food for billions of people in the world and is one of the primary food sources for the majority of people in Asia
[29] with around 500 metric tons
[30] of rice milled every year. Rice is susceptible to a variety of disease-causing pathogens that attack the leaf, the seed, the stem, and the root
[31], some are given in
Table 3.
AlexNet |
|
|
84.54% |
Powdery mildew, Stem rust, Glume blotch |
golden stripe
|
Table 7.
3. Machine Learning-Based Cereal Crop Disease Detection
3.1. Machine Learning in Wheat Disease Detection
Bao et al.
[38] applied elliptical-maximum margin criterion metric learning to the identification and severity estimation of powdery mildew and stripe wheat disease types. The researchers choose the E-MMC algorithm since it is better suited to finding nonlinear transformations in patterns, and their results show that it achieved superior results when compared to the SVM algorithm. For testing their algorithm, the researchers prepared a dataset from farms around the province of Beijing. In total, they collected 360 images. Disease spot segmentation was performed by using the Otsu thresholding algorithm and feature extraction using HSV histogram, Color moments for color attributes, and LBP and Gabor for texture attributes.
Identification of various wheat diseases using hyper-spectral image data were performed by
[39][40][41][42]. Identification of wheat powdery mildew disease using linear regression and an SVM (
Figure 1) classifier on hyper-spectral data ranging from 656 nm to 784 nm was implemented by Huang et al.
[40]. The authors employed the Relief-F algorithm to identify the best spectral bands and evaluation of the SVM algorithm was performed by k-fold cross-validation. In addition, Huang et al.
[41] proposed an SVM-based detection of Fusarium Head Blight on wheat heads using hyperspectral imagery. Here, Fishers Linear Discrimination (FLD) was implemented for dimensionality reduction. An in-field detection of yellow rust and fusarium head blight in wheat-based on the ground and UAV-based platforms was discussed by Bohnenkamp et al.
[43] (
Figure 2) and Xiao et al.
[44].
Figure 1. Flow chart for hyper-spectral image data analysis and processing for wheat rust detection
[40].
|
|
VGG16 |
|
| 96.8% |
|
Jiang et al. [50]
|
2017
|
Image
|
6
|
9230
|
VGG-FCN
|
97.95%
|
kamrul et al. [59]
|
2019
|
Image
|
2
|
284
|
InceptionV3
|
99%
|
Azadbakht et al. [51]
|
2019
|
Hyper-spectral
|
2
|
284
|
v-SVR
|
0.99R2
|
Hasan et al. [60]
|
2019
|
Image
|
9
|
1080
|
InceptionV3
|
97.5%
|
Huang et al. [40 |
Sethy et al. [61]
|
2020
|
Image
|
11
|
5932
|
SVM
|
98.38%
|
Zhou et al. [62]
|
2019
|
Image
|
3
|