The Principles and Characteristics of NIR Spectroscopy: Comparison
Please note this is a comparison between Version 2 by Sirius Huang and Version 1 by Jun Zhang.

Spectroscopy technology is nowa popular methods for quantitative and qualitative analysis in fields such as agricultural products and foods by combining with various chemometric methods. In fact, this is the application basis for spectroscopy and spectral imaging techniques in other fields such as genetics and transgenic monitoring. There has been considerable research using spectroscopy (especially near infrared (NIR) spectroscopy) for the effective identification of agricultural products and foods. The principles and characteristics of NIR spectroscopy and its applications in the detection of transgenic agricultural products and foods are described.

  • chemometric analysis
  • transgenic agricultural products and foods
  • near-infrared spectroscopy

1. The Principles and Characteristics of NIRS

Near infrared spectroscopy (NIRS), with a wavelength range between 780 and 2500 nm, can be divided into short-wave NIR (with a range of 780–1100 nm) and long-wave NIR (with a range of 1100–2500 nm) [3][1], and is sometimes used together with a range of 350–780 nm visible range light to form a Vis-NIR spectrum for relevant detection. The state, composition, and structure of the molecule can be obtained by analyzing the primary overtones and oscillations between the hydrogen-containing groups, such as C-H, N-H, O-H, etc., by NIRS [4][2]. The common near-infrared spectrometers consist of a light source, a beam splitter system (wavelength selector), a sample detector, and an optical detector, and some are equipped with a data processing/analysis system for simplicity. The use of these parts should be chosen according to their use. NIR spectroscopy has transmission, diffuse reflection, transmission and reflection detection methods, and the choice of different detection methods is also demand-dependent.
After data acquisition with the spectrometer, the general steps for spectral analysis include: (1) spectral data preprocessing [5][3]; (2) feature wavelength selection [6][4]; (3) model establishment and evaluation [7][5]. The main analysis steps are shown in Figure 1.
Figure 1.
The general analyzing steps of NIRS data.

1.1. The Spectral Preprocessing Methods

The sample spectrum data collected by the spectrometer contains only not the chemical information of the sample itself, but also other irrelevant information and noise, such as electrical noise, sample background, stray light, etc. [5][3]. Therefore, in the application of chemometric methods for spectral analysis, it is necessary to preprocess the original spectral data to eliminate the irrelevant information and noise in the data, which is a necessary step in the analysis.
Smoothing, derivative, multiple scattering correction (MSC), baseline correction, standard normal transformation (SNV), orthogonal signal correction (OSC), and combinations of these methods are common spectral preprocessing methods [8,9,10][6][7][8].
Smoothing preprocessing is one of the most widely used methods for removing spectral noise. Moving average smoothing (MS) and Savitzk-Golay (SG) smoothing are commonly used smoothing methods. Derivative preprocessing is to eliminate the baseline offset and drift, enhance the spectral band features, and overcome the spectral band overlap [11][9]. The direct difference method and SG derivative method are the commonly used derivative preprocessing methods [12][10]. Baseline correction pretreatment successfully eliminates baseline drift and tilt caused by the instrument’s backdrop and the uneven surface of the sample by artificially pulling the baseline of the absorbance spectrum back to 0 baseline [13][11]. Multiple scattering correction (MSC) preprocessing is mainly used to eliminate the effect of scattering on the spectrum and effectively enhance the spectral information related to the content of sample components; the spectral errors due to factors such as optical path changes or sample dilution can be eliminated [4,14][2][12]. Standard normal variate transformation (SNV) is the processing of spectral data with a mean value of 0 and a standard deviation of 1 [15][13]. Orthogonal signal correction (OSC) is a spectral preprocessing algorithm based on the involvement of physical and chemical values of samples [16,17][14][15]. In order to improve the robustness and prediction ability of the model, the information unrelated to the physical and chemical values of spectral data is removed by orthogonal projection and then analyzed by corresponding modeling methods.
In NIRS preprocessing analysis, MSC and SNV are two well-known methods for reducing spectral distortion due to dispersion, and they have been proven to be effective in correcting the problems of inhomogeneous particle distribution and refractive index variation in food applications [18,19][16][17]. Although these preprocessing methods were aimed to reduce unmodeled variability in the spectra data in order to improve the features sought in the spectra, which are usually linearly related to the phenomenon of interest. However, if incorrect preprocessing techniques are used, the essential information may at risk of information removal [10][8].

1.2. The Feature Wavelength Selection Methods

When all wavelength variables are used for modeling, it may be computationally intensive and time consuming, and sometimes the absorption of NIR spectra is not obvious and the overlap is serious, which contains redundant information, so it is normal to eliminate the irrelevant information and filter out the independent variables with high correlation when modeling. When the useless variable is introduced into the model, it will affect model stability and prediction precision. Therefore, it is necessary to extract the feature wavelength variables from the full spectrum before modeling. At present, the commonly used methods for selecting the characteristic wavelengths [6,20,21][4][18][19] include principal component analysis (PCA), competitive adaptive reweighting (CARS), the genetic algorithm (GA), the successive projection algorithm (SPA), and uninformative variable elimination (UVE), etc.
PCA is a popular linear dimensionality reduction approach that is used to map high-dimensional data into a low-dimensional space using some type of linear projection. It is expected that the variance of the projected dimension is the largest, so that fewer data dimensions can be used and more original data points can be retained, which can reduce dimension and eliminate redundant information [4][2]. CARS is a variable selection method proposed to simulate the “survival of the fittest” principle in Darwin’s evolution theory [22][20]. The idea of GA is to optimize the PLSR model based on the RMSECV of selected variables by genetic iteration [23,24][21][22]. SPA is a method to improve modeling speed and prediction accuracy by reducing the covariance between variables and obtaining the wavelength with the least redundant information [6,25][4][23]. UVE is a wavelength selection algorithm based on the PLSR coefficients, which is used to eliminate the full-wavelength variables, the stability of which is less than the noise, thereby improving the predictive power of the model [26][24]. Sometimes, one feature wavelength selected algorithm is used, and the modeling effect is not very effective, and is therefore often used in combination with other feature wavelength selection methods [26][24].

1.3. Model Establishment and Evaluation

For NIR spectroscopy, a calibration model of the spectra is finally established in a linear and nonlinear way for qualitative or quantitative analysis after the pretreatment or feature wavelength selection.
With the rapid development of statistics, it is an inevitable trend to use mathematical analysis methods [7][5] for more scientific classification and quantitative detection, which can be linear, non-linear, or supervised or unsupervised modes. The common qualitative and quantitative methods are k-nearest (KNN) [27][25], linear discriminant analysis (LDA) [11[9][26],28], partial least squares discriminant analysis (PLS-DA) [29[27][28],30], extreme learning machine (ELM) [27][25], Support vector machine (SVM) [12[10][29][30],31,32], back propagation neural network (BPNN) [33][31], partial least squares regression (PLSR) [34,35][32][33], and radial basis function neural network (RBFNN) [27][25], etc.
After the model is established, the stability and accuracy of the model is evaluated, and the high-quality correction model is selected. Indicators often employed include accuracy, correlation coefficient, standard deviation of calibration and prediction set samples, etc.
NIRS and chemometrics methods are a pair of twin technologies that have been developing in tandem with each other. In recent years, deep learning algorithms, represented by convolutional neural networks (CNN), have been used for quantitative and qualitative modeling of NIR spectra [36,37][34][35]. Compared with traditional machine learning methods, the convolutional neural network can extract the features embedded in the spectral data step by step through multiple convolution and pooling layers, and to a certain extent, the preprocessing of spectra and the selection of variables before modeling can be reduced.
Among the most popular deep learning-based models, the DeepSpectra model has outperformed all the other models [38][36]. The combination of deep learning and spectral detection methods is a promising approach for the quality assessment of food and agricultural products, as well as for genetic modification detection [38,39][36][37].

2. The Applications of NIRS for the Detection of Transgenic Agricultural Products and Foods

In the last few decades, NIRS has demonstrated its power in the detection of agricultural products and foods, and there are now a series of applications in meat detection [40][38], agricultural materials and foods safety control [41,42[39][40][41],43], and fruits and vegetables detection [44,45][42][43]. Taking maize testing as an example, NIRS has been used in a range of applications in the identification of variety purity identification [27[25][44][45],46,47], vigor [48][46], internal components such as moisture and protein [49[47][48][49],50,51], fungal toxins [52[50][51],53], and frost damage [54][52]. Today, there is also equipment that can be used for online monitoring of agricultural products and foods using handheld/portable NIR spectroscopy for industrial applications [55,56][53][54].
With the in-depth study of spectroscopy technology, researchers have started to introduce NIRS technology into the identification of transgenic food and agricultural products, as shown in Table 1
Table 1.
Studies on the detection of transgenic agricultural products and foods using near-infrared spectroscopy.
 

References

  1. Feng, X.; Zhao, Y.; Zhang, C.; Cheng, P.; He, Y. Discrimination of transgenic maize kernel using nir hyperspectral imaging and multivariate data analysis. Sensors 2017, 17, 1894.
  2. Zhang, J.; Dai, L.; Cheng, F. Classification of frozen corn seeds using hyperspectral vis/nir reflectance imaging. Molecules 2019, 24, 149.
  3. Candolfi, A.; De Maesschalck, R.; Jouan-Rimbaud, D.; Hailey, P.A.; Massart, D.L. The influence of data pre-processing in the pattern recognition of excipients near-infrared spectra. J. Pharm. Biomed. Anal. 1999, 21, 115–132.
  4. Liu, D.; Sun, D.; Zeng, X. Recent advances in wavelength selection techniques for hyperspectral image processing in the food industry. Food Bioprocess Technol. 2014, 7, 307–323.
  5. Ramirez, C.A.M.; Greenop, M.; Ashton, L.; Rehman, I.U. Applications of machine learning in spectroscopy. Appl. Spectrosc. Rev. 2021, 8–10, 733–763.
  6. Lee, L.C.; Liong, C.; Jemain, A.A. A contemporary review on data preprocessing (dp) practice strategy in atr-ftir spectrum. Chemom. Intell. Lab. Syst. 2017, 163, 64–75.
  7. Jiao, Y.; Li, Z.; Chen, X.; Fei, S. Preprocessing methods for near-infrared spectrum calibration. J. Chemom. 2020, 34, e3306.
  8. Rinnan, A.; Berg, F.V.D.; Engelsen, S.B. Review of the most common pre-processing techniques for near-infrared spectra. TrAC Trends Anal. Chem. 2009, 28, 1201–1222.
  9. Sohn, S.; Pandian, S.; Oh, Y.; Zaukuu, J.Z.; Na, C.; Lee, Y.; Shin, E.; Kang, H.; Ryu, T.; Cho, W.; et al. Vis-NIR Spectroscopy and Machine Learning Methods for the Discrimination of Transgenic Brassica napus L. and Their Hybrids with B. juncea. Processes 2022, 10, 240.
  10. Cheng, P.; Xuping, F.; Yong, H.; Chu, Z.; Yiying, Z.; Junfeng, X. Discrimination of transgenic maize containing the cry1ab/cry2aj and g10evo genes using near infrared spectroscopy (nir). Spectrosc. Spectr. Anal. 2018, 38, 1095–1100.
  11. Workman, J.J. Review of Process and Non-invasive Near-Infrared and Infrared Spectroscopy: 1993–1999. Appl. Spectrosc. Rev. 1999, 34, 1–89.
  12. Xie, L.; Ying, Y.; Ying, T. Combination and comparison of chemometrics methods for identification of transgenic tomatoes using visible and near-infrared diffuse transmittance technique. J. Food Eng. 2007, 82, 395–401.
  13. Xie, L.; Ying, Y.; Ying, T.; Yu, H.; Fu, X. Discrimination of transgenic tomatoes based on visible/near-infrared spectra. Anal. Chim. Acta 2007, 584, 379–384.
  14. Wold, S.; Antti, H.; Lindgren, F.; öhman, J. Orthogonal signal correction of near-infrared spectra. Chemom. Intell. Lab. Syst. 1998, 44, 175–185.
  15. Wenchao, Z.; Fang, C. Analysis of transgenic and non-transgenic rice leaves using visible/near-infrared spectroscopy. Spectrosc. Spectr. Anal. 2012, 32, 370–373.
  16. Hashimoto, A.; Kameoka, T. Applications of infrared spectroscopy to biochemical, food, and agricultural processes. Appl. Spectrosc. Rev. 2008, 43, 416–451.
  17. Zinia Zaukuu, J.; Aouadi, B.; Lukács, M.; Bodor, Z.; Vitális, F.; Gillay, B.; Gillay, Z.; Friedrich, L.; Kovacs, Z. Detecting low concentrations of nitrogen-based adulterants in whey protein powder using benchtop and handheld nir spectrometers and the feasibility of scanning through plastic bag. Molecules 2020, 25, 2522.
  18. Dai, Q.; Cheng, J.H.; Sun, D.W.; Zeng, X.A. Advances in feature selection methods for hyperspectral image processing in food industry applications: A review. Crit. Rev. Food Sci. Nutr. 2015, 55, 1368–1382.
  19. Li, H.; Liang, Y.; Xu, Q.; Cao, D. Model population analysis for variable selection. J. Chemom. 2010, 24, 418–423.
  20. Li, H.; Liang, Y.; Xu, Q.; Cao, D. Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. Anal. Chim. Acta 2009, 648, 77–84.
  21. Niazi, A.; Leardi, R. Genetic algorithm in chemometrics. J. Chemom. 2012, 26, 345–351.
  22. Depczynski, U.; Frost, V.J.; Molt, K. Genetic algorithms applied to the selection of factors in principal component regression. Anal. Chim. Acta 2000, 420, 217–227.
  23. Araújo, M.C.U.; Saldanha, T.C.B.; Galvão, R.K.H.; Yoneyama, T.; Chame, H.C.; Visani, V. The successive projections algorithm for variable selection in spectroscopic multicomponent analysis. Chemom. Intell. Lab. Syst. 2001, 57, 65–73.
  24. Centner, V.; Massart, D.L.; de Noord, O.E.; de Jong, S.; Vandeginste, B.M.; Sterna, C. Elimination of uninformative variables for multivariate calibration. Anal. Chem. 1996, 68, 3851–3858.
  25. Feng, X.; Yin, H.; Zhang, C.; Peng, C.; He, Y. Screening of transgenic maize using near infrared spectroscopy and chemometric techniques. Span. J. Agric. Res. 2018, 16, e203.
  26. Sohn, S.; Pandian, S.; Zaukuu, J.Z.; Oh, Y.; Park, S.; Na, C.; Shin, E.; Kang, H.; Ryu, T.; Cho, W.; et al. Discrimination of transgenic canola (brassica napus L.) And their hybrids with b. Rapa using vis-nir spectroscopy and machine learning methods. Int. J. Mol. Sci. 2022, 23, 220.
  27. Mata, M.M.D.; Rocha, P.D.; Farias, I.K.T.D.; Silva, J.L.B.D.; Medeiros, E.P.; Silva, C.S.; Simões, S.D.S. Distinguishing cotton seed genotypes by means of vibrational spectroscopic methods (nir and raman) and chemometrics. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2022, 266, 120399.
  28. Lee, J.H.; Choung, M. Nondestructive determination of herbicide-resistant genetically modified soybean seeds using near-infrared reflectance spectroscopy. Food Chem. 2011, 126, 368–373.
  29. Jianguo, Z.; Yajing, W.; Zhiqin, Y.; Leiying, X.; Na, W.; Duo, C. Research on fast identification of transgenic oil based on near infrared spectroscopy. Opt. Instrum. 2020, 42, 61–66.
  30. Zhang, J.; Dai, L.; Cheng, F. Identification of corn seeds with different freezing damage degree based on hyperspectral reflectance imaging and deep learning method. Food Anal. Meth. 2021, 14, 389–400.
  31. Jiang, W.; Furong, H.; Caihuan, H.; Jun, Z.; Xingdan, C. Study on near infrared spectroscopy of transgenic soybean identification based on principal component analysis and neural network. Spectrosc. Spectr. Anal. 2013, 33, 1537–1541.
  32. Hattori, T.; Murakami, S.; Mukai, M.; Yamada, T.; Hirochika, H.; Ike, M.; Tokuyasu, K.; Suzuki, S.; Sakamoto, M.; Umezawa, T. Rapid analysis of transgenic rice straw using near-infrared spectroscopy. Plant Biotechnol. 2012, 29, 359–366.
  33. Xie, L.; Ying, Y.; Ying, T. Rapid determination of ethylene content in tomatoes using visible and short-wave near-infrared spectroscopy and wavelength selection. Chemom. Intell. Lab. Syst. 2009, 97, 141–145.
  34. Lecun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444.
  35. Yang, J.; Xu, J.; Zhang, X.; Wu, C.; Lin, T.; Ying, Y. Deep learning for vibrational spectral analysis: Recent progress and a practical guide. Anal. Chim. Acta 2019, 1081, 6–17.
  36. Zhang, X.; Lin, T.; Xu, J.; Luo, X.; Ying, Y. Deepspectra: An end-to-end deep learning approach for quantitative spectral analysis. Anal. Chim. Acta 2019, 1058, 48–57.
  37. Zhang, X.; Yang, J.; Lin, T.; Ying, Y. Food and agro-product quality evaluation based on spectroscopy and deep learning: A review. Trends Food Sci. Technol. 2021, 112, 431–441.
  38. Prieto, N.; Roehe, R.; Lavín, P.; Batten, G.; Andrés, S. Application of near infrared reflectance spectroscopy to predict meat and meat products quality: A review. Meat Sci. 2009, 83, 175–186.
  39. Qu, J.H.; Liu, D.; Cheng, J.H.; Sun, D.W.; Ma, J.; Pu, H.; Zeng, X.A. Applications of near-infrared spectroscopy in food safety evaluation and control: A review of recent research advances. Crit. Rev. Food Sci. Nutr. 2015, 55, 1939–1954.
  40. Fu, X.; Ying, Y. Food safety evaluation based on near infrared spectroscopy and imaging: A review. Crit. Rev. Food Sci. Nutr. 2016, 56, 1913–1924.
  41. Huang, H.; Yu, H.; Xu, H.; Ying, Y. Near infrared spectroscopy for on/in-line monitoring of quality in foods and beverages: A review. J. Food Eng. 2008, 87, 303–313.
  42. Nicolaï, B.M.; Beullens, K.; Bobelyn, E.; Peirs, A.; Saeys, W.; Theron, K.I.; Lammertyn, J. Nondestructive measurement of fruit and vegetable quality by means of nir spectroscopy: A review. Postharvest Biol. Technol. 2007, 46, 99–118.
  43. Chandrasekaran, I.; Panigrahi, S.S.; Ravikanth, L.; Singh, C.B. Potential of near-infrared (nir) spectroscopy and hyperspectral imaging for quality and safety assessment of fruits: An overview. Food Anal. Methods 2019, 12, 2438–2458.
  44. Ge, W.; Zhang, L.; Li, X.; Zhang, C.; Sun, M.; An, D.; Wu, J. Applying multimodal data fusion based on manifold learning with nuclear magnetic resonance (nmr) and near infrared spectroscopy (nirs) to maize haploid identification. Biosyst. Eng. 2021, 210, 299–309.
  45. Cui, Y.; Xu, L.; An, D.; Zhang, X.; Zhu, D. Identification of maize seed varieties based on near infrared reflectance spectroscopy and chemometrics. Int. J. Agric. Biol. Eng. 2018, 11, 177–183.
  46. Qiu, G.; Lü, E.; Lu, H.; Xu, S.; Zeng, F.; Shui, Q. Single-kernel ft-nir spectroscopy for detecting supersweet corn (Zea mays L. Saccharata sturt) seed viability with multivariate data analysis. Sensors 2018, 18, 1010.
  47. Egesel, C.Ö.; Kahrıman, F.; Ekinci, N.; Kavdır, O.; Büyükcan, M.B. Analysis of fatty acids in kernel, flour, and oil samples of maize by nir spectroscopy using conventional regression methods. Cereal Chem. J. 2016, 93, 487–492.
  48. Rosales, A.; Galicia, L.; Oviedo, E.; Islas, C.; Palacios-Rojas, N. Near-infrared reflectance spectroscopy (nirs) for protein, tryptophan, and lysine evaluation in quality protein maize (qpm) breeding programs. J. Agric. Food Chem. 2011, 59, 10781–10786.
  49. Zhong, J.; Qin, X. Rapid quantitative analysis of corn starch adulteration in konjac glucomannan by chemometrics-assisted ft-nir spectroscopy. Food Anal. Methods 2016, 9, 61–67.
  50. Levasseur-Garcia, C.; Bailly, S.; Kleiber, D.; Bailly, J.; Pan, L. Assessing risk of fumonisin contamination in maize using near-infrared spectroscopy. J. Chem. 2015, 2015, 485864.
  51. Darnell, R.E.; Harvey, J.J.; Fox, G.P.; Fletcher, M.T.; Wainaina, J.; Wanjuki, I.; Turner, W.J. Nirs calibration of aflatoxin in maize. Aust. J. Chem. 2018, 71, 868.
  52. Jia, S.; Yang, L.; An, D.; Liu, Z.; Yan, Y.; Li, S.; Zhang, X.; Zhu, D.; Gu, J. Feasibility of analyzing frost-damaged and non-viable maize kernels based on near infrared spectroscopy and chemometrics. J. Cereal Sci. 2016, 69, 145–150.
  53. Rady, A.; Adedeji, A. Assessing different processed meats for adulterants using visible-near-infrared spectroscopy. Meat Sci. 2018, 136, 59–67.
  54. Pullanagari, R.R.; Yule, I.J.; Agnew, M. On-line prediction of lamb fatty acid composition by visible near infrared spectroscopy. Meat Sci. 2015, 100, 156–163.
  55. Xie, L.; Ying, Y.; Ying, T. Quantification of chlorophyll content and classification of nontransgenic and transgenic tomato leaves using visible/near-infrared diffuse reflectance spectroscopy. J. Agric. Food Chem. 2007, 55, 4645–4650.
  56. Xie, L.; Ying, Y.; Ying, T.; Tian, H.; Niu, X.; Fu, X. Application of vis/nir diffuse reflectance spectroscopy to the detection and identification of transgenic tomato leaf. Spectrosc. Spectr. Anal. 2008, 28, 1062–1066.
  57. Long, Z.; Shan-Shan, W.; Yan-Fei, D.; Jia-Rong, P.; Cheng, Z. Discrimination of transgenic rice based on near infrared reflectance spectroscopy and partial least squares regression discriminant analysis. Rice Sci. 2015, 22, 245–249.
  58. Hao, Y.; Geng, P.; Wu, W.; Wen, Q.; Rao, M. Identification of rice varieties and transgenic characteristics based on near-infrared diffuse reflectance spectroscopy and chemometrics. Molecules 2019, 24, 4568.
  59. Guo, H.; Chen, J.; Pan, T.; Wang, J.; Cao, G. Vis-nir wavelength selection for non-destructive discriminant analysis of breed screening of transgenic sugarcane. Anal. Methods 2014, 6, 8810–8816.
  60. Guisong, L.; Haosong, G.; Tao, P.; Wang, J.; Cao, G. Vis-nir spectroscopic pattern recognition combined with sg smoothing applied to breed screening of transgenic sugarcane. Spectrosc. Spectr. Anal. 2014, 34, 2701–2706.
  61. Yafeng, Z.; Qian, S.; Wenjin, W.; Zhentian, H.; Zongying, Z.; Jiashuang, A.; Jin, D.; Xin, D.; Chenggui, H.; Jialin, Y.; et al. Fast discrimination of varieties of transgene wheat based on bionimetic pattern recognition and near infrared spectra. Spectrosc. Spectr. Anal. 2010, 30, 924–928.
  62. Luna, A.S.; Da Silva, A.P.; Pinho, J.S.A.; Ferré, J.; Boqué, R. Rapid characterization of transgenic and non-transgenic soybean oils by chemometric methods using nir spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2013, 100, 115–119.
More
Video Production Service