With the aim to obtain an accurate metabolites identification from the observation of the corresponding peaks in complex mixtures, Kim et al.
[27] developed a convolutional neural network (CNN) model, called SMART-Miner, which is trained on 657 chemical entities collected from HMDB and BMRB databases. After training, the model is able to automatically carry out the recognition of metabolites from 1H-13C HSQC NMR spectra of complex metabolite mixtures, showing higher performance in comparison with other NMR-based metabolomic tools.
Brougham et al.
[28], by employing ANNs on 1H NMR spectra, performed a successful classification of four lung carcinoma cell lines, showing different drug-resistance patterns.
ThRese
authoarchers chose human lung carcinoma and adenocarcinoma cell lines together with specific drug-resistant daughter lines. The ANN architecture was constructed at first using three layers and the corresponding weights were determined by minimizing the root mean square error. Then,
the authoresearchers analyzed networks with four layers, two of which are hidden. Their results show that the four-layer structure with two hidden layers provided a 100% successful classification
[28]. These data are very interesting in terms of the robustness of the used approach: the cell lines were correctly classified, even though the effects were provoked by the operator and independently from the spectra chosen for training and validation.
Very recently, Di Donato et al.
[29] analyzed serum samples from 94 elderly patients with early stage colorectal cancer and 75 elderly patients with metastatic colorectal cancer. With the aim to separately observe each different molecular component, these
resea
uthorchers acquired one-dimensional proton NMR spectra by using three different pulse sequences for each sample: (i) a nuclear Overhauser effect spectroscopy pulse sequence to observe molecules with both low and high molecular weight; (ii) a common spin echo mono-dimensional pulse sequence
[30] to observe only lighter metabolites and (iii) a common diffusion-edited pulse sequence to observe only macromolecules
[29]. Their results, taking advantage of Kaplan–Meier curves for prognosis and of a PCA-based kNN analysis, allowed distinguishing relapse-free and metastatic cancer groups, with the advantage of obtaining information about the risks in the early stage of the colorectal cancer disease.
Peng et al.
[31], by using two-dimensional NMR correlational spectroscopy on the longitudinal and transversal components of the magnetization relaxation time during its equilibrium recovery, were able to perform a molecular phenotyping of blood with the employment of supervised learning models, including neural networks. In detail, by means of a fast two-dimensional Laplace inversion
[32], they obtained T1–T2
2 correlation spectra on a single drop of blood in a few minutes with a benchtop-sized NMR spectrometer. Then, they converted the NMR correlational maps for deep image analysis, achieving useful insights for medical decision making by the application of machine learning techniques. In particular, after an initial dimensionality reduction by unsupervised analysis, supervised neural network models were applied to train and predict the data that, at the end, were compared with the diagnostic prediction made by humans. The results showed that ML approaches outperformed the human being and took a much shorter time. Therefore,
threse
authoarchers demonstrated the clinical efficacy of this technique by analyzing human blood in different physiological and pathological conditions, such as oxidation states
[31]. Concerning the analysis of different physiological conditions, it was reported the T1–T2 correlational maps of blood cells at oxygenated (a), oxidized (b), and deoxygenated (c) states. Three peaks with different relaxation times values were observed and assigned to the different microenvironments that water experiences in the considered samples of red blood cells. For the obtained maps, the coordinate for the bulk water peak (slowest component) is shown at the upper left of the map indicating T1 and T2 relaxations (in ms) and T1/T2
2-ratio, respectively. Instead, the coordinates of the fastest components, due to hydration and bound water molecules
[33], are reported close to the corresponding correlation peak.