Beyond the classical computational approaches in drug discovery, such as ligand- (mainly QSAR methods and pharmacophore modeling)
[24][25][26][27][28][29][30] and structure-based strategies (mainly based on molecular docking and molecular dynamics)
[31][32][33][34][35][36][37][38][39] or a combination of them
[40][41][42][43][44][45], currently these computational methods are integrated with ML technologies for improving the reliability of the calculation, avoiding false positive outcomes and enhancing the success ratio in identifying safer hit compounds. Some examples are represented by QSAR-ML models
[46][47][48][49], and multi- and combi-QSAR approaches
[50][51][52][53][54][55][56]. Furthermore, in the drug discovery field, advanced computational models, based on ML technology, have demonstrated strong potential in selecting effective hit compounds
[57][58][59][60][61][62][63][64]. Moreover, ML-based approaches represent a valuable resource also in the drug repurposing field
[65][66]. Interestingly, these approaches have provided potential drugs for treating COVID-19 in a short time
[67]68]. Currently, protein structural modeling and design, as well as protein structure prediction, which can increase the proficiency in the drug discovery pipeline, are emerging areas of application of ML models
[68][69][70][71]. In fact, ML methods offer a theoretical framework for identifying and prioritizing bioactive molecules possessing suitable pharmacological profile, as well as optimizing them as drug-like lead compounds before clinical investigation
[64]. Generally, three steps allow the development of a computational protocol enabling ML-based models: (a) the selection of appropriate descriptors for capturing crucial features of compounds involved in the study; (b) the selection of a suitable metric or scoring system for comparing the set of molecules; (c) a proper ML-based technique for determining the characteristic traits of the features that help to qualitatively or quantitatively discriminate active molecules from inactive ones
[72][73]. ML/DL approaches suitable in the drug discovery field include RF, Artificial Neural Networks (ANN), Deep Neural Networks (DNN), Graph Convolutional Neural Networks (GCNN), Convolutional Neural Networks (CNN), Naïve Bayesian techniques, Multiple Linear Regression (MLR), natural language processing (NLP), decision trees (DT), Logistic Regression (LR), Linear Discriminant Analysis (LDA), Multi-Layer Perceptron (MLP), Probabilistic Neural Networks (PNN), k-nearest neighbors (k-NN), and Support Vector Machine (SVM), only considering some of them in the context of ML
[74][75][76].
2.1.2. Drug Target Prediction and Biomarker Identification
Noteworthy is that in addition to the previously discussed ML approaches to identify promising drug candidates, AI techniques are also emerging in drug target prediction, with remarkable success. For instance, in the field of neurodegenerative disorders, we report here significant progress in ML approaches applied to drug target identification in the drug discovery/drug repurposing field (Table 2).
Table 2. Main examples of AI/ML in drug target prediction and biomarker identification.
2.1.3. AI/ML in Quantitative Systems Pharmacology (QSP)
Following the identification of prospective therapeutic drug targets, analysis must be performed to validate them. Computational approaches offer affordable, time-saving strategies to evaluate the likelihood that potential targets could provide an efficient method for treating a given disorder. Accordingly, a pivotal step in target validation is represented by the construction of a confidence interval for a given potential therapeutic hypothesis, employing quantitative systems pharmacology (QSP) models
[77]. QSP is a stimulating and effective conjunction of biological pathways, pharmacology, and mathematical models for drug development. QSP possesses potential for providing a considerable impact on modern medicine as a result of the discovery and deployment of new molecular pathways and drug targets in the quest for innovative therapeutic agents and personalized medicine. The combination of these specialties is triggering substantial attention in pharma companies to expand predictions from a pharmacodynamic (PD) and pharmacokinetics (PK) perspective, and through improvements in computing capacity, QSP is currently capable of improving outcomes in the drug discovery trajectory. In fact, QSP models can combine information on PK/PD properties, biological processes of interest, and mechanisms of action, resulting from prior knowledge and available preclinical and clinical data, to quantitatively predict efficacy and safety responses over time and translate molecular data to clinical outcomes
[78][79][80][81]. QSP provides a perfect quantitative framework for integrating different big data sources, including omics (i.e., proteomics, transcriptomics, metabolomics, and genomics) and imaging, the dimensionality of which can be reduced using ML methods. By allowing the identification of relevant association and data representations, the development of QSP platforms with higher granularity and enhanced predictive power can be further enhanced
[82]. Moreover, the opportunity to implement a QSP platform with ML techniques to enhance the capacity to handle big data can offer great opportunities for systems pharmacology modeling. In fact, with the high availability of processed and organized data for building interpretable and actionable computational models, supporting decision making in the whole process of drug discovery and development, QSP can improve the reliability of predictions, providing more complex analysis, a better understanding of biomedical systems, and ultimately the design of optimized treatments. We report some examples regarding this approach.
2.2. Imaging, Biomarkers, Diagnosis, and Disease Progression
With the growing accessibility to high-quality amounts of cell imaging data, there are currently relevant possibilities to use ML-based methods to aid researchers in cell image processing. In fact, the image features that are supposed to be crucial in producing predictions or diagnoses can be generally processed using ML algorithms. The latter offers the possibility of predictive, descriptive, and prescriptive assessment to acquire relevant information that would otherwise be impossible to obtain by human analysis, providing accurate medical diagnoses
[83][84]. Accordingly, in recent years, numerous clinical investigations have enabled the use of AI in several fields, providing general pathological classification, risk evaluation, diagnosis, prognosis, and the prediction of appropriate therapy and possible responses to a given pharmacological treatment
[85][86]. In particular, DL, a class of ML that employs ANN (CNN and recurrent neural networks (RNN)) resembling human cognitive capabilities, has proven undeniable superiority over conventional ML approaches owing to algorithm improvement, better processing hardware, and access to massive amounts of imaging data
[87]. The successful incorporation of DL technology into normal clinical practice has determined that the diagnosis accuracy is comparable to that of healthcare experts. Furthermore, DL model integration provides additional advantages, including speed, efficiency, affordability, increased accessibility, and ethical behavior
[88]. For these reasons, the FDA has approved the use of specific DL-driven diagnostic computational tools for clinical usage (
Table 3)
[89][90][91]. The application of AI encompasses several medical and biomedical fields, including radiology
[92], gastroenterology
[93][94], neurology
[95][96], ophthalmology
[97][98], cardiology
[99][100], dermatology
[101], general pathology
[102], oncology
[103], healthcare
[104][105], and clinical medicine
[105][106].
Table 3. List of some examples of FDA-approved AI/ML-based solutions
[89][91][103][107][108][109].