Deep Learning in Liver Transplantation

Deep Learning in Liver Transplantation: Comparison

Please note this is a comparison between Version 2 by Rafael Calleja Lozano and Version 1 by Rafael Calleja Lozano.

Liver transplantation outcomes have improved in recent years. HLoowever, with the emergence of expanded donor criteria, tools to better assist donor–rking for the best Donor-Recipient matching have become necessary(D-R matching) as always been a challenge for the liver transplantation surgeons. Most of the currently proposed scores based on conventional biostatistics are not good classifiers of a problem that is considered “unbalanced.” In recenlast years, the implementation of artificial intelligence in medicine has experienced exponential growth. Deep learning, a branch of artificial intelligence, may be the answer to this classification problem. The with capability to handle a large number of variables with speed, objectivity, and multi-objective analysis is one of its advantagesis. Artificial neural networks and random forests have been are the most widely used deep-learning classifiers in this field. This review aims to give a brief overview of D–R matching and its evolution in recent years and how artificial intelligence may be able to provide a solution.Both classifiers have been able to show a high predictive ability in the graft survival of a D-R pair compared to traditional classifiers. There are even authors that have successfully created a matching model based on one of them. However, they are not without limitations. The question now is: Will these limitations be overcome in the coming years?

donor–recipient matching
artificial intelligence
deep learning

1. ICurrentroduc State of Artion

The problem of liver donor–recipient (D–R) matching is not new and is inherent to organ transplantation. The improvement in surgical techniques, postoperative management, diagnosis, and treatment of post-transplant complications or the development of liver preservation techniques are some of the barriers that liver transplantation has overcome in recent years. As a result, the recipient- and graft-survival rates at one year are above 95%, and long-term survival has become the norm. However, the Achilles’ heel of liver transplantation continues to be the disproportionate numbers between the number of shortage in donors pool and the numberincrease of waitlisted patients. This leads to long waiting times and mortality among patients A long time on waiting for a graft that list (can reach as high as 20% in certain patient groups ^[1][2]. F) can leard from improving this situation, the current expansion of inclusion criteria, such as elderly recipients or indications for malignant tumors (trto the death of patients waiting for an organsplant oncology), may further aggravate this problem^[1][2]. This shortage might lead us to think that the use of grafts would be higher. Paradoxically, however, in countries where deceased donor rates are low, the utilization rate is high and inverse^[2]. Se current expansion ome of the proposed solutions have been to include expandedinclusion criteria donors (ECDs) or improve graft utilization using preservation machines, one of the most promising developing fields ^[3]have aggravated this problem.

The imbalance between candidates and grafts is further complicated by organ allocation policies since there exist as many policies as decisions about what to prioritize. On thBrie one hand, fly, we have policies based on the principle of urgency (the “sickest-first” principle) benefit high-risk candidates. On the other hand, policies based on the principles of , on “individual transplant benefit principle” and “population-based transplant benefit” favor candidates in a better clinical condition and aim to achieve better transplantation results principle” ^[43]. FurthHowevermore, not all organs and not all recipients are equal. A high-risk donor (e.g., ECD) combined with a high-risk r-recipient is a high-risk combination, which may qualify the transplant as futile. This has lematching may lead to the avoidance of such pairings in clinical practice (risk divergence allocation policy)futility ^[5][6]. Forf this reason, many high-risk, waitlisted te transplant candidates are , penalized. To address this situation, and with ting these patients on waiting lists. The development of perfusion machines, the opposite strategy has been proposed: enable has enabled the use of marginal grafts using normothermic machine perfusion (NMP) to serve high-risk candidates (NAPLES initiative) ^[74].

Today, artificial intelligence (AI) is revolutionizing the field of hepatology and liver surgery. AI applications, particularly through machine learning, have now become common in the fields of diagnostic imaging and image-guided surgery ^[8]. A classicalthough such AI-based solutions may seem recent and novel, in 1994, Doyle et al. published the first work on artificial neural networkmodels (ANNs) in the field of liver transplantation^[9].

This review aims to offer a brief overview of the D–R matching crossroads. To this end, some basic AI concepts are explained, and the evolution of AI in recent years is explored to determine if this technology is a solution that seems to border on utopia.

2. What Is the StartFing Point? The Achilles’ Heel of Traditional D–R Matching Models

The classical models (Figure 1ure 1) used to design organ allocation policies consider systems based on patient characteristics or donor risks, or a combination of donor and recipient characteristics. These D–R systems use conventional biostatistics and methodologies, such as logistic regression and^[5] obuther linear model have important limitations ^[106].: Although these models have been analyzed in-depth, none offers an adequate response to D–R matching ^[43]. MatThematically, liver transplantation is a dichotomous problem. Different variables (donor, recipient, and logistics) are combined to obtain two possible outcomes: graft survival or graft loss at different endpoints (3 and 12 months are the most commonly used). However, no current allocation system is capable of achieving an ideal match. That is, these systems reason is that these models are unable to identify the candidate on the waiting list with the highest probability of death and identify, from all available grafts, the one with the highest probability of post-transplantation success for this candidate.

Chart, waterfall chart

Description automatically generated

Figure 1. Different D–R matching systems based on donor and recipient variables. COD, cause of death; CVA, cardiovascular accident; DCDD, donation after circulatory determination of death; PVT, portal vein thrombosis. Figure obtained from Briceno J, Ciria R, de la Mata M. Donor–recipient matching: myths and realities. J. Hepatol. 2013, 58 (4), 811–820. Copyright © 2012 European Association for the Study of the Liver. Published by Elsevier Ireland Ltd. All rights reserved.

In allocation policies based on the sickest-first principle, the Mayo Model for End-Stage Liver Disease (MELD) score is the most commonly used score to prioritize waitlisted candidates. Over the past two decades, the MELD score has been modified several times. The current MELD score establishes a cut-off score point (<15) below which transplantation may be unsuitable for the patient and is a valid predictor of waitlist mortality. Despite its utilityDespite its utility, however, MELD (and its modifications) shows poor predictive capacity (C-statistic of 0.55) in post-transplant survival and lacks precision in prioritizing indications other than liver dysfunction (e.g., in pediatric recipients or hepatocarcinoma)^[97]. This has led to the development of special systems based on extra points ^{[58][69][1110]}.

Other liver scoring systems, such as the Balance of Risk (BAR) score ^[1211] or the Survival Outcome Following Liver Transplantation (SOFT) score^[1312], have been validated and are being used as tools in the clinical decision-making process. The SOFT score is a reliable predictor of 3-month mortality after liver transplantation that utilizes 18 variables: 13 donor variables, 4 recipient variables, and 1 logistics variable. However, the BAR score is the best measure to predict 90-day morbidity with reasonable accuracy (area under the receiver operating characteristic [AUROC] > AUC>0.70), as it can detect unfavorable D–R factor combinations before liver graft allocation ^[1413]. Unfortunately, both BAR and SOFT are “all-or-nothing” scores, so they are unable to identify which of several D–R pairs will achieve the best outcome; that is, they are not “matching” systems ^[43].

The previous scores are all based on statistical models that include logistic regression but have important limitations ^[15]:

They assume a linear relationship between variables. Most health sciences relationships are non-linear, so this statistical methodology is not accurate.

The models exclude variables considered non-significant when all variables contribute to a clinical outcome to a greater or lesser degree.

In unbalanced problems such as liver transplantation, where deceased patients are rare, and most of them survive, logistic regression does not have an adequate predictive capacity. This is because modern biostatistics are not able to predict unbalanced phenomena, and the most common solution is to use large cohorts of patients to increase the number of infrequent events.

32. WhaConcept Iss: Artificial Intelligence, and Are Machine Learning and Deep Learning the Same Concept?.

AI is a branch of computational science that studies computational models capable of performing human-like activities based on two fundamental characteristics: behavior and reasoning. Its applications are diverse, including data analysis. Machine learning is defined as a branch of AI that focuses on the use of data and algorithms to mimic the way humans learn and gradually improve the algorithms’ accuracy. This learning process is understood as the ability to identify a series of complex patterns determined by a large number of variables. Therefore, the machine does not learn by itself, but the algorithm modifies itself automatically depending on the data input in its interface, thus allowing scenarios and conditions to be predicted in an automated way. For this reason, AI is being increasingly applied in the health sciences to predict clinical outcomes^[1614].

Machine learning can be approached in different ways, such as supervisedd deep learning (the algorithm receives already labeled data and the expected type of response), unsupervised learning (the algorithm receives an unlabeled dataset and must find its patterns), and reinforcement learning (the algorithm learns from the environment through positive or negative reinforcement). To sum up, machine learning involves the development of an algorithmic model that is then trained on data. There are many different models, such as decision tree classifiers, Bayesian networks, and neural networks (specifically convolutional neural networks). These models use a set of techniques that pursue learning through examples and are capable of recognizing complex problems and solutions, what is known as “deep learning.” Therefore, machine learning and deep learning are not on the same level, but the seare not on the same level, but the second is part of the first. Even so, it is possible to compare both and establish some differences.

While machine learning uses algorithms to analyze data, learn, and generate results or make decisions based on what it learns, deep learning structures the algorithms into layers of convolutional neural networks that help it learn and generate more accurate results. Data used by machine learning algorithms are structured and labeled for their predictions. This does not mean that they cannot work from unstructured data, but to do so, they need to perform some information pre-processing. Deep learning algorithms eliminate some of these pre-processing needs, as they can work with unstructured data and extract features in an automated or independent way. Finally, deep learning algorithms work in layers that reduce the margin of error. Each layer makes a judgment and combines that judgment with the result of the previous layer. The more information it receives and processes, the more accurate it becomes.

What role do deep learning algorithms play in D–R matching? As we noted in a recent publication^[15], clinical decisions have both an objective and a subjective component. Scientific data, memory, and previous experiences serve as the basis for clinical reasoning, while aspects, such as intuition or emotions, form the subjective component. Therefore, clinical decisions in D–R matching have an inherent emotional bias. A single D–R matching may include around 100 parameters between the donor and recipient’s characteristics and logistical aspects. Deep-learning classifiers use multiple previous experiences based on objective data (databases) to make the best decision for which they have been trained. The subjective component of the decision is non-existent, and these classifiers are able to handle large amounts of data in a short time, which is why s is the reason why AI and particularly deep-learning classifiers are an interesting alternative to traditional models^[1715].

43. The Role ofApplications: Deep Learning in Liver Transplantation

Deep learning provides a variety of classifiers that can be utilized in almost any field of medicine^{[816][1614][1715]}. Selecting the most appropriate classifier for the problem to be studied is perhaps the greatest difficulty for those unfamiliar with how these models function. Most studies on liver transplantation have focused on the development of models to predict post-transplant graft survival. However, predicting waitlist mortality^[18] or the probability of developing post-transplant acute renal failure^[19] has also been the subject of study. ANNs and random forests are the most frequently used classifiers in this field, and studies aimed at improving D–R matching may use any of them.

43.1. Artificial Neural Networks

The ability of ANNs to predict post-transplant outcomes based on D–R matching is promising. ANN classi classifiers imitate the design of human neuronal networks (Figure 2). Briefly, they consist of several groups of units (neurons) organized in different layers.

Figure 2. Representation of a basic neural network. Different layers are represented in blue (input layer), gray (hidden layer), and yellow (output layer). The arrows represent the relationships between neurons (weights).

A basic neural network consists of an input layer, a hidden layer, and an output layer. The number of layers and the ANN training can vary. Both the neurons and the relationships established between them are mathematical algorithms. The relationships (weights) between the neurons in the different layers are not constant but vary as increasing data are introduced, which the model learns from. Extrapolating this to a specific cIn a clinical problem, a series of input variables isare introduced into the neural network, which then processes them, according to the training received, to provide output variables of clinical interest. It is, th

Figure 2. Reprefore, important that thesentation of a basic neural network is trained as robustly as possible. Briefly, the process involves splitting the dataset into two groups. The first data group is the “training set,” which includes 75% or 90% of the cases. The second group (the remaining cases) is called the “validation set” and is used to check ANN performance. Because the network is built this way, the predictive capacity (validity) of an ANN may be affected negatively by the phenomena of “overtraining” and “overfitting. Different layers are represented in blue (input layer), gray (hidden layer), and yellow (output layer). The arrows represent the relationships between neurons (weights).”

To understand the advantages of ANNs in the field of liver transplantation, it is important to consider that the most common scenario is graft survival, while graft failure is rare, which is why it is said to be an unbalanced problem. Traditional biostatistics models are good predictors for outcomes that occur frequently; that is, they predict graft survival very well (majority class). However, they show a poor ability to predict graft failure (minority class) because it is not the usual outcome. In this regard, ANNs are able to predict both probabilities independently as they handle a large amount of data (variables). The “surviving class or majority class” prediction is based on the concept of correct classification rate (CCR, accuracy), which refers to the proportion of training patterns classified correctly by the ANN. On the other hand, the “non-surviving class or minority class” prediction capability is measured using the concept of minimum sensitivity (MS). It is necessary to understand these concepts in order to understand the results obtained by these classifiers.

However, since the model works with probabilities, a donor will be assigned to a recipient according to the probability of survival and graft failure without taking into account the severity of the recipient. Therefore, it is necessis necessary to establish certain conditions for organ allocation (rules-based system). If this does not occur, the allocation would be biased, and the best candidates would receive the best grafts (i.e., those with a higher probability of success). The abovAll these concepts applied to D–R matching are schematized in Figure 3.

Diagram

Description automatically generated

Figure 3. Diagram of an ANN-based on a multi-objective algorithm. Liver transplantation outcomes are shown as an unbalanced problem, and we classify them into the majority class (probability of surviving after liver transplantation, NN-CCR) and the minority class (probability of not surviving, NN-MS). By combining both probabilities (NN-CCR and NN-MS) based on input variables, we obtain a final D–R matching according to a rules-based system. ANN, artificial neural network; NN-CCR, neural network based on the correct classification rate or accuracy; NN-MS, neural network based on the minimum sensitivity.

From a clinical point of view, Briceño et al. ^[2017] were the first to apply a neural network combined with a system of rules to create a donor-recipient allocation model (M.A.D.R.E model). This multicenter study included a total of 1003 liver transplants performed between 2007 and 2008, using 57 variables (recipient, donor, and logistics). The probability of graft failure at 3 months was the endpoint variable. Firstly, ANN-CCR predicted a 90.79% probability of graft survival with an area under the curve (AUC) of 0.80, while ANN-MS predicted a 71.42% probability of graft loss with an AUC of 0.82. Secondly, the authors demonstrated the superiority of ANNs in donor allocation over biostatistics-based prioritization scores (MELD, D-MELD, SOFT, P-SOFT, DRI, and BAR). Finally, the allocation system used the results obtained by the constructed ANNs and successfully assigned the best candidate for a graft according to the different probabilities (CCR and MS) from among a group of patients with higher MELD, using its rules-based system.

These autho ers validated externally validate tthis methodology, with a second study was performed^[2118] wachith a dataset of 858 D–R pairs from liver transplants at King’s College Hospital (KCH) in London. The authors found that the models obtained with this database achieved excellenteving excellent prediction results at 3 months [CCR-AUC 0.94; MS-AUC 0.94] and 12 months (CCR-AUC 0.78; MS-AUC 0.82). When these results were compared with other scores, such as MELD and BAR (Figure 4), a 15% difference was found in favor of the proposed model. The main reason for these findings was that a homogenous database with a low number of missing values was used. In addition to the differences in the input variables and the population, this would justify the differences between the KCH database and the Spanish model. Therefore, the authors concluded that each ANN should be used in the specific population in which it was trained.

A picture containing chart

Description automatically generated

Figure 4. 3their most recent study, Guijo-month graft survival model based on ANN compared to other scores using the KCH database. CCR (correct classification rate or accuracy), MS (minimum sensitivity), and AUC (area under curve) values are shown. Figure obtained from Ayllón MD ubio et al. [21] © 2017 by the American Association for the Study of Liver Diseases.

In their most recent study, Briceño et al. ^[2219] analyzed how ANNs work using the United Network for Organ Sharing (UNOS) dataset. This dataset comprised 39,189 liver transplants with donor, recipient, and logistics variables. Prediction of the majority class (graft survival class) and minority class (non-survival class) at different time points (3 months and 1, 2, and 5 years) were the selected endpoints. Classical statistical models (naïve Bayes or logistic regression) were compared with different machine learning models: ANN, random forest, gradient boosting, and support vector machines. For the 5-year endpoint, machine learning techniques, such as ANN (AUC = 0.599) or random forest (AUC = 0.644), were outperformed by logistic regression (AUC = 0.654). In general, tThe predictive capacity of the AI models (including ANNs) was very similar to that obtained by traditional models (C-statistic ≤ 0.66). The authors argued that threason was these classifiers were trained on a database with a high percentage of missing values (only 28 variables had a percentage of less than 10%.).

Strengths and Weaknesses

Table 1 summarizes the main strengths and weaknesses of neural networks as a classifier. In clinical scenarios, neural networks are very useful for finding patterns that are far too complex or numerous since they can generate near-perfect predictions using the data on which they are fit^[2320]. In addition, data processing is performed quickly—an essential aspect of graft allocation. However, ANNs are inherently opaque and lack interpretability because the set of weights or algorithms in hidden layers is unknown. This is called the “black-box” issue, which has made many clinicians skeptical of their use because it is necessary to know all the details of the process ^[816].

The predictability of an AI model depends on the robustness of the database. Consequently, ANNs for D–R matching can only be applied in very homogeneous databases that follow similar rules for the prioritizand results are conditioned by the population and inclusion of candidatesin which they are trained. This is why ANNs may work very well in local and regional liver transplantation programs but cannot be extrapolated to other centers, thus requiring regional-specific ANN models.

Although the results of predictive models are good, most databases are small, have a high number of missing values, or can only be applied to the population where the ANN has been trained. In addition, neural networks depend on a system of rules to properly perform donor–recipient matching but are based on the sickest-first principle. Therefore, nowadays, ANNs can only assist, but not carry out, the matching decision in all aspects of organ transplantation ^[2421][25].

Table 13. Main strengths and weaknesses of neural networks as a classification method based on artificial intelligence.

Artificial Neural Networks. Strengths and Weaknesses

Strengths

Ability to work with incomplete knowledge. After ANN training, the data may produce output even with incomplete information.

Fault tolerance. The corruption of one or more neurons of the ANN does not prevent it from generating output, even if it reduces the training capacity of the network.

A distributed memory. For an ANN to be able to learn, it is necessary to determine the examples of a database and teach the network according to the desired output by showing it these examples.

Ability to make machine learning. With neural networks, classification, clustering, and regression models associated with decision support systems are built.

Parallel processing capability. Artificial neural networks have numerical strength that can perform more than one job at the same time.

They are applied in many branches of knowledge: medicine, engineering, economics, agriculture, energy, climate change, ecology, etc.

Weaknesses

Hardware dependence. ANNs require processors with parallel processing power in accordance with their structure. Therefore, current convolutional neural networks require GPU graphics cards.

Unexplained behavior of the network. The models obtained with this methodology lack the ability to interpret the results obtained. It is one of the main disadvantages of this type of methodology.

Determination of proper network structure. There is no specific rule for determining the structure of ANNs. Thus, if an adequate architecture is not designed, the results can be approximate.

Difficulty of showing the problem to the network. ANNs work with numerical information, so other types of information must be transformed into numerical information.

4.2. Random Forests

Random forests are deep-learning classifiers based on decision trees. It is an ensemble-type methodology (i.e., a model of models), and it is necessary to determine the number of models that will form the final model and verify that these models are not correlated because their results will not be very adequate. Aside from these two drawbacks, the methodology is superior to the decision trees from which it comes. For each of the possible outputs, a different decision tree is built. The database is “split” by the researcher into different nodes. The database requires a previous treatment filter to avoid over-training and overfitting the data.

Lau et al. ^[2622] examined how models based on random forests could predict post-transplant graft failure compared with other scores. The dataset included 180 liver-transplanted patients (173 donor and 103 recipient variables). Randorandom forests and ANNs were compared with the Donor Risk Index (DRI), MELD, and SOFT. The random forest demonst demonstrated its superiority with an AUC of 0.787 compared to ANN (AUC = 0.734) and DRI (AUC = 0.595). In addition, they obtained a simplified model of 15 variables that achieved an AUC of 0.715, as well as with other scores. The percentage of missing values in these 15 top variables s used in this study, ranged from 0% to 72.22%, with 5 variables with missing values >10%, thus demonstrating the ability of random forest models to work with a high percentage of incomplete data.

Strengths and Weaknesses

The most important advantages of random forest classifiers are that a) it is not necessary to normalize the variables, unlike neural network models, which require normalization of the independent variables of the model; b) unlike ANNs, they perform very well with a small database and a high percentage of missing values ^[2723]; and cb) they have excellent predictive power, thus providing a very nice and sophisticated output with variable importance.

. Unfortunately, thesey models are not useful with larger datasets since (the number of decision trees they generate can be unmanageable. When looking for the minority endpoint in an unbalanced problem (i.e., graft failure in liver transplantation), a larger database is required to have more casuistry. Thus, the model accuracy may be affected. Moreover, ould be unmanageable) and they have a high risk of “over-fitting,” that is, their effectiveness on the training dataset is sometimes much higher than that obtained on the validation and/or generalization dataset.

4.3. Study Limitations

The implementation of AI in the field of liver diseases has grown exponentially. However, the number of clinical (not methodological) papers addressing D–R matching is small. Most of the papers mentioned are observational studies (data retrieved from databases). To truly test the predictive power of artificial intelligence, large prospective cohort studies with external validation are needed. To date, only one author has validated this methodology externally ^[21]. However, external validation may be questionable since classifiers based on deep learning perform better in populations where they are trained. Thus, the most realistic and suitable option would be to use region-specific models ^[15].

Given that the studies differ significantly in terms of size, design, prediction models, and dataset quality, it is difficult to determine which classifier is preferred for each scenario. Finally, a time-to-event (i.e., graft failure) approach could be interesting as this has not been done to date. This approach could be useful as a quantitative measure of survival gained/lost when accepting/rejecting a specific organ, which would aid clinicians when making decisions about grafts ^[28].

From an ethical point of view, there are three barriers to overcome. The first is the “black box issue,” which may cause mistrust among clinicians because they do not know the weight of the variables in the models. The second is data privacy and cyber security. The last barrier is finding an adequate answer to the following question: Who is responsible if the model fails?

5. ConcCurrent applicabilusionsty: What Iis on the Hhorizon?

AI has contributed to the field of liver transplantation through different classifiers, such as ANNs or random forests ^[2924]. On the one hand, machine learning classifiers operate impartially as they are not affected by subjective factors. On the other, they can handle a multitude of variables of clinical interest in a quick and easy way (faster than humans) to identify the best outcome. These are the main reasons that make the use of AI so attractive from a clinical point of view. Achieving I-better outcomes in liver transplantation implies a lower economic investment compared with dysfunctional grafts. Additionally, better D–R matching would lead to better post-transplant outcomes that would be more cost-effective in the long term than dysfunctional grafts. Methodologies such as AI that aim to imased classifiers may improve D–R matching in these terms would also reduceerms such as procedure costs, thus obtaining or individual and social benefits ^[3025].

Deep learning is the branch of artificial intelligence that appears to be undergoing the greatest development in this field. Nitski etIn al. ^[31] assessed the ability of deep-learning algorithms to predict post-transplant complications that result in patient death. The AUCs for prediction of death by graft failure within one year achieved by ANN models (with a low missing values dataset) ranged from 0.847 to 0.871, values that were replicated in the testing group. Other examples include ition to the advances mentioned above, we have examples such as the application of deep learning to assess CT volumetry in living donors ^[3226], the prediction of hepatocellular carcinoma recurrence after liver resection ^[3327], or the identification of hepatic steatosis in living donors ^[3428].

However, while the scientific production related to AI is more abundant in other areas of liver disease [8], D–R matching remains controversial. The worksdata published to date show interestpromising results but have been unable to achievenot reached clinical applicability. In the opinion of the authorsCurrently, there are three key points to implement these models in our clinical decisions:

a) oOvercome our skeptical mentality and ethical barriers as clinicians; three ethical barriers. The first is the “black box issue,” which may cause mistrust among clinicians because they do not know the weight of the variables in the models. The second is data privacy and cyber security. The last is finding an adequate answer to the following question: Who is responsible if the model fails?

b) cCollect data without missing values to build large and robust datasets since robust datasets lead to accurate models and confidence. However, external validation may be questionable since classifiers based on deep learning perform better in populations where they are trained. Thus, the most realistic and suitable option would be to use region-specific models; .

c) dDo not consider AI-based tools as “self-driving cars,” but as tools to support decisions and complement current systems.

Funding: This work has received financial support from Mutua Madrileña XVIII Convocatoria de ayudas a la investigación.

Institutional Review Board Statement: Not applicable.

Informed Consent Statement: Not applicable.

Data Availability Statement: Not applicable.

Conflicts of Interest: The authors declare no conflict of interest

References

Kwong, A.; Kim, W.R.; Lake, J.R.; Smith, J.M.; Schladt, D.P.; Skeans, M.A.; Noreen, S.M.; Foutz, J.; Miller, E.; Snyder, J.J.; et al. OPTN/SRTR 2018 Annual Data Report: Liver. Am. J. Transplant. 2020, 20 (Suppl. S1), 193–299. https://doi.org/10.1111/ajt.15674.
2. Neuberger, J. Liver transplantation in the United Kingdom. Liver Transpl. 2016, 22, 1129–1135.Neuberger, J. Liver transplantation in the United Kingdom. Liver Transpl. 2016, 22, 1129–1135.
3. MacConmara, M.; Hanish, S.I.; Hwang, C.S.; De Gregorio, L.; Desai, D.M.; Feizpour, C.A.; Tanriover, B.; Markmann, J.F.; Zeh, H., III; Vagefi, P.A. Making Every Liver Count: Increased Transplant Yield of Donor Livers Through Normothermic Machine Perfusion. Ann. Surg. 2020, 272, 397–401.Briceno, J.; Ciria, R.; de la Mata, M. Donor-recipient matching: Myths and realities. J. Hepatol. 2013, 58, 811–820.
4. Briceno, J.; Ciria, R.; de la Mata, M. Donor-recipient matching: Myths and realities. J. Hepatol. 2013, 58, 811–820.Hann, A.; Lembach, H.; Nutu, A.; Dassanayake, B.; Tillakaratne, S.; McKay, S.C.; Boteon, A.P.C.S.; Boteon, Y.L.; Mergental, H.; Murphy, N.; et al. Outcomes of normothermic machine perfusion of liver grafts in repeat liver transplantation (NAPLES ini-tiative). Br. J. Surg. 2022, 109, 372–380.
5. Schlegel, A.; Linecker, M.; Kron, P.; Gyori, G.; De Oliveira, M.L.; Mullhaupt, B.; Clavien, P.A.; Dutkowski, P. Risk Assessment in High- and Low-MELD Liver Transplantation. Am. J. Transplant. 2017, 17, 1050–1063.Lewsey, J.D.; Dawwas, M.; Copley, L.P.; Gimson, A.; Van der Meulen, J.H. Developing a prognostic model for 90-day mortal-ity after liver transplantation based on pretransplant recipient factors. Transplantation 2006, 82, 898–907.
6. Lai, J.C. Defining the threshold for too sick for transplant. Curr. Opin. Organ Transplant. 2016, 21, 127–132.Briceño, J.; Calleja, R.; Hervás, C. Artificial intelligence and liver transplantation: Looking for the best donor-recipient pair-ing. Hepatobiliary Pancreat. Dis. Int. 2022, 21, 347–353. https://doi.org/10.1016/j.hbpd.2022.03.001.
7. Hann, A.; Lembach, H.; Nutu, A.; Dassanayake, B.; Tillakaratne, S.; McKay, S.C.; Boteon, A.P.C.S.; Boteon, Y.L.; Mergental, H.; Murphy, N.; et al. Outcomes of normothermic machine perfusion of liver grafts in repeat liver transplantation (NAPLES ini-tiative). Br. J. Surg. 2022, 109, 372–380.Doyle, H.R.; Dvorchik, I.; Mitchell, S.; Marino, I.R.; Ebert, F.H.; McMichael, J.; Fung, J.J. Predicting outcomes after liver trans-plantation. A connectionist approach. Ann. Surg. 1994, 219, 408–415. https://doi.org/10.1097/00000658-199404000-00012.
8. Veerankutty, F.H.; Jayan, G.; Yadav, M.K.; Manoj, K.S.; Yadav, A.; Nair, S.R.S.; Shabeerali, T.U.; Yeldho, V.; Sasidharan, M.; Rather, S.A. Artificial Intelligence in hepatology, liver surgery and transplantation: Emerging applications and frontiers of research. World J. Hepatol. 2021, 13, 1977–1990. https://doi.org/10.4254/wjh.v13.i12.1977.Schlegel, A.; Linecker, M.; Kron, P.; Gyori, G.; De Oliveira, M.L.; Mullhaupt, B.; Clavien, P.A.; Dutkowski, P. Risk Assessment in High- and Low-MELD Liver Transplantation. Am. J. Transplant. 2017, 17, 1050–1063.
9. Doyle, H.R.; Dvorchik, I.; Mitchell, S.; Marino, I.R.; Ebert, F.H.; McMichael, J.; Fung, J.J. Predicting outcomes after liver trans-plantation. A connectionist approach. Ann. Surg. 1994, 219, 408–415. https://doi.org/10.1097/00000658-199404000-00012.Lai, J.C. Defining the threshold for too sick for transplant. Curr. Opin. Organ Transplant. 2016, 21, 127–132.
10. Lewsey, J.D.; Dawwas, M.; Copley, L.P.; Gimson, A.; Van der Meulen, J.H. Developing a prognostic model for 90-day mortal-ity after liver transplantation based on pretransplant recipient factors. Transplantation 2006, 82, 898–907.Sacleux, S.C.; Samuel, D. A Critical Review of MELD as a Reliable Tool for Transplant Prioritization. Semin. Liver Dis. 2019, 39, 403–413.
11. Sacleux, S.C.; Samuel, D. A Critical Review of MELD as a Reliable Tool for Transplant Prioritization. Semin. Liver Dis. 2019, 39, 403–413.Dutkowski, P.; Oberkofler, C.E.; Slankamenac, K.; Puhan, M.A.; Schadde, E.; Müllhaupt, B.; Geier, A.; Clavien, P.A. Are there better guidelines for allocation in liver transplantation? A novel score targeting justice and utility in the model for end-stage liver disease era. Ann Surg. 2011, 254, 745–753. https://doi.org/10.1097/SLA.0b013e3182365081.
12. Dutkowski, P.; Oberkofler, C.E.; Slankamenac, K.; Puhan, M.A.; Schadde, E.; Müllhaupt, B.; Geier, A.; Clavien, P.A. Are there better guidelines for allocation in liver transplantation? A novel score targeting justice and utility in the model for end-stage liver disease era. Ann Surg. 2011, 254, 745–753. https://doi.org/10.1097/SLA.0b013e3182365081.Rana, A.; Hardy, M.A.; Halazun, K.J.; Woodland, D.C.; Ratner, L.E.; Samstein, B.; Guarrera, J.V.; Brown, R.S., Jr.; Emond, J.C. Survival outcomes following liver transplantation (SOFT) score: A novel method to predict patient survival following liver transplantation. Am. J. Transplant. 2008, 8, 2537–2546. https://doi.org/10.1111/j.1600-6143.2008.02400.x
13. Rana, A.; Hardy, M.A.; Halazun, K.J.; Woodland, D.C.; Ratner, L.E.; Samstein, B.; Guarrera, J.V.; Brown, R.S., Jr.; Emond, J.C. Survival outcomes following liver transplantation (SOFT) score: A novel method to predict patient survival following liver transplantation. Am. J. Transplant. 2008, 8, 2537–2546. https://doi.org/10.1111/j.1600-6143.2008.02400.xBoecker, J.; Czigany, Z.; Bednarsch, J.; Amygdalos, I.; Meister, F.; Santana, D.A.M.; Liu, W.J.; Strnad, P.; Neumann, U.P.; Lurje, G. Potential value and limitations of different clinical scoring systems in the assessment of short- and long-term outcome following orthotopic liver transplantation. PLoS ONE 2019, 14, e0214221.
14. Boecker, J.; Czigany, Z.; Bednarsch, J.; Amygdalos, I.; Meister, F.; Santana, D.A.M.; Liu, W.J.; Strnad, P.; Neumann, U.P.; Lurje, G. Potential value and limitations of different clinical scoring systems in the assessment of short- and long-term outcome following orthotopic liver transplantation. PLoS ONE 2019, 14, e0214221.Deo, R.C. Machine learning in medicine. Circulation 2015, 132, 1920–1930.
15. Briceño, J.; Calleja, R.; Hervás, C. Artificial intelligence and liver transplantation: Looking for the best donor-recipient pair-ing. Hepatobiliary Pancreat. Dis. Int. 2022, 21, 347–353. https://doi.org/10.1016/j.hbpd.2022.03.001.Fernández-Delgado, M.; Cernadas, E.; Barro, S.; Amorim, D. Do we need hundreds of classifiers to solve real world classifi-cation problems? J. Mach. Learn Res. 2014, 15, 3133–3181.
16. Deo, R.C. Machine learning in medicine. Circulation 2015, 132, 1920–1930.Veerankutty, F.H.; Jayan, G.; Yadav, M.K.; Manoj, K.S.; Yadav, A.; Nair, S.R.S.; Shabeerali, T.U.; Yeldho, V.; Sasidharan, M.; Rather, S.A. Artificial Intelligence in hepatology, liver surgery and transplantation: Emerging applications and frontiers of research. World J. Hepatol. 2021, 13, 1977–1990. https://doi.org/10.4254/wjh.v13.i12.1977.
17. Fernández-Delgado, M.; Cernadas, E.; Barro, S.; Amorim, D. Do we need hundreds of classifiers to solve real world classifi-cation problems? J. Mach. Learn Res. 2014, 15, 3133–3181.Briceño, J.; Cruz-Ramírez, M.; Prieto, M.; Navasa, M.; Ortiz de Urbina, J.; Orti, R.; Gómez-Bravo, M.Á.; Otero, A.; Varo, E.; Tomé, S.; et al. Use of artificial intelligence as an innovative donor-recipient matching model for liver transplantation: Re-sults from a multicenter Spanish study. J. Hepatol. 2014, 61, 1020–1028. https://doi.org/10.1016/j.jhep.2014.05.039.
18. Bertsimas, D.; Kung, J.; Trichakis, N.; Wang, Y.; Hirose, R.; Vagefi, P.A. Development and validation of an optimized predic-tion of mortality for candidates awaiting liver transplantation. Am. J. Transplant. 2019, 19, 1109–1118. https://doi.org/10.1111/ajt.15172.Ayllón, M.D., Ciria, R., Cruz-Ramírez, M., Pérez-Ortiz, M., Gómez, I., Valente, R., O'Grady, J., de la Mata, M., Hervás-Martínez, C., Heaton, N.D. and Briceño, J. (2018), Validation of artificial neural networks as a methodology for donor-recipient matching for liver transplantation. Liver Transpl, 24: 192-203. https://doi.org/10.1002/lt.24870
Liver Transplantation: Machine Learning Approaches vs. Logistic Regression Model. J. Clin. Med. 2018, 7, 428. https://doi.org/10.3390/jcm7110428.Guijo-Rubio, D.; Briceño, J.; Gutiérrez, P.A.; Ayllón, M.D.; Ciria, R.; Hervás-Martínez, C. Statistical methods versus machine learning techniques for donor-recipient matching in liver transplantation. PLoS ONE 2021, 16, e0252068. https://doi.org/10.1371/journal.pone.0252068.
20. Briceño, J.; Cruz-Ramírez, M.; Prieto, M.; Navasa, M.; Ortiz de Urbina, J.; Orti, R.; Gómez-Bravo, M.Á.; Otero, A.; Varo, E.; Tomé, S.; et al. Use of artificial intelligence as an innovative donor-recipient matching model for liver transplantation: Re-sults from a multicenter Spanish study. J. Hepatol. 2014, 61, 1020–1028. https://doi.org/10.1016/j.jhep.2014.05.039.Kelly, C.J.; Karthikesalingam, A.; Suleyman, M.; Corrado, G. King, D. Key challenges for delivering clinical impact with arti-ficial intelligence. BMC Med. 2019, 17, 195–204.
Use of artificial intelligence as an innovative donor-recipient matching model for liver transplantation: results from a multicenter Spanish studyRuffle, J.K.; Farmer, A.D.; Aziz, Q. Artificial intelligence-assisted gastroenterology promises and pitfalls. Am. J. Gastroenterol. 2019, 114, 422–428.
22. Guijo-Rubio, D.; Briceño, J.; Gutiérrez, P.A.; Ayllón, M.D.; Ciria, R.; Hervás-Martínez, C. Statistical methods versus machine learning techniques for donor-recipient matching in liver transplantation. PLoS ONE 2021, 16, e0252068. https://doi.org/10.1371/journal.pone.0252068.Lau, L.; Kankanige, Y.; Rubinstein, B.; Jones, R.; Christophi, C.; Muralidharan, V.; Bailey, J. Machine-Learning Algorithms Predict Graft Failure After Liver Transplantation. Transplantation 2017, 101, e125–e132. https://doi.org/10.1097/TP.0000000000001600.
23. Kelly, C.J.; Karthikesalingam, A.; Suleyman, M.; Corrado, G. King, D. Key challenges for delivering clinical impact with arti-ficial intelligence. BMC Med. 2019, 17, 195–204.Sapir-Pichhadze, R.; Kaplan, B. Seeing the Forest for the Trees: Random Forest Models for Predicting Survival in Kidney Transplant Recipients. Transplantation 2020, 104, 905–906. https://doi.org/10.1097/TP.0000000000002923.
24. Ruffle, J.K.; Farmer, A.D.; Aziz, Q. Artificial intelligence-assisted gastroenterology promises and pitfalls. Am. J. Gastroenterol. 2019, 114, 422–428.Spann, A.; Yasodhara, A.; Kang, J.; Watt, K.; Wang, B.; Goldenberg, A.; Bhat, M. Applying machine learning in liver disease & transplantation: A comprehensive review. Hepatology 2020, 71, 1093–1105.
25. Thrall, J.H.; Li, X.; Li, Q.; Cruz, C.; Do, S.; Dreyer, K.; Brink, J. Artificial intelligence and machine learning in radiology op-portunities, challenges, pitfalls, and criteria for success. J. Am. Coll. Radiol. 2018, 15, 504–508.Sucher, R.; Sucher, E. Artificial intelligence is poised to revolutionize human liver allocation and decrease medical costs as-sociated with liver transplantation. Hepatobiliary Surg. Nutr. 2020, 9, 679–681. https://doi.org/10.21037/hbsn-20-458.
26. Lau, L.; Kankanige, Y.; Rubinstein, B.; Jones, R.; Christophi, C.; Muralidharan, V.; Bailey, J. Machine-Learning Algorithms Predict Graft Failure After Liver Transplantation. Transplantation 2017, 101, e125–e132. https://doi.org/10.1097/TP.0000000000001600.Park, R.; Lee, S.; Sung, Y.; Yoon, J.; Suk, H.I.; Kim, H.; Choi, S. Accuracy and Efficiency of Right-Lobe Graft Weight Estimation Using Deep-Learning-Assisted CT Volumetry for Living-Donor Liver Transplantation. Diagnostics 2022, 12, 590. https://doi.org/10.3390/diagnostics12030590.
27. Sapir-Pichhadze, R.; Kaplan, B. Seeing the Forest for the Trees: Random Forest Models for Predicting Survival in Kidney Transplant Recipients. Transplantation 2020, 104, 905–906. https://doi.org/10.1097/TP.0000000000002923. Liu, Z.; Liu, Y.; Zhang, W.; Hong, Y.; Meng, J.; Wang, J.; Zheng, S.; Xu, X. Deep learning for prediction of hepatocellular carci-noma recurrence after resection or liver transplantation: A discovery and validation study. Hepatol. Int. 2022, 16, 577–589. https://doi.org/10.1007/s12072-022-10321-y.
28. Wingfield, L.R.; Ceresa, C.; Thorogood, S.; Fleuriot, J.; Knight, S. Using Artificial Intelligence for Predicting Survival of Indi-vidual Grafts in Liver Transplantation: A Systematic Review. Liver Transplant. 2020, 26, 922–934. https://doi.org/10.1002/lt.25772.Lim, J.; Han, S.; Lee, D.; Shim, J.H.; Kim, K.M.; Lim, Y.S.; Lee, H.C.; Jung, D.H.; Lee, S.G.; Kim, K.H.; et al. Identification of he-patic steatosis in living liver donors by machine learning models. Hepatol. Commun. 2022, 6, 1689–1698. https://doi.org/10.1002/hep4.1921.
29. Spann, A.; Yasodhara, A.; Kang, J.; Watt, K.; Wang, B.; Goldenberg, A.; Bhat, M. Applying machine learning in liver disease & transplantation: A comprehensive review. Hepatology 2020, 71, 1093–1105.
30. Sucher, R.; Sucher, E. Artificial intelligence is poised to revolutionize human liver allocation and decrease medical costs as-sociated with liver transplantation. Hepatobiliary Surg. Nutr. 2020, 9, 679–681. https://doi.org/10.21037/hbsn-20-458.
31. Nitski, O.; Azhie, A.; Qazi-Arisar, F.A.; Wang, X.; Ma, S.; Lilly, L.; Watt, K.D.; Levitsky, J.; Asrani, S.K.; Lee, D.S.; et al. Long-term mortality risk stratification of liver transplant recipients: Real-time application of deep learning algorithms on longitudinal data. Lancet Digit Health 2021, 3, e295–e305. https://doi.org/10.1016/S2589-7500(21)00040-6. PMID: 33858815.
32. Park, R.; Lee, S.; Sung, Y.; Yoon, J.; Suk, H.I.; Kim, H.; Choi, S. Accuracy and Efficiency of Right-Lobe Graft Weight Estimation Using Deep-Learning-Assisted CT Volumetry for Living-Donor Liver Transplantation. Diagnostics 2022, 12, 590. https://doi.org/10.3390/diagnostics12030590.
33. Liu, Z.; Liu, Y.; Zhang, W.; Hong, Y.; Meng, J.; Wang, J.; Zheng, S.; Xu, X. Deep learning for prediction of hepatocellular carci-noma recurrence after resection or liver transplantation: A discovery and validation study. Hepatol. Int. 2022, 16, 577–589. https://doi.org/10.1007/s12072-022-10321-y.
34. Lim, J.; Han, S.; Lee, D.; Shim, J.H.; Kim, K.M.; Lim, Y.S.; Lee, H.C.; Jung, D.H.; Lee, S.G.; Kim, K.H.; et al. Identification of he-patic steatosis in living liver donors by machine learning models. Hepatol. Commun. 2022, 6, 1689–1698. https://doi.org/10.1002/hep4.1921.