Alzheimer’s disease (AD) is the most prevalent form of dementia; however, the precise pathophysiology of the disease has not been elucidated yet. Thus, understanding the disease pathology, as well as identification and development of valid biomarkers, are necessary for identifying therapeutic targets and early diagnosis as well as for monitoring disease progression. Mass spectrometry (MS)-based proteomics studies on clinical specimens, in-vitro and in-vivo models of AD has greatly aided such efforts.
Alzheimer’s disease (AD) is the most common progressive neurodegenerative disorder with memory loss, cognitive impairment, disorientation, and psychiatric symptoms. These symptoms render AD patients incapable of independently performing day-to-day activities. With increasing average life expectancy, it is becoming one of the greatest healthcare challenges of the 21st century [1]. The major hallmark of AD is extracellular deposition of amyloid-β (Aβ) as plaques in the brain and the formation of intracellular neurofibrillary tangles (NFTs) composed of the accumulation of hyperphosphorylated tau (p-tau) [2][3][2,3]. Despite advancements in research, definitive causes of the disease remain elusive with no effective treatments available yet [4]. As the understanding of AD pathogenesis is limited, it is difficult to develop novel therapies for AD. Therefore, further understanding of molecular mechanisms and cellular signalling pathways that are responsible for AD pathogenesis is needed for discovering new targets and evolving treatment strategies for AD. Mass-spectrometry-based proteomics will continue to play an important role in the future for discovering these events responsible for AD pathogenesis.
Over the past few decades, several studies on both patient-derived biological samples as well as in vivo and in vitro Alzheimer’s disease models utilizing proteomics technologies have been carried out. These studies have contributed greatly toward understanding the disease pathogenesis as well as the identification of biomarkers (Figure 1).
Figure 1. Biological materials used for understanding Alzheimer’s pathology and for the discovery and validation of candidate biomarkers using proteomics technologies.
The past decade has seen remarkable advances in high-resolution mass-spectrometry-based proteomics that have enabled precise analysis of thousands of proteins from brain tissue samples in a relatively shorter amount of time. Understanding the function of the brain and the related disease mechanisms requires comprehensive knowledge of the changes in the neuronal proteome and post-translational modifications (PTMs). Quantitative proteome analysis of the human hippocampus at different Braak stages has demonstrated alteration in components of the extracellular matrix and calcium-dependent signalling [5]. Multiple large-scale deep proteomics studies have demonstrated the involvement of diverse signalling networks including proteostasis, RNA homeostasis, neuroinflammation, lipid homeostasis, and myelin-axon interactions among others to be associated with Alzheimer’s pathology [6][7][8][9][10][11][12][6-12]. These studies show that a gamut of cellular protein networks and processes are associated with the pathology of this disease. Apart from brain tissues, protein aggregates such as extracellular amyloid-beta plaques and tau neurofibrillary tangles have also been extensively studied using proteomics techniques providing insights into their role in AD pathology and progression [13]. Deep proteome analysis of detergent-insoluble protein aggregates from brain tissue identified ~4000 proteins, including U1–70K and other U1 small nuclear ribonucleoprotein (U1 snRNP) spliceosome components in AD as well as mild cognitive impairment (MCI) samples [14]. Such insights underscore the need for further functional studies to probe the exact role of aggregate-associated proteins in AD.
Studies on human postmortem brain tissue have greatly improved our understanding of the disease, however; there are several limitations to use brain tissue for research purposes. Major limitations being the availability of tissue, quality of the tissue, genetic heterogeneity, pre-terminal medication or concomitant disease, cause of death, and postmortem interval. In-vivo animal model systems serve as a better alternative to postmortem brain tissue in understanding disease mechanisms as many of these issues related to consistency can be controlled. Multiple mouse models have been employed for studying AD-related neurodegeneration. Li et al. carried out proteome profiling of hippocampus of APPSw,Ind J20 mouse and identified alterations in mTOR signalling and β-spectrin, Rab3-mediated APP trafficking, and proteolysis as early molecular events associated with AD pathogenesis [15]. Alteration in synaptic functions associated with dendritic spine morphology, neurite outgrowth, long-term potentiation, CREB signalling, and cytoskeletal dynamics was revealed by systematic analysis of proteome, phosphoproteome, and sialylated N-linked glycosylation of APPswe/PS1ΔE9 mouse olfactory bulb, hippocampus, neocortex, and the brainstem [16].
Although the use of such animal models has been extremely useful translational clinical research is still heavily reliant on human tissue specimens. One of the other alternatives to fresh human tissue is a formalin-fixed and paraffin-embedded (FFPE) tissue sample. These FFPE tissues can be stored for the long term and also have a plethora of associated clinical data like survival time and therapy response. Drummond et al. isolated neuronal cells from temporal cortex FFPE samples with severe Alzheimer’s using the LCM-LC-MS/MS technique and identified more than 400 proteins of which 78% were of neuronal origin and about 200 proteins were associated with Alzheimer’s disease [17]. However, further challenges that are yet to be reliably addressed include limiting the effects of formaldehyde-induced protein modification on protein recovery as well as identification in shotgun proteomics [18]. Another relevant area that is still largely unexplored in the context of Alzheimer’s disease is the use of FFPE samples for the identification of changes in the post-translation modifications.
2.2. Proteomics and Fluid Biomarkers Discovery
The goal of the protein marker discovery is to detect a protein or panel of proteins that differentiate affected patients with a specific disease from healthy individuals. For biomarker analysis, it is preferred that it would be accessible through minimal invasion.
As cerebrospinal fluid (CSF) is in close proximity to brain tissue it has been extensively probed in search of biomarkers for neurological diseases including AD. There are several studies published on deep proteomic analysis of CSF using depletion of high-abundant proteins coupled with high-resolution mass spectrometry to increase coverage of low-abundant proteins [19][20][19,20]. Previously known AD markers including Tau and amyloid-beta peptides have been identified across multiple mass-spectrometry-based proteomics studies and have been successfully developed into targeted assays for quantitation [21][22][23][24][21-24]. Barthelemey et al. have studied tau phosphorylation in CSF, brain, and plasma and reported an association between site-specific changes in tau phosphorylation and evolution of stages of dominantly inherited Alzheimer’s disease [25][26][27][25-27]. Although CSF-biomarker studies have identified several novel protein markers for AD, these studies are largely constrained by limited sample size.
In neurodegenerative disease, the blood-brain barrier is disturbed which results in increased permeability, and greater exchange of proteins between blood and CSF [28]. Several discovery and targeted proteomics studies have been carried out for the identification of serum protein markers despite the enormous challenge of a large dynamic range of protein abundance that spans 12 orders of magnitude [29][30][31][32][33][29-33]. Zetterberg et al. has curated a timeline for recent biomarker studies aimed at the development of clinically implementable blood test for Alzheimer’s summarizing different technologies employed for such research [34]. Dey et. al. used extensive pre-fractionation-based deep proteomic profiling of undepleted human serum for the biomarker discovery for Alzheimer’s disease. They reported a panel of 30 significantly altered proteins related to mitochondrial function including downregulation of AK2 and PCK2 in Alzheimer’s samples [35]. These studies, although limited by sample size, demonstrate serum biomarkers as a viable option despite several challenges posed by being a complex matrix.
Apart from CSF and serum other body fluids such as urine, saliva, and ocular fluids have also been probed in search of candidate biomarkers. Compared to both blood and CSF these fluids are comparatively very easily accessible through noninvasive procedures. Increased levels of urinary Aβ42 have been demonstrated in AD samples and proposed as a marker for both AD diagnosis and monitoring [36]. Expression of tau mRNA and amyloid-β precursor protein (AβPP) has been reported in the salivary gland and human salivary epithelial cells, respectively [37][38][37,38]. Min Shi et al. have also reported the identification of salivary tau and p-tau as AD biomarkers by immunoprecipitation and mass-spectrometry-based identification [39]. Visual symptoms have been reported in many AD patients, and this has piqued an increased interest in discovering ocular biomarkers that might be related to the pathology [40][41][40,41]. Several proteomics studies have been carried analyzing the tear proteome to identify AD markers [42][43][42,43]. These studies demonstrate the feasibility of using these alternative fluids as targets for biomarker discovery. Such markers also have the potential to be used for screening asymptomatic subjects, dramatically increasing the window for therapeutic interventions.
Proteomics technologies that are employed toward the identification of the AD-related changes in CSF, plasma, and tissue are broadly categorized as discovery or targeted proteomics (Figure 2). The discovery-based approach has been primarily used for the quantitation of changes in molecular signalling networks that are associated with AD pathology. While a targeted proteomics approach is being used for monitoring and quantitation of proteins that are already known to be associated with AD pathogenesis and can be used as a candidate marker.
Figure 2. Overview of discovery and targeted proteomics technologies: Panel 1 describes label-free and label-based quantitation techniques used in discovery proteomics to identify differentially expressed proteins across samples by comparing either intensities and peak area of peptides (label-free quantitation) or reporter ion intensities (label-based quantitation) generated after fragmentation of labeled peptide precursors. Panel 2 describes targeted proteomics techniques used for monitoring peptides across samples by either sequential selection and monitoring of precursor-product ion pair (MRM) or simultaneous monitoring of all fragment ions of a selected precursor ion (PRM) or targeted extraction of fragment ion intensities from a mixed spectrum of fragment ions generated from multiple precursor ions selected by using a small window of m/z (DIA).
3.1. Discovery-Based Proteomics Analyses
Data-dependent acquisition or information-dependent acquisition has been the cornerstone of shotgun-discovery-based proteomics technology. In the data-dependent acquisition (DDA) the instrument cycles through a short high-resolution MS1 survey scan of the peptides followed by series of quick low-resolution MS2 scans for detection of the fragment ions of peptides selected based on the intensity. Label-free proteomic analysis approach has been used for quantitation of proteins for Alzheimer’s disease (AD) brains versus normally aged brains [44] and membrane-enriched proteome from postmortem human brain tissue in Alzheimer’s disease [45]. Several studies have carried out proteomics analysis for the discovery of biomarkers using either iTRAQ or TMT-based multiplexing approach. Sathe et. al. identified several known and novel CSF protein biomarker including MAPT, NPTX2, PKM, and YWHAG using TMT-based quantitative proteomics [19]. Adav et. al. used iTRAQ-based quantitative proteomics for identifying alterations in mitochondrial proteome of the human brain tissues of healthy and AD individuals.
Apart from the changes in cellular protein levels characterization of changes in protein post-translational modifications (PTMs) is needed for the understanding of the complex regulatory circuits that drive disease pathology. There are more than 200 types of PTMs that are known to control cellular functions. Domenico et al. identified significant alteration in neuronal processes such as formation, outgrowth, and guidance of neurites on quantitative phosphoproteomics analysis of the hippocampus of the Alzheimer’s disease subjects [46]. Kempf et al. carried out proteomics, phosphoproteomics, and N-glycosylation changes in the four different parts of the brain of the APP/PS1 Alzheimer’s mouse model identifying a brain-region specific response in CREB-mediated synaptic signalling [47]. Chou et al. have reviewed the effects of multiple post-translational modifications including phosphorylation, acetylation, SUMOylation, O-GlcNAcylation, and ubiquitination of tau on AD pathology [48]. Rayaprolu et al. describe how a network-based proteomics approach can be leveraged to improve our narrow understanding of AD biology beyond amyloid plaques and tau phosphorylation [49]. One of the major advantages of PTM proteomics studies is that the altered kinases identified by them will be also helpful as potential therapeutic targets in the future.
3.2. Targeted Proteomics Analyses
Once the discovery data are acquired, rigorous statistical methods are applied to identify significantly differential proteins that can be used as potential markers for the disease. To quantitate the abundance of these markers reproducibly, highly robust and reproducible methods are required. In general proteomics, workflow DDA and targeted proteomics are used in tandem or parallel. In targeted proteomics, multiple- or parallel-reaction monitoring (MRM or PRM) and data-independent analysis (DIA) are generally used (Figure 2).
In targeted proteomic analysis such as MRM or PRM precursor ions of selected peptides are filtered and fragmented and only desired fragment ions recorded. An increase in CSF concentrations of Alpha-, Beta-, and Gamma-synuclein has been demonstrated in Alzheimer’s patients using the MRM-proteomics approach [50]. Similarly, Wesseling et al. reported heterogeneity on tau phosphorylation sites across AD patients using FLEXItau, an MRM-based assay [51][52][51,52]. Using a label-free PRM approach Sathe et. al. demonstrated that NPTX2, in combination with PKM or YWHAG in CSF differentiate AD from control samples [19]. Because of their high specificity and multiplexing abilities, targeted proteomics assays have become a very valuable tool for assessing proteins for which commercial antibodies are not available.
Data-independent analysis (DIA) is a recent advancement and a rapidly growing technology in proteomics analysis [53]. In DIA experiments MS/MS data from all detectable precursor ions present in a biological sample are recorded generating a complete and permanent digital archive of the sample. Chang et al. used DIA for quantifying the synaptosomal proteome from the hippocampus and motor cortex tissue from Alzheimer’s subjects. They reported differential expression of 30 proteins including 17 novel proteins in AD [54]. Bader et al. characterized CSF proteome changes that accompany AD in three independent cohorts consistently quantifying more than 1000 proteins [55]. They reported 20 proteins to be differentially expressed in AD-CSF samples including tau, SOD1, PARK7, YKL-40, and novel biomarker candidates such as YWHAG, PKM, and ALDOC. Bader et al. provide a novel application of DIA proteomics strategies to drive a shift in the clinical proteome paradigm toward a population-wide discovery and validation of biomarker candidates that exonerate the individual-specific effects observed in small pilot studies.
Recent developments in proteomics sample preparation methodologies along with advances in high-throughput mass spectrometry have improved the proteome coverage. Several quantitative proteomics methods including label-free, isobaric labelling based in the discovery platform and MRM, PRM, and DIA in the targeted platform have been employed for AD biomarker discovery and understanding AD pathogenesis. These advancements in technologies have rapidly generated several datasets on various biofluids, tissues, and animal models related to Alzheimer’s disease. Although these studies have dramatically improved our understanding of disease pathology, as the complexity of the disease is high, studies at a large scale in diverse populations will be required in the future for the identification of robust biomarkers. One of the other limitations in the identification of robust and reliable biomarker candidates is the current “triangular strategy” of discovering candidates from a smaller subset of samples and validation in a larger cohort. Where many of the identified candidates turn out to be specific to the discovery cohort and fail at validation stages in larger cohorts. Such limitations are now being addressed by a shift in the study design of clinical discovery proteomics with a “rectangular strategy”. Such strategies employ high proteome depth workflows for simultaneous discovery and validation on hundreds of samples in a population-wide setting. In the future, it is also necessary to have a stringent reporting system for study and patient selection criteria, clinical metadata, and parameter used for analysis. This information will be useful for comparison of the datasets as well as to build a knowledge base that will help to further our understanding of the disease and associated neurodegeneration.