Predicting the optimal drug responses for individual patients using omics data from different levels of biological molecules
[23][20]. For example, multi-omics can use machine learning methods such as SVMs, random forests, or neural networks to build predictive models that can estimate the efficacy, safety, toxicity, adverse effects, resistance, sensitivity, dosage, and duration of drug responses.
-
4. Different Types of Proteomics Data That Can Be Used for Multi-Omics Analyses
This research, as well as others, highlights the discrepancy between the interactome, proteome, and transcriptome
[37][21]. The discrepancy between interactome/proteome and transcriptome is due to the difference between the levels of transcription of specific genes, translation of mRNA, and protein abundance or interaction in a biological system. This difference can be caused by various factors, such as post-transcriptional regulation, post-translational modification, protein degradation, protein–protein interaction, and environmental stimuli
[38][22]. The discrepancy between interactome/proteome and transcriptome can have significant implications for understanding the molecular mechanisms and functions of biological systems, as well as for identifying potential biomarkers and therapeutic targets for diseases that are linked to protein complexity
[39][23].
Proteome and phosphoproteome are two important concepts in the field of proteomics, which is the study of the entire set of proteins expressed by a cell, tissue, or organism under certain conditions. Proteome refers to the identity, expression levels, and modification of proteins, while phosphoproteome refers to the subset of proteins that are phosphorylated. This is a common post-translational modification that regulates protein function and signaling. Proteomics and phosphoproteomics can provide valuable information for the design of novel therapies
[40][24], especially for diseases such as cancer, where protein expression and phosphorylation are often dysregulated.
Proteomics, ubiquitinome, and phosphoproteomics can also help to characterize molecular mechanisms and target modulators by integrating with other omics data, such as genomics, transcriptomics, and metabolomics. Proteomics can help identify potential biomarkers and protein expression patterns that can be used to assess disease prognosis, tumor classification, and identify potential responders for specific therapies
[44][25]. Phosphoproteomics can help to understand cellular signaling and infer kinase activity, which is a key regulator of many cellular processes and a common target for drug development
[45][26]. In line with these findings, a recent article described an overview of the online data publicly available in the field of cancer research, highlighting the discrepancy between different cancer types and potential multi-omics strategies
[46][27].
The ubiquitinome refers to the set of proteins that are modified by covalently bound ubiquitin molecules, a small protein that regulates protein stability, localization, and function
[47][28]. Ubiquitination is a reversible and dynamic process that can affect various cellular pathways such as the cell cycle, DNA repair, apoptosis, and autophagy. The ubiquitin system is involved in many diseases, such as cardiovascular diseases, cancer, neurodegeneration, inflammation, and infection
[48][29]. Therefore, understanding and manipulating the ubiquitinome, which is the set of all ubiquitinated proteins in a cell or organism, could lead to new therapeutic strategies. One way to use the ubiquitinome for drug discovery is to identify biomarkers or signatures of the ubiquitinome that are associated with certain diseases or conditions. For example, changes in the levels or patterns of ubiquitination of certain proteins can indicate the presence or progression of a disease or the response or resistance to a treatment
[49][30]. By measuring the ubiquitinome using proteomics or other methods, one can diagnose, monitor, or predict the clinical outcomes of patients. Moreover, one can use the information from the ubiquitinome to design novel therapies that target the underlying mechanisms or pathways of the disease, as in the case of Parkinson’s disease (PD)
[50,51][31][32] or cardiovascular diseases such as Noonan syndrome
[52][33].
Glycoproteome analysis is the study of the structure and function of proteins that are modified by glycans, which are complex carbohydrate chains attached to proteins
[53][34]. Glycoproteins are involved in many biological processes and diseases, such as cell signaling, immune response, cancer, and viral infection. Glycoproteome analysis aims to identify glycoproteins, and their glycosylation sites. Glycosylation can affect the structure, stability, folding, interactions, and functions of proteins and thus regulate many cellular processes and pathways. Moreover, glycosylation can also influence the recognition and response of the immune system to foreign or abnormal cells, such as cancer cells or virus-infected cells. MS-based glycoproteome analysis can be performed using two complementary workflows: glycosylation site mapping and glycopeptide analysis. Glycosylation site mapping identifies the potential glycosylation sites that are occupied by glycans on the protein sequence, while glycopeptide analysis characterizes the specific glycan structures and compositions on each site. Glycoproteome analysis can be used as a powerful tool for disease diagnosis and therapy monitoring because glycosylation can serve as a biomarker that reflects the current status of the patient and the changes in the glycome due to disease progression or treatment
[54][35]. Therefore, analyzing the glycome can reveal the alterations in glycosylation that are associated with different diseases, such as cancer, diabetes, Alzheimer’s disease, and infectious diseases. For example, cancer cells often have abnormal glycosylation patterns that affect their growth, invasion, metastasis, and immune evasion.
Protein acetylation is a type of post-translational modification that involves the addition of an acetyl group to a protein molecule. This modification can affect different amino acid residues of the protein, such as lysine, serine, and threonine. However, the most common and well-studied form of protein acetylation is the acetylation of lysine side chains. Protein acetylation can affect the structure, function, and interactions of proteins and regulate various biological processes such as metabolism and signaling. Protein acetylation can also occur at the N-terminus of the proteins, which is called N-terminal acetylation. N-terminal acetylation is catalyzed by a group of enzymes called N-terminal acetyltransferases (NATs). N-terminal acetylation can affect the protein’s lifetime by influencing its degradation, folding, localization, and interactions with other molecules. For example, N-terminal acetylation can protect proteins from being degraded by proteases that recognize unmodified N-termini, or it can target proteins for degradation by specific ubiquitin ligases. Acetylome analysis is the study of the global patterns and dynamics of protein acetylation using mass spectrometry and bioinformatics tools
[55][36]. Acetylome analysis can reveal critical features of lysine acetylation, such as its abundance, distribution, conservation, and functional roles. Furthermore, it can reveal the changes in protein acetylation patterns and levels that are associated with disease pathogenesis and progression
[56][37]. Acetylome can also be a target for therapy development, such as using drugs that modulate the activity of acetyltransferases or deacetylases, which are enzymes that add or remove acetyl groups from proteins. For example, elamipretide and nicotinamide mononucleotide are mitochondrial-targeted drugs that can restore the abundance and acetylation of proteins that are disrupted by aging in mouse hearts
[57][38].
One way to resolve the discrepancy between the interactome, proteome, and transcriptome is to use multi-omics approaches, which integrate data from different levels of biological molecules. For example, one study used multi-omics data from human brain organoids to identify the posttranscriptional regulation of ribosomal genes by a transcription factor called KLF4
[61][39]. Another study used multi-omics data from human melanoma samples to identify the molecular mechanisms and therapeutic targets of a novel oncogene called RREB1
[62][40]. While transcriptomics does not always correlate with protein levels and direct drug responses, the transcriptome can still be important for the design of therapies associated with large chromosomal rearrangements that can, for example, occur in cancer. Several papers have described the integration of RNA sequence, copy number aberration, and methylation to give a better understanding of cellular alterations associated with disease pathogenesis
[63][41]. Such an approach can help identify genes that are affected by multiple types of genomic and epigenomic alterations. A recent article used transcriptome analysis to compare the effects of commonly observed chromosomic deletion on the gene expression in kidney epithelial cells and clear-cell renal cell carcinoma (ccRCC) samples
[64][42].