Colorectal cancer (CRC) is one of the most frequently diagnosed cancers, with more than 1.9 million estimated new cases worldwide
[1]. In Spain, CRC accounted for around 15,288 deaths in 2018, and has an annual age-standardized mortality rate of 30 per 100,000 inhabitants. This makes CRC the sixth-leading cause of death and the second leading cause of cancer-related mortality
[2]. Early diagnosis raises the 5-year survival rate of these patients up to 94%
[3]. Given the high burden of CRC on the National Health Service and the importance of early detection, significant efforts have been directed toward developing CRC screening programs. The main aim of these programs is to remove pre-malignant lesions which could ultimately develop into malignant tumours, as well as to start treatment in early-stage detected cancers. This way, it is expected to reduce CRC incidence and CRC-specific mortality, which has been proven effective
[4].
One of the main problems for CRC is the late diagnosis, giving rise to a decrease in survival since there is a lack of early biomarkers
[5]. Different tools have been developed for CRC screening, which include colonoscopy, flexible sigmoidoscopy, guaiac faecal occult blood testing (gFOBT), faecal immunochemical testing (FIT), and carcinoembryonic antigen (CEA) in plasma, which has low sensitivity and specificity
[6]. Intention-to-treat estimates from meta-analyses of large randomized trials report reductions in CRC mortality of 20–30% for flexible sigmoidoscopy
[7][8], 8–16% for gFOBT, and 41% for FIT and follow-up colonoscopy
[9]. Currently, the screening program in Spain consists of biennial FIT with colonoscopy follow-up on positive subjects, according to the European guidelines
[10]. However, every autonomous region implements this program at a different pace and there are important differences among regions
[11][12]. Although this screening program has led to a decrease in mortality, the performance of this test is suboptimal, with a sensitivity and specificity for CRC of 54–89% and 89–97%, respectively
[13]. Furthermore, it has been noted that this sensitivity may vary with the tumour stage, being lower with early-stage CRC
[14]. This leads to a substantial number of false negative and false positive tests and, consequently, to missed diagnoses or unneeded colonoscopies. Thus, there is an urgent need for more accurate and, ideally, non-invasive tests to implement for CRC screening and monitoring tumour progression and treatment efficacy.
The National Cancer Institute defines genomics as the study of the complete set of DNA (including all of its genes) in a person or other organism. The genome contains all the information needed for an individual to develop and grow. Analyzing the genome may help researchers understand how genes interact with each other and the environment and how certain diseases, such as cancer, diabetes, or heart disease develop. This may lead to new ways to diagnose, treat, and prevent disease
[15]. Genetic alterations have been identified as major players in tumourigenesis. Therefore, genomics has gained attention as a tool to identify genetic markers that can lead to better diagnosis and prognosis and at the same time, allow researchers to improve the understanding of cancer. Apart from gene mutations and single nucleotide polymorphisms (SNP), the epigenetic signature has also proven useful to establish a more personalised diagnosis
[16].
The development of high-throughput methods for genome and gene expression studies has increased the amount of information available. These data are deposited in international public repositories and can be studied by other research groups. NCBI Gene Expression Omnibus (GEO) is the most important database repository of high-throughput gene expression data and hybridization arrays, chips, and microarrays
[17]. The Cancer Genome Atlas (TCGA) of the National Cancer Institute (NCI) is another relevant database in oncology. TCGA is a project to classify the genetic mutations that cause cancer, using genome sequencing and integrating bioinformatics tools to analyse this information
[18].
Finally, the use of metagenomics, which evaluates the microbiome genes, holds special promise for CRC. Metagenomics has shown the potential to identify differences between control and CRC-associated microbiomes and eventually describe new CRC biomarkers
[19].