编码区和非编码区 SNP 的功能机制

编码区和非编码区 SNP 的功能机制: History

Please note this is an old version of this entry, which may differ significantly from the current revision.

Contributor: Wenmin Yang ,

Te Zhang

, Xuming Song , Lin Xu , Gaochao Dong , Feng Jiang

Cancer ranks as the second leading cause of death worldwide, and, being a genetic disease, it is highly heritable. Over the past few decades, genome-wide association studies (GWAS) have identified many risk-associated loci harboring hundreds of single nucleotide polymorphisms (SNPs). Some of these cancer-associated SNPs have been revealed as causal, and the functional characterization of the mechanisms underlying the cancer risk association has been illuminated in some instances.

genome-wide association analysis
single nucleotide polymorphism
cancer
molecular and biological mechanism

1. Functional Mechanisms of Coding Region SNPs

Single nucleotide polymorphisms (SNPs) located in coding regions can be divided into two types: synonymous and non-synonymous mutations. Although synonymous mutations do not affect the amino acid sequence of the protein, they may change the expression of the protein by affecting post-transcriptional modifications, translation rates, and other processes. In contrast, non-synonymous SNPs (nsSNPs) cause the substitution of amino acids, thereby resulting in changes to the protein structure, its physical and chemical properties (stability, solubility, etc.), and its function. At present, there are many biological software packages (such as SIFT(Sorting Intolerant From Tolerant), F-SNP(the Functional Single Nucleotide Polymorphism) and PolyPhen) that can be used to predict the effect of nsSNPs on protein structure and function ^[1]^[2]^[3]. Compared to SNPs located in gene non-coding regions, the functional mechanism underlying tumor-associated nsSNPs is relatively simple ^[4]. Combined with whole exon analysis, several coding region SNPs have been identified to be associated with colorectal cancer development. For example, the missense mutation rs3184504 (p. trp263ARg) located in a domain of SH2B3 may change the function of the protein in the context of regulating cell division. Other coding variants may also affect variable shear (RS16888728, UTP23) ^[5]. The mechanism by which SNPs located within the coding regions of genes affect the risk of disease is inseparable from the function of the resulting coded proteins.

Some risk loci exert an effect on the amino acid sequence of the produced protein. Examples include BRCA2 p.Lys3326Ter (rs11571833), and CHEK2 p.Ile157Thr (rs17879961) in lung ^[6] and breast ^[7] cancers. The mechanistic interpretation of such variants is presumed to be relatively simple. In addition to the aforesaid, coding SNPs can affect RNA processing; an example is rs78378222 in the 3′ untranslated region of TP53, whereby the risk-corrected variation alters the sequence AATAAA to AATACA, thereby changing the polyadenylation signal of TP53, and ultimately resulting in the impaired 3′-end processing of TP53 mRNA ^[8]^[9]. Some variants can also affect splicing. Tian and his colleagues identified a single-nucleotide variation in the ELP2 gene that affect ELP2 exon pre-mRNA splicing through splicing a quantitative trait locus (sQTL) ^[10].

Researchers have often focused on specific signaling pathways, genes, and genetic modifications of interest, while also performing whole-exon association analysis (GWAS) to find any relevant coding SNPs with large effects on these molecules and processes. For example, Li and colleagues used exon sequencing and conducted an association analysis of 12 important genes involved in TGF-β signaling to find that low-frequency causative variation in the TGF-β pathway contributes to colorectal carcinoma (CRC) susceptibility. They discovered that the missense variation rs3764482 (c. 83C>T; p. S28F) located in the gene SMAD7 was consistently and strongly associated with CRC risk. The rs3764482 allele T was more effective compared to the dominant allele C in limiting TGF-β signaling and reducing the phosphorylation of receptor-regulated SMADs (R-SMADs) via impeding the activation of downstream genes, thereby promoting cancer cell proliferation and contributing to CRC pathogenesis ^[11].

Coding SNPs may also affect gene and protein modifications. The N6-methyladenosine (m6A) modification is critical for ensuring messenger RNA stability and is involved in many biological activities, including pre-mRNA splicing, 3’-end processing, nuclear export, translation regulation, mRNA degradation, and the DNA damage response ^[12]^[13]. The m6A methylation modification occurs in the messenger RNA(mRNA) and can be formed by methylation “writers” and removed by demethylation “erasers” ^[14]. Rs8100241, located in the gene ANKLE1, was identified to be associated with susceptibility to both CRC and breast cancer. The presence of the rs8100241 risk allele A (Figure 1a) combined with the m6A “writer” complex (comprised of the proteins METTL3, METTL14, and WTAP) and the m6A “reader” protein (YTHDF1) was found to increase the levels of the m6A modification on the gene ANKLE1 and consequently increase its protein expression. Mechanistically, ANKLE1 functions as a potential tumor suppressor by decreasing CRC cell proliferation while maintaining genomic integrity, thereby contributing to a lower risk of CRC ^[15].

Figure 1. Schematic diagram of the action mechanism employed by coding SNPs. (a) The A allele of the rs8100241 variant, which is found in the ANKLE1 second exon region, has been linked to a lower risk of CRC by increasing ANKLE1 mRNA m6A levels and thus facilitating ANKLE1 protein expression, thereby potentially functioning as a negative regulator to hinder cell growth by maintaining genomic stability. (b) Interaction between the TCF7L2 missense variant rs138649767 and a regulatory variant rs6983267 in the MYC enhancer and promoter on the expression of MYC.

Notably, coding SNPs may interact with other SNPs to produce a stronger functional role ^[5]. The rs138649767 A allele (Figure 1b) located in the exon region of TCF7L2 can activate the MYC enhancer containing rs6983267 allele G to promote the expression of MYC ^[16]. SNPs occurring in the exons and introns of SMAD7 may affect its regulation and jointly affect downstream signaling pathways involving SMAD7 and TGFβ ^[11]. As a result, while examining SNPs in coding regions, the interactions between them should be taken into account to better understand their functional processes.

2. Functional Mechanisms of Non-Coding Region SNPs

Accumulating evidence shows that a SNP in non-coding regions is the most common type of genetic variation in the human genome, accounting for 90% of inter-individual variation ^[17]^[18]. Depending on the location, the region can harbor a response element that is either proximal (promoter, enhancer, or super-enhancer) or distal (intergenic or intra-genic). The risk loci identified by GWAS were located in the genomic regions of cell type-specific active chromatin, and most of them were quantitative trait loci, methylation quantitative trait loci and transcription factor (TF) binding related loci. Chromatin conformational studies have helped to link regulatory regions localized by SNPs to their respective target genes ^[17]^[19]^[20]. These loci may be involved in gene transcription, post-transcriptional processing, translation, post-translational modifications, and other processes to regulate gene expression. Many target genes have been identified using expression quantitative trait loci (eQTL) to detect the relationship between SNPs and gene expression. Non-coding SNPs can regulate the transcription of target genes by sequence-proximal (cis)- or distal (trans)-interactions. Studies have found that histone modifications in the regions of such risk SNPs are particularly abundant, especially those related to promoter and enhancer activities (H3K4me3, H3K4me1, H3K27ac). Most SNPs are predicted to destroy the binding motifs of specific transcription factors. For example, rs6983267 may change the binding of transcription factors such as MYC, CTCF, and TCF7L2 ^[21]. In addition to affecting gene transcription levels by altering transcription factor-binding sites (TFBS), non-coding SNPs also change epigenetic modifications and/or the chromatin structure to influence target gene expression. Through the above method, non-coding SNP participates in cell proliferation, apoptosis, migration, and invasion.

2.1. Genetic Variants That Alter Promoters

A promoter is a sequence of DNA that is recognized, bound, and serves to initiate transcription by RNA polymerase. Promoters contain variations of a conserved sequence required for the specific binding of RNA polymerase and transcription initiation. Most promoters are located upstream of the transcription initiation point of structural genes, and the promoter itself is not transcribed ^[22]. Promoters are located upstream of the 5’ end of a given structural gene, and they activate RNA polymerase to bind accurately to the template DNA with specificity for inducing the initiation of transcription ^[22]. Promoters do not control gene activity themselves; rather, gene activity is regulated by binding to proteins called transcription factors (TF). SNPs within promoter regions generally play a regulatory role by influencing the binding of such transcription factors. A recently reported example is that of the SNP rs13278062 located in the promoter of death receptor 4 (DR4) which confers an altered risk of colorectal cancer. The study revealed that the rs13278062 G>T variant changed the binding affinity of the transcription factor Sp1/NF1, increased the expression of DR4, and thus suppressed carcinogenesis and metastasis of colorectal cancer ^[23]. The MPO promoter SNP rs2333227 increases the malignant characteristics of colorectal cancer by changing the promoter’s affinity to AP-2α ^[24]. The variant SNP rs10993994 located in the upstream promoter of the gene MSMB is also found to be overrepresented in individuals with prostate cancer; this is attributed to stronger CREB binding and thus increased promoter activity ^[25]. Furthermore, the SNP rs11672691 is a risk locus associated with prostate cancer that is related to the lncRNA PCAT19. The non-risk variant rs11672691 and its linkage disequilibrium (LD) SNP rs887391 are more likely to bind the TFs NKX3.1 and YY1 to the PCAT19-short promoter, thereby leading to increased promoter but lower enhancer activity, which then activates PCAT19-short, and ultimately results in lower prostate cancer susceptibility ^[26]. SNPs in promoter regions of multiple genes, including TERT, KLHDC7A, PIDD1, and ESR1, have been discovered in breast cancer by GWAS, with reporter studies revealing that independent risk alleles change target promoter activity ^[27]^[28]. Most of the reported promoter changes exert their regulatory effects by altering TF binding. The SNP rs3824662 allele A (Figure 2a) increases chromatin accessibility by changing the TF GATA3 expression, promoting the binding of GATA3 with the CRLF promoter, and ultimately forming a chromatin loop ^[29].

Figure 2. Schematic diagram of the action mechanism employed by non-coding SNPs. (a) The SNP rs3824662 allele A increases chromatin accessibility by inducing GATA3 expression, promoting the binding of GATA3 with the CRLF promoter, and ultimately forming a chromatin loop. (b) The NTN4 enhancer risk variant rs11836367 binds to the TF GATA3 to regulate NTN4 expression, ultimately promoting breast carcinoma initiation and progression. (c) Enhancer SNP rs7959129 risk allele G interacts with promoter SNP rs6192603 risk allele G contributing to ATF1 expression by binding TFs GATA3 and SP1. (d) The risk allele rs11986220 and higher methylation at –10 Kb synergistically function to confer a greater risk of tumor; however, when -20 Kb is hypomethylated, the function of the risk SNP is inhibited by the enhancer-blocking insulator loop mediated by CTCF. (e) The risk variant rs11655237 in LINC00673 creates a miR-1231–binding site that interferes with the expression of LINC00673 and contributes to pancreatic cancer susceptibility.

2.2. Genetic Variants That Alter Enhancers

Enhancers are regions of DNA sequence that can increase the cis-acting transcription of their target gene sequences. Enhancers each differ in their distance from their target promoter(s); in mammalian species, an enhancer can be 100 bp to Mb away from their target gene ^[30]. Enhancers, unlike promoters, can be found anywhere in a gene; they can be positioned either upstream or downstream of their target genes, or even within another gene’s gene body, and enhancer regulation can circumvent other genes irrespective of their orientation. Enhancers must bind to specific protein factors to enhance the transcription of their target. Enhancers generally have tissue or cell specificity, whereby they only show activity in certain cells or tissues, which is determined by the specific protein factors present in these cells or tissues ^[31]. Enhancers are typically recognized by the epigenetic marks H3K4me1 and H3K27ac, which are present in active enhancer elements. Conversely, H3K27me3 is regarded as a silent epigenetic mark associated with lower enhancer activity ^[32]^[33]. GWAS-identified risk loci for common illnesses are often found in non-coding areas, and many of these are thought to function as enhancers ^[34]. According to emerging data, these SNPs may influence gene regulation by changing the binding of important TFs to critical transcriptional enhancers ^[35].

2.2.1. Breast Cancer

Of all cancers, breast cancer has so far yielded the greatest number of discovered risk loci ^[36]. Understanding the driving mechanism(s) underlying malignant transformation provides the prospect of combating cancer recurrence and treatment resistance. Zhang et al. identified that the SNP rs4971059 resides in the sixth intron and within an active enhancer element of the TRIM46 gene. By using CRISPR/Cas9-mediated homologous recombination, they constructed the SNP rs4971059 with the allele G converted to allele A, thereby resulting in TRIM46 overexpression, boosting breast carcinoma cell growth, enhancing chemotherapy resistance in vitro, and hastening tumor development in vivo ^[37]. In addition, Yang and colleagues (Figure 2b) reported the noncoding regulatory variant rs11836367 at the NTN4 locus (12q22) and identified it to be associated with the risk of breast carcinoma as a causal variant. The rs11837367 protective T allele promotes GATA3 binding to the distal enhancer and increases NTN4 expression ^[38].

2.2.2. Prostate Cancer

Several studies have independently identified several genes in specific prostate cancer (PCa) susceptibility loci that are either controlled by causative SNPs containing a cis-regulatory element (CRE) or have been indicated as SNP-associated genes ^[39]. SNP rs339331 at 6q22 was found to be a prostate cancer risk-associated variant. The risk allele T of rs339331 has been found to augment the enhancer-binding of HOXB13, alter the level of the RFX6 protein in an allele-specific manner, and confer a predisposition to prostate cancer ^[40]. Recently, Huang et al. also identified that the PCa-associated rs11672691 located within an enhancer element can change the binding site of HOXA2, which in turn promotes oncogenesis by impacting the expression of nearby genes ^[41].

Notably, there are other cases of SNPs causing DNA-binding polymorphisms in distinct transcription factors. For example, a gastric cancer risk-associated polymorphism (rs2978980 T>G) that is situated in an intronic enhancer of lncPSCA has been found to disrupt the binding of the transcription factor RORA, thereby resulting in lower lncPSCA expression in an allele-specific manner ^[42]. As another example, the rs2647046 enhancer has been found to interact with the HLA-DQB1-AS1 promoter to alter its expression via a CTCF-mediated long-range loop in an allele-specific manner, thereby conferring susceptibility to hepatocellular cancer (HCC) ^[43]. Another variation on chromosome 11q13.3 in a distant intergenic region has been characterized as a susceptibility locus for renal cell cancer. To control transcription, the 11q13.3 locus encodes a long-range enhancer that physically connects with the CCDN1 promoter ^[44]. Interestingly, SNP sites can act as promoters and enhancers simultaneously, and their conversion is determined by the background genotype. As a result, one gene can produce several different RNAs that are involved in the development of diseases. The SNP rs11672691 mediates promoter and enhancer switching under different genotypes. A risk-associated sequence in the PCAT19-long enhancer interacts with the PCAT19-long promoter to enhance prostate cancer development through activating cell cycle genes ^[26].

2.2.3. Colorectal Cancer

GWAS have identified numerous colorectal cancer risk loci, but only a fraction of the target genes of these loci have been systematically interrogated. For example, Yu et al. identified a common SNP (rs7198799) in the intron of the gene CDH1. They demonstrated that the risk allele C of rs7198799 acts as an enhancer that can target the TF NFATC2 and remotely enhance ZFP90 expression ^[45]. A prominent mechanism by which SNP variants can affect cell-specific enhancer function is via altered TF binding, thus regulating the target gene’s expression. Tian et al. identified two risk SNPs (rs61926301 and rs79591129) located in the ATF1 promoter and first intron, respectively. These are enriched in enhancer regions and open chromatin, which are also associated with H3K4me1, H3K27ac, and ATAC-seq peaks. The two variants increase the expression of ATF1 through preferentially binding to the two TFs SP1 and GATA3 ^[46]. Rs174575 can act as a specific remote enhancer of FADS2 and lncRNA-AP002754.2 with the participation of the transcription factor E2F1. Interestingly, TF E2F1 can promote the expression of FADS2, form a chromatin loop, and affect the occurrence of colorectal cancer ^[47].

2.3. Genetic Variants That Affect Promoter–Enhancer Interactions

Promoter–enhancer interactions (PEIs) underlie differential transcriptional regulation. Several technologies (chromosome conformation capture (3C), Hi-c, and H3K27Ac-HiCHIP) allow for the study of long-range cis-regulation ^[48]^[49]^[50]. Promoter–enhancer interactions are essential events involved in the current theory of transcriptional control. So far, there is little evidence that PEIs are required for the transcriptional control of an enhancer’s target gene. The insertion or deletion of promoters, the absence of certain PEI-associated proteins, and the inclusion of PEI-disrupting insulators all have an effect on the expression of target genes. Tian et al. found two risk variants (rs1926301 and rs7959129) located in the ATF1 promoter and intron, respectively; the former binds the TF SP1 while the latter binds the TF GATA3 (Figure 2c). They found that these two risk sites increase the interaction between the promoter and enhancer by binding SP1 and GATA3, facilitating ATF1 expression, and conferring hereditary susceptibility to CRC ^[46]. Moreover, the SNP rs11672691 mediates promoter and enhancer switching in a manner dependent on different background genotypes. The risk is determined by the PCAT19-long enhancer interacting with the PCAT19-long promoter, thereby altering prostate cancer development through activating cell cycle genes ^[26].

2.4. Genetic Variants That Alter 3D Genome Architecture

Within the nucleus, genomic DNA folds into a three-dimensional structure organized at different levels by the formation of chromatin rings. These structures can bring distant enhancers near their target promoters to affect gene expression and regulation. The chromosomes fold into chromatin characterized by sequence-regulating spatial interactions that are key to maintaining normal cell status and function. In cancer genomes, structural variation typically results in changes to the genome’s 3D structure and, as a result, alterations in genome-mediated transcriptional control ^[51]. Changes in the three-dimensional genome architecture or high-order chromatin structure are linked to the development and progression of several diseases ^[52]^[53]. Long-distance chromatin looping regulates cancer susceptibility genes either actively or passively. Enhancers frequently form long-range chromatin loops with their target gene promoter regions to affect gene expression. The 9q22 locus, for example, contains the thyroid cancer risk-related SNP rs965513, which demarcates a 33-kb linkage disequilibrium block (including the lead SNP rs965513) that is strongly linked with PTC risk. The chromatin characteristics and regulatory element signatures of this block indicate at least three regulatory elements that operate as enhancers. Using chromosomal conformation capture technology, researchers have observed the long-range looping connections of these elements with the promoter region shared by FOXE1 and PTCSC2 in a human papillary thyroid cancer cell line (KTC-1) and unaffected thyroid tissue ^[54]. Similarly, Zhang et al. discovered that the rs1859962 risk-associated LD block contains a PCa-specific enhancer that forms a 1-Mb chromatin loop with the SOX9 gene. This study found that the rs1859962 PCa risk LD block contacts SOX9 via a long-distance chromatin loop that connects it to the E1 enhancer ^[55].

CTCF is a transcription factor that promotes long-range chromosomal contact via looping. Hoffman et al. discovered that one allele in the Igf2/H19 imprinting control region (ICR) on chromosome 7 colocalized with one allele of Wsb1/Nf1 on chromosome 11. The lack of CTCF or the ablation of the maternal ICR was found to eliminate this connection and alter the expression of the Wsb1/Nf1 gene ^[56]. This finding confirmed the importance of CTCF in the control of the shape of chromatin and the resulting gene expression. On the other hand, the unique contribution of CTCF is that of an insulator. Insulators are short nucleotide sequences that determine the boundaries of genomic areas that are close to one another ^[57]. When CTCF binds to an insulator region, it inhibits gene transcription by interfering with the communication between an enhancer and a gene promoter ^[58]. Ahmed M. et al. identified (Figure 2d) noncoding cis-regulatory elements (rCRE) by performing CRISPRi screens. They discovered that the 8q24.21 area is widely marked with H3K27ac and has a significant binding affinity to AR, FOXA1, and HOXB13, all of which are important transcription regulators for PCa pathogenesis ^[59]. Using an integrated approach involving ChIP, Hi-C, CRISPR, and functional rescue, researchers also discovered that the rs11986220 containing the rCRE sequence interacts with the MYC promoter in V16A cells but not in 22Rv1 cells, as the promoter–CRE interaction is typically facilitated by a CTCF site in a 10 kb region upstream, which prevents chromatin looping ^[59]. Similarly, the rs6702619 region is inhabited by CTCF, which acts as an insulator with long-range physical interactions with CRC-relevant loci ^[60]. Understanding CTCF-mediated 3D genomic architecture will aid in understanding the mechanism of action underlying noncoding GWAS SNPs at either CTCF sites or regulatory enhancer sites ^[61].

2.5. Genetic Variants That Influence the Binding of miRNA

MicroRNAs (miRNAs) are noncoding RNA molecules that influence gene expression via regulating messenger RNA degradation and translation. MicroRNAs are normally excised by the RNase iii enzyme Dicer from 60–110 nucleotide long hairpin precursor (folded) RNA structures (pre-miRNAs), which are then integrated into the RNA-induced silencing complex (RISC). The pro-miRNA sequence is transcribed by Pol-II ^[62]. Accumulating evidence suggests that miRNAs play a key role in carcinogenesis by binding to the 3’-UTR of target mRNAs ^[63]. MiRNA mutations or their misexpression have been associated with human malignancies and alterations in cancer-associated gene expression ^[64]. Hoffman et al. detected a variant (rs11614913) in has-miR-196a-2 using GWAS to screen genetic variants in 15 miRNAs. This SNP was identified to be associated with decreased breast cancer risk ^[65]. Previous research has confirmed that the methylation of ^[66] islands in miRNA regions may change miRNA function, thereby influencing carcinogenic pathways. The author and his colleagues found that a CpG island in the region upstream of the miRNA precursor is associated with breast cancer risk ^[65]. The ATF1 rs11169571 variant was shown to be strongly related to ATF1 expression by influencing hsa-miR-1283 and hsa-miR-520d-5p binding, which may increase susceptibility to colorectal cancer ^[16]. In addition, SNPs located in the 3’UTR region of MDM4, CD44, LAMC1, and other genes exert a similar mechanism ^[67]^[68]^[69].

Some SNPs within long non-coding RNA can also change their binding affinity to miRNAs. The variant loci rs1317082, discovered at exon 1 of lncRNA RP11-362K14.5 (CCSlnc362), establishes a binding site for miR-4658, which consequently reduces CCSlnc362 expression and confers lowered susceptibility to CRC ^[70]. The link between rs140618127 in the lncRNA LOC146880 with non-small cell lung cancer involves a miR-539-5p binding site. The combination of miR-539-5p and LOC146880 has been found to result in the reduced activation of the oncogene ENO1. Reduced ENO1 phosphorylation also results in lower PI3K and Akt activation, which is linked to decreased cell proliferation and tumor formation ^[71]. Moreover, the SNP rs11655237 allele G in LINC00673 exon can create a miRNA binding site that increases the function of LINC00667 expression (Figure 2e). Furthermore, rs67311347 in RCC ^[72], rs12982687 in CRC ^[73], and rs16854802 in neck squamous cell carcinoma (HNSCC) ^[74] are SNPs in lncRNA sequences that affect target gene expression by binding with miRNA. If a SNP occurs within miRNA, it will consequently affect the binding affinity of the miRNA to target genes.

This entry is adapted from the peer-reviewed paper 10.3390/cancers14225636

References

Ng, P.C.; Henikoff, S. SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003, 31, 3812–3814.
Lee, P.H.; Shatkay, H. F-SNP: Computationally predicted functional SNPs for disease association studies. Nucleic Acids Res. 2008, 36, D820–D824.
Ritchie, G.R.; Flicek, P. Computational approaches to interpreting genomic sequence variation. Genome Med. 2014, 6, 1760.
Theodoratou, E.; Farrington, S.M.; Timofeeva, M.; Din, F.V.; Svinti, V.; Tenesa, A.; Liu, T.; Lindblom, A.; Gallinger, S.; Campbell, H.; et al. Genome-wide scan of the effect of common nsSNPs on colorectal cancer survival outcome. Br. J. Cancer 2018, 119, 988–993.
Timofeeva, M.N.; Kinnersley, B.; Farrington, S.M.; Whiffin, N.; Palles, C.; Svinti, V.; Lloyd, A.; Gorman, M.; Ooi, L.-Y.; Hosking, F.; et al. Recurrent Coding Sequence Variation Explains Only a Small Fraction of the Genetic Architecture of Colorectal Cancer. Sci. Rep. 2015, 5, 16286.
Wang, Y.; McKay, J.D.; Rafnar, T.; Wang, Z.; Timofeeva, M.N.; Broderick, P.; Zong, X.; Laplana, M.; Wei, Y.; Han, Y.; et al. Rare variants of large effect in BRCA2 and CHEK2 affect risk of lung cancer. Nat. Genet. 2014, 46, 736–741.
Michailidou, K.; The Breast and Ovarian Cancer Susceptibility Collaboration; Hall, P.; Gonzalez-Neira, A.; Ghoussaini, M.; Dennis, J.; Milne, R.L.; Schmidt, M.; Chang-Claude, J.; Bojesen, S.E.; et al. Large-scale genotyping identifies 41 new loci associated with breast cancer risk. Nat. Genet. 2013, 45, 353–361.
Stacey, S.N.; Sulem, P.; Jonasdottir, A.; Masson, G.; Gudmundsson, J.; Gudbjartsson, D.F.; Magnusson, O.T.; Gudjonsson, S.A.; Sigurgeirsson, B.; Thorisdottir, K.; et al. A germline variant in the TP53 polyadenylation signal confers cancer susceptibility. Nat. Genet. 2011, 43, 1098–1103.
Enciso-Mora, V.; Hosking, F.J.; Di Stefano, A.L.; Zelenika, D.; Shete, S.; Broderick, P.; Idbaih, A.; Delattre, J.-Y.; Hoang-Xuan, K.; Marie, Y.; et al. Low penetrance susceptibility to glioma is caused by the TP53 variant rs. Br. J. Cancer 2013, 108, 2178–2185.
Tian, J.; Chen, C.; Rao, M.; Zhang, M.; Lu, Z.; Cai, Y.; Ying, P.; Li, B.; Wang, H.; Wang, L.; et al. Aberrant RNA Splicing Is a Primary Link between Genetic Variation and Pancreatic Cancer Risk. Cancer Res. 2022, 82, 2084–2096.
Li, J.; Zou, L.; Zhou, Y.; Li, L.; Yang, Y.; Gong, Y.; Lou, J.; Ke, J.; Zhang, Y.; Tian, J.; et al. A low-frequency variant in SMAD7 modulates TGF-β signaling and confers risk for colorectal cancer in Chinese population. Mol. Carcinog. 2017, 56, 1798–1807.
Roundtree, I.A.; Evans, M.E.; Pan, T.; He, C. Dynamic RNA Modifications in Gene Expression Regulation. Cell 2017, 169, 1187–1200.
Frye, M.; Harada, B.T.; Behm, M.; He, C. RNA modifications modulate gene expression during development. Science 2018, 361, 1346–1349.
Yue, Y.; Liu, J.; He, C. RNA N6-methyladenosine methylation in post-transcriptional gene expression regulation. Genes Dev. 2015, 29, 1343–1355.
Tian, J.; Ying, P.; Ke, J.; Zhu, Y.; Yang, Y.; Gong, Y.; Zou, D.; Peng, X.; Yang, N.; Wang, X.; et al. ANKLE1N6-Methyladenosine-related variant is associated with colorectal cancer risk by maintaining the genomic stability. Int. J. Cancer 2020, 146, 3281–3293.
Chang, J.; Tian, J.; Yang, Y.; Zhong, R.; Li, J.; Zhai, K.; Ke, J.; Lou, J.; Chen, W.; Zhu, B.; et al. A Rare Missense Variant in TCF7L2 Associates with Colorectal Cancer Risk by Interacting with a GWAS-Identified Regulatory Variant in the MYC Enhancer. Cancer Res. 2018, 78, 5164–5172.
Sud, A.; Kinnersley, B.; Houlston, R. Genome-wide association studies of cancer: Current insights and future perspectives. Nat. Cancer 2017, 17, 692–704.
Maurano, M.T.; Humbert, R.; Rynes, E.; Thurman, R.E.; Haugen, E.; Wang, H.; Reynolds, A.P.; Sandstrom, R.; Qu, H.; Brody, J.; et al. Systematic Localization of Common Disease-Associated Variation in Regulatory DNA. Science 2012, 337, 1190–1195.
Wei, G.-H.; Liu, D.-P.; Liang, C.-C. Charting gene regulatory networks: Strategies, challenges and perspectives. Biochem. J. 2004, 381, 1–12.
Wei, G.H.; Liu, D.P.; Liang, C.C. Chromatin domain boundaries: Insulators and beyond. Cell Res. 2005, 15, 292–300.
Law, P.J.; The PRACTICAL consortium; Timofeeva, M.; Fernandez-Rozadilla, C.; Broderick, P.; Studd, J.; Fernandez-Tajes, J.; Farrington, S.; Svinti, V.; Palles, C.; et al. Association analyses identify 31 new risk loci for colorectal cancer susceptibility. Nat. Commun. 2019, 10, 2154.
Haberle, V.; Stark, A. Eukaryotic core promoters and the functional basis of transcription initiation. Nat. Rev. Mol. Cell Biol. 2018, 19, 621–637.
Wu, S.; Meng, Q.; Zhang, C.; Sun, H.; Lu, R.; Gao, N.; Yang, H.; Li, X.; Aschner, M.; Chen, R. DR4 mediates the progression, invasion, metastasis and survival of colorectal cancer through the Sp1/NF1 switch axis on genomic locus. Int. J. Cancer 2018, 143, 289–297.
Meng, Q.; Wu, S.; Wang, Y.; Xu, J.; Sun, H.; Lu, R.; Gao, N.; Yang, H.; Li, X.; Tang, B.; et al. MPO Promoter Polymorphism rs2333227 Enhances Malignant Phenotypes of Colorectal Cancer by Altering the Binding Affinity of AP-2α. Cancer Res. 2018, 78, 2760–2769.
Lou, H.; Yeager, M.; Li, H.; Bosquet, J.G.; Hayes, R.B.; Orr, N.; Yu, K.; Hutchinson, A.; Jacobs, K.B.; Kraft, P.; et al. Fine mapping and functional analysis of a common variant in MSMB on chromosome 10q11.2 associated with prostate cancer susceptibility. Proc. Natl. Acad. Sci. USA 2009, 106, 7933–7938.
Hua, J.T.; Ahmed, M.; Guo, H.; Zhang, Y.; Chen, S.; Soares, F.; Lu, J.; Zhou, S.; Wang, M.; Li, H.; et al. Risk SNP-Mediated Promoter-Enhancer Switching Drives Prostate Cancer through lncRNA PCAT. Cell 2018, 174, 564–575.
Bojesen, S.E.; Pooley, K.A.; Johnatty, S.E.; Beesley, J.; Michailidou, K.; Tyrer, J.P.; Edwards, S.L.; Pickett, H.A.; Shen, H.C.; Smart, C.E.; et al. Multiple independent variants at the TERT locus are associated with telomere length and risks of breast and ovarian cancer. Nat. Genet. 2013, 45, 371–384.
Michailidou, K.; Lindstrom, S.; Dennis, J.; Beesley, J.; Hui, S.; Kar, S.; Lemacon, A.; Soucy, P.; Glubb, D.; Rostamianfar, A.; et al. Association analysis identifies 65 new breast cancer risk loci. Nature 2017, 551, 92–94.
Noncoding Genetic Variation in GATA3 Increases Acute Lymphoblastic Leukemia Risk through Local and Global Changes in Chromatin Conformation|Nature Genetics. Available online: https://www.nature.com/articles/s41588-021-00993-x (accessed on 11 October 2022).
Williamson, I.; Hill, R.E.; Bickmore, W.A. Enhancers: From Developmental Genetics to the Genetics of Common Human Disease. Dev. Cell 2011, 21, 17–19.
Li, G.; Ruan, X.; Auerbach, R.K.; Sandhu, K.S.; Zheng, M.; Wang, P.; Poh, H.M.; Goh, Y.; Lim, J.; Zhang, J.; et al. Extensive Promoter-Centered Chromatin Interactions Provide a Topological Basis for Transcription Regulation. Cell 2012, 148, 84–98.
Sur, I.; Taipale, I.S.J. The role of enhancers in cancer. Nat. Cancer 2016, 16, 483–493.
Yan, J.; Chen, S.-A.A.; Local, A.; Liu, T.; Qiu, Y.; Dorighi, K.M.; Preissl, S.; Rivera, C.M.; Wang, C.; Ye, Z.; et al. Histone H3 lysine 4 monomethylation modulates long-range chromatin interactions at enhancers. Cell Res. 2018, 28, 204–220.
Corradin, O.; Scacheri, P.C. Enhancer variants: Evaluating functions in common disease. Genome Med. 2014, 6, 85.
Ward, L.D.; Kellis, M. Interpreting noncoding genetic variation in complex traits and human disease. Nat. Biotechnol. 2012, 30, 1095–1106.
Michailidou, K.; Beesley, J.; Lindstrom, S.; Canisius, S.; Dennis, J.; Lush, M.J.; Maranian, M.J.; Bolla, M.K.; Wang, Q.; Shah, M.; et al. Genome-wide association analysis of more than 120,000 individuals identifies 15 new susceptibility loci for breast cancer. Nat. Genet. 2015, 47, 373–380.
Zhang, Z.; Liu, X.; Li, L.; Yang, Y.; Yang, J.; Wang, Y.; Wu, J.; Wu, X.; Shan, L.; Pei, F.; et al. SNP rs4971059 predisposes to breast carcinogenesis and chemoresistance via TRIM46-mediated HDAC1 degradation. EMBO J. 2021, 40, e107974.
Yang, H.; Ting, X.; Geng, Y.-H.; Xie, Y.; Nierenberg, J.L.; Huo, Y.-F.; Zhou, Y.-T.; Huang, Y.; Yu, Y.-Q.; Yu, X.-Y.; et al. The risk variant rs11836367 contributes to breast cancer onset and metastasis by attenuating Wnt signaling via regulating NTN4 expression. Sci. Adv. 2022, 8, eabn3509.
Tian, P.; Zhong, M.; Wei, G.-H. Mechanistic insights into genetic susceptibility to prostate cancer. Cancer Lett. 2021, 522, 155–163.
Huang, Q.; Whitington, T.; Gao, P.; Lindberg, J.; Yang, Y.; Sun, J.; Väisänen, M.-R.; Szulkin, R.; Annala, M.; Yan, J.; et al. A prostate cancer susceptibility allele at 6q22 increases RFX6 expression by modulating HOXB13 chromatin binding. Nat. Genet. 2014, 46, 126–135.
Gao, P.; Xia, J.-H.; Sipeky, C.; Dong, X.-M.; Zhang, Q.; Yang, Y.; Zhang, P.; Cruz, S.P.; Zhang, K.; Zhu, J.; et al. Biology and Clinical Implications of the 19q13 Aggressive Prostate Cancer Susceptibility Locus. Cell 2018, 174, 576–589.
Zheng, Y.; Lei, T.; Jin, G.; Guo, H.; Zhang, N.; Chai, J.; Xie, M.; Xu, Y.; Wang, T.; Liu, J.; et al. LncPSCA in the 8q24.3 risk locus drives gastric cancer through destabilizing DDX. EMBO Rep. 2021, 22, e52707.
Hepatocellular Carcinoma Risk Variant Modulates lncRNA HLA-DQB1-AS1 Expression via a Long-Range Enhancer–Promoter Interaction|Carcinogenesis|Oxford Academic. Available online: https://academic.oup.com/carcin/article/42/11/1347/ (accessed on 25 August 2022).
Schödel, J.; Bardella, C.; Sciesielski, L.; Brown, J.M.; Pugh, C.; Buckle, V.; Tomlinson, I.P.; Ratcliffe, P.; Mole, D.R. Common genetic variants at the 11q13.3 renal cancer susceptibility locus influence binding of HIF to an enhancer of cyclin D1 expression. Nat. Genet. 2012, 44, 420–425.
Yu, C.-Y.; Han, J.-X.; Zhang, J.; Jiang, P.; Shen, C.; Guo, F.; Tang, J.; Yan, T.; Tian, X.; Zhu, X.; et al. A 16q22.1 variant confers susceptibility to colorectal cancer as a distal regulator of ZFP. Oncogene 2020, 39, 1347–1360.
Tian, J.; Chang, J.; Gong, J.; Lou, J.; Fu, M.; Li, J.; Ke, J.; Zhu, Y.; Gong, Y.; Yang, Y.; et al. Systematic Functional Interrogation of Genes in GWAS Loci Identified ATF1 as a Key Driver in Colorectal Cancer Modulated by a Promoter-Enhancer Interaction. Am. J. Hum. Genet. 2019, 105, 29–47.
Tian, J.; Lou, J.; Cai, Y.; Rao, M.; Lu, Z.; Zhu, Y.; Zou, D.; Peng, X.; Wang, H.; Zhang, M.; et al. Risk SNP-Mediated Enhancer–Promoter Interaction Drives Colorectal Cancer through Both FADS2 and AP002754. Cancer Res. 2020, 80, 1804–1818.
Capturing Chromosome Conformation|Science. Available online: https://www.science.org/doi/10.1126/science.1067799?url_ver=Z39.88-2003&rfr_id=ori:rid:crossref.org&rfr_dat=cr_pub%20%200pubmed (accessed on 25 August 2022).
Tolhuis, B.; Palstra, R.-J.; Splinter, E.; Grosveld, F.; de Laat, W. Looping and Interaction between Hypersensitive Sites in the Active β-globin Locus. Mol. Cell 2002, 10, 1453–1465.
Giambartolomei, C.; Seo, J.-H.; Schwarz, T.; Freund, M.K.; Johnson, R.D.; Spisak, S.; Baca, S.C.; Gusev, A.; Mancuso, N.; Pasaniuc, B.; et al. H3K27ac HiChIP in prostate cell lines identifies risk genes for prostate cancer susceptibility. Am. J. Hum. Genet. 2021, 108, 2284–2300.
Zhu, Y.; Gujar, A.D.; Wong, C.-H.; Tjong, H.; Ngan, C.Y.; Gong, L.; Chen, Y.-A.; Kim, H.; Liu, J.; Li, M.; et al. Oncogenic extrachromosomal DNA functions as mobile enhancers to globally amplify chromosomal transcription. Cancer Cell 2021, 39, 694–707.
Zheng, H.; Xie, W. The role of 3D genome organization in development and cell differentiation. Nat. Rev. Mol. Cell Biol. 2019, 20, 535–550.
Gorkin, D.U.; Leung, D.; Ren, B. The 3D Genome in Transcriptional Regulation and Pluripotency. Cell Stem Cell 2014, 14, 762–775.
He, H.; Li, W.; Liyanarachchi, S.; Srinivas, M.; Wang, Y.; Akagi, K.; Wang, Y.; Wu, D.; Wang, Q.; Jin, V.; et al. Multiple functional variants in long-range enhancer elements contribute to the risk of SNP rs965513 in thyroid cancer. Proc. Natl. Acad. Sci. USA 2015, 112, 6128–6133.
Zhang, X.; Cowper-Sal·lari, R.; Bailey, S.D.; Moore, J.H.; Lupien, M. Integrative functional genomics identifies an enhancer looping to the SOX9 gene disrupted by the 17q24.3 prostate cancer risk locus. Genome Res. 2012, 22, 1437–1446.
Ling, J.Q.; Li, T.; Hu, J.F.; Vu, T.H.; Chen, H.L.; Qiu, X.W.; Cherry, A.M.; Hoffman, A.R. CTCF Mediates Interchromosomal Colocalization Between Igf2/H19 and Wsb1/Nf1. Science 2006, 312, 269–272.
Insulators: Exploiting Transcriptional and Epigenetic Mechanisms|Nature Reviews Genetics. Available online: https://www.nature.com/articles/nrg (accessed on 26 August 2022).
Yusufzai, T.M.; Tagami, H.; Nakatani, Y.; Felsenfeld, G. CTCF Tethers an Insulator to Subnuclear Sites, Suggesting Shared Insulator Mechanisms across Species. Mol. Cell 2004, 13, 291–298.
Ahmed, M.; Soares, F.; Xia, J.-H.; Yang, Y.; Li, J.; Guo, H.; Su, P.; Tian, Y.; Lee, H.J.; Wang, M.; et al. CRISPRi screens reveal a DNA methylation-mediated 3D genome dependent causal mechanism in prostate cancer. Nat. Commun. 2021, 12, 1781.
Statkiewicz, M.; Maryan, N.; Kulecka, M.; Kuklinska, U.; Ostrowski, J.; Mikula, M.; Czyżowska, A.; Barbasz, A. Functional analyses of a low-penetrance risk variant rs6702619/1p21.2 associating with colorectal cancer in Polish population. Acta Biochim. Pol. 2019, 66, 305–313.
Claussnitzer, M.; Cho, J.H.; Collins, R.; Cox, N.J.; Dermitzakis, E.T.; Hurles, M.E.; Kathiresan, S.; Kenny, E.E.; Lindgren, C.M.; MacArthur, D.G.; et al. A brief history of human disease genetics. Nature 2020, 577, 179–189.
Rahman, M.A.; Krainer, A.R.; Abdel-Wahab, O. SnapShot: Splicing Alterations in Cancer. Cell 2020, 180, 208–208.e1.
He, L.; Hannon, G.J. MicroRNAs: Small RNAs with a big role in gene regulation. Nat. Rev. Genet. 2004, 5, 522–531.
Esquela-Kerscher, A.; Slack, F. Oncomirs—MicroRNAs with a role in cancer. Nat. Cancer 2006, 6, 259–269.
Hoffman, A.E.; Zheng, T.; Yi, C.; Leaderer, D.; Weidhaas, J.; Slack, F.; Zhang, Y.; Paranjape, T.; Zhu, Y. microRNA miR-196a-2 and Breast Cancer: A Genetic and Epigenetic Association Study and Functional Analysis. Cancer Res. 2009, 69, 5970–5977.
Chen, J.; Jiang, Y.; Zhou, J.; Liu, S.; Qin, N.; Du, J.; Jin, G.; Hu, Z.; Ma, H.; Shen, H.; et al. Evaluation of CpG-SNPs in miRNA promoters and risk of breast cancer. Gene 2018, 651, 1–8.
Gao, F.; Xiong, X.; Pan, W.; Yang, X.; Zhou, C.; Yuan, Q.; Zhou, L.; Yang, M. A Regulatory MDM4 Genetic Variant Locating in the Binding Sequence of Multiple MicroRNAs Contributes to Susceptibility of Small Cell Lung Cancer. PLoS ONE 2015, 10, e0135647.
Wu, X.-M.; Yang, H.-G.; Zheng, B.-A.; Cao, H.-F.; Hu, Z.-M.; Wu, W.-D. Functional Genetic Variations at the microRNA Binding-Site in the CD44 Gene Are Associated with Risk of Colorectal Cancer in Chinese Populations. PLoS ONE 2015, 10, e0127557.
Ke, J.; Tian, J.; Li, J.; Gong, Y.; Yang, Y.; Zhu, Y.; Zhang, Y.; Zhong, R.; Chang, J.; Gong, J. Identification of a Functional Polymorphism Affecting Microrna Binding in the Susceptibility Locus 1q25.3 for Colorectal Cancer. Wiley Online Library, 2021. Available online: https://onlinelibrary.wiley.com/doi.org/10.1002/mc.22649 (accessed on 26 August 2022).
Shen, C.; Yan, T.; Wang, Z.; Su, H.-C.; Zhu, X.; Tian, X.; Fang, J.-Y.; Chen, H.; Hong, J. Variant of SNP rs1317082 at CCSlnc362 (RP11-362K14.5) creates a binding site for miR-4658 and diminishes the susceptibility to CRC. Cell Death Dis. 2018, 9, 1177.
Feng, T.; Feng, N.; Zhu, T.; Li, Q.; Zhang, Q.; Wang, Y.; Gao, M.; Zhou, B.; Yu, H.; Zheng, M.; et al. A SNP-mediated lncRNA (LOC146880) and microRNA (miR-539-5p) interaction and its potential impact on the NSCLC risk. J. Exp. Clin. Cancer Res. 2020, 39, 157.
Wang, J.; Zou, Y.; Du, B.; Li, W.; Yu, G.; Li, L.; Zhou, L.; Gu, X.; Song, S.; Liu, Y.; et al. SNP-mediated lncRNA-ENTPD3-AS1 upregulation suppresses renal cell carcinoma via miR-155/HIF-1α signaling. Cell Death Dis. 2021, 12, 672.
Fu, Y.; Zhang, Y.; Cui, J.; Yang, G.; Peng, S.; Mi, W.; Yin, X.; Yu, Y.; Jiang, J.; Liu, Q.; et al. SNP rs12982687 affects binding capacity of lncRNA UCA1 with miR-873-5p: Involvement in smoking-triggered colorectal cancer progression. Cell Commun. Signal. 2020, 18, 37.
Hou, Y.; Zhou, M.; Li, Y.; Tian, T.; Sun, X.; Chen, M.; Xu, W.; Lu, M. Risk SNP-mediated LINC01614 upregulation drives head and neck squamous cell carcinoma progression via PI3K/AKT signaling pathway. Mol. Carcinog. 2022, 61, 797–811.

© Text is available under the terms and conditions of the Creative Commons Attribution (CC BY) license; additional terms may apply. By using this site, you agree to the Terms and Conditions and Privacy Policy.