Telomeres and Their Neighbors: Comparison
Please note this is a comparison between Version 1 by Eva Sykorova and Version 2 by Camila Xu.

Telomeres are essential structures formed from satellite DNA repeats at the ends of chromosomes in most eukaryotes. Satellite DNA repeat sequences are useful markers for karyotyping, but have a more enigmatic role in the eukaryotic cell. More research is needed until there is a complete picture of the biological function of telomere and other DNA satellite sequences, including chromatin structure, chromosome end-protection and species evolution with a particular focus on non-model organisms. The first problem to solve is the identification of telomere repeats, because telomere repeat identity is the foundation for any hypothesis about telomere maintenance and structure, or binding of specific  proteins. Celebrating Gregor Mendel’s anniversary by going to the principles behind the experiments, a selection of recent developments and underexplored areas of research from the past are illustrated in plants and insects. Indeed, much recent work has expanded beyond the human and yeast models traditional in telomere research. Classic methods from the past, and cutting-edge in silico methods are described. These do not require specialized equipment or expensive materials and can be used, often in combination, to aid research into telomeres and satellites. This can both enrich the general understanding of chromosome maintenance mechanisms and further explore the evolution of telomeres and telomerases.

  • satellite
  • telomere evolution
  • interstitial telomere sequences
  • telomerase RNA
  • NGS
  • eukaryotic tree of life

1. Introduction

The essential DNA structures that form eukaryotic chromosomes are centromeres, telomeres and origins of replication. Centromeres are vital for proper nuclear division and telomeres protect the ends of linear chromosomes from attrition during DNA replication. Each chromosome also possesses genes which populate the chromatin regions lying between centromeres and telomeres. Genes are sequence-based quanta of information, Gregor Mendel’s “elements”, that are the foundation of organism identity when realized. A simplified picture of genetics was initially recognized by Mendel without any deep knowledge of DNA, RNA and proteins; molecules that we now know realize organism function. When thinking about Mendel 200 years after his birth, it is obvious that simple methods and an open mind are the solution to many scientific questions, even those related to gene function, genome structure and evolution. Of course, there is also a little bit of luck in choosing a model. Mendel’s choice, the garden pea (Pisum sativum) has a large genome that has only recently been sequenced [1] and is not a popular organism for modern plant genetics due to the abundance of repetitive elements in its genome. Conversely, another one of Mendel’s favorite organisms, the honeybee (Apis melifera, [2]) is still intensively studied as it offers a chance to understand the phenomena of social life and cooperation in the insect world. It is an extraordinary coincidence that the dawn of telomere biology in 1938 is linked to maize and Drosophila, models which were used in the pioneering works of McClintock and Muller, who showed breakage-fusion-bridge cycles and chromosome healing after X-ray damage [3][4][3,4]. At that time, it was assumed that the entire chromosome was filled with genes, and Muller’s original definition of the telomere speaks about it as a special terminal gene. Today, the definition of a gene has changed and we know more about chromosomes and telomeres, but there is still much yet to be discovered (e.g., [5][6][7][8][9][10][11][12][13][5,6,7,8,9,10,11,12,13] and references herein). Telomeres and satellites are often neighbors at chromosome termini. Much telomere research derives from human, ciliate and yeast models. Knowledge of telomere evolution has increased enormously in recent years, however, thanks to discoveries in plant and insect models. 

2. Telomeres as Steps in Species Evolution

To begin with, telomere DNA sequences were assigned as a trait of a large group of organisms, e.g., TTAGGG in vertebrates, TTTAGGG in plants, TTAGG in insects/arthropods [14][15][16][17][18][74,114,115,116,117] (see [19][75] for review). Thus, the majority of identified telomere repeats are of minisatellite size and maintained by a special enzyme, telomerase telomerase ([20][21][22][76,77,78], reviewed in, e.g., [7][11][23][7,11,79]), except well-known examples of non-telomerase alternatives from Diptera ([24][25][26][27][28][80,81,82,83,84], reviewed in [11][19][29][30][11,75,98,99]). This conservation has proven advantageous in microscopy studies and telomeric probes are second only to rDNA probes [31][32][33][32,33,118], e.g., to distinguish and study telocentric chromosomes, to recognize Rabl-like or bouquet organization or various chromosomal aberrations [34][35][36][37][38][39][40][41][119,120,121,122,123,124,125,126]. Numerous reports that characterized typical telomeric sequences in an increasing number of species seemed to confirm the telomere consensus TxAyGz. Telomeric sequences in yeast models, e.g., TG1–3 in budding yeast [42][43][127,128], T1–2ACA0–1C0–1G1–6 in fission yeast [44][129], 8–25 bp-long repeats in Kluyveromyces and Candida [45][46][130,131] were treated as an interesting variety from the general repeat unit TxAyGz and special only to yeast. Current research on Saccharomycotina [47][48][132,133] has revealed even more telomeric variants, although despite their considerable divergence, all of these telomere sequences have guanines (Gs) as one of their most conserved features [46][47][48][49][131,132,133,134]. Missing signals using telomere probes in in situ hybridization experiments were the first hints towards identifying organisms that do not possess typical telomeres formed by the expected repeat, e.g., plants Allium (Asparagales, [15][114]), Cestrum (Solanales, [50][135]), some beetles and the spider Tegenaria ferruginea [18][117]. In the next few years, detailed studies revealed gradually more species with unknown telomeres from plants [51][52][136,137] and insects [53][138]. This led to a breakthrough in the general view of telomeres. Studies that mapped telomere sequences in plants, animals and algae identified evolutionary switchpoints in which sequences typical to one group were replaced by other variants [50][54][55][56][57][58][59][60][61][30,135,139,140,141,142,143,144,145]. For example, a group of species from the plant order Asparagales changed their telomeric sequence from the Arabidopsis-type repeat TTTAGGG to the human-type TTAGGG. An elusive, highly divergent telomere repeat was finally identified in Allium (Amaryllidaceae, Alloidae, [62][146], see Section 7), one of the largest monocotyledonous genera with an estimated 800–900 species [63][147]. Similar step changes were found in green algae, in which the transitions from TTTAGGG to novel types TTAGGG, TTTTAGGG or TTTTAGG allowed the grouping of species with the same telomere in distinct phylogeny clades [54][58][64][30,142,148]. A similar switch was identified in beetles where the repeat TTAGG was replaced with the TCAGG repeat [59][143]. A broad experimental study of algal telomeres, accompanied with the identification of candidate telomeric sequences from genomic databases of various species across the eukaryotic tree of life, showed TTAGGG and TTTAGGG telomeres as being the predominant telomeric types [54][30]. Fulneckova and colleagues [54][30] mapped the occurrence of telomeric sequences in phylogeny revealing the TTAGGG repeat as an ancestral eukaryotic telomere and current phylogeny [65][66][149,150] still supports this hypothesis (see [7] for review). Interestingly, just as many telomere variants were experimentally verified, many more species and groups with unknown telomeres were discovered [11][54][58][64][67][11,30,142,148,151]. Telomere sequence variants and their evolution in plants and algae are described in detail in a review by Peska and Garcia [7]. Progress in insect telomere identification is reviewed in Mason et al. [11] and recent findings are mapped in [67][68][69][70][151,152,153,154].

3. Telomere Minisatellites Are Much like Any Other DNA Sequences

When exploring the occurrence of telomere minisatellite repeats in the genome, we should keep in mind that telomere-like sequences can occur in locations other than in the telomere. Such sequences are called interstitial telomeric sequences (ITSs) and can be classified as part of several groups according to their length, occurrence and structure (recently reviewed in [71][155]Figure 12). ITSs can have the same sequences as telomeres or they can have variant telomere-like repeats. For example, budding yeast has the telomeric sequence TG1–3 and interstitial tracts of TTAGGG repeats are present in subtelomeric and other regions [72][156]. ITSs can occur as a few copies across the genome, including regions that are proximal to genes, but also in clusters found frequently in pericentromeric or subtelomeric regions. The arrangement of ITS sites can also be classified in respect to the orientation and composition of telomere-like sequences as head-to-tail or head-to-head, homogeneous or degenerated tandem repeats and with or without linker sequence(s) (Figure 12b). When ITSs occur in a head-to-head orientation with a linker sequence, these can be amplified using a single-primer PCR reaction [73][157] (Figure 12c). ITSs can be unique or part of longer repetitive sequences and are a suitable genetic marker for mapping [73][74][75][157,158,159]. ITSs in clusters usually contain a large portion of degenerate telomeric motifs and could be interspersed with other repetitive sequences [74][75][76][158,159,160].
Figure 12. Experimental examination of ITS and telomeric repeats. (a) Telomere repeats are strand oriented. (b) Telomere-like repeats in telomeres or internal sites may form clusters or short stretches. Single-primed PCR distinguishes between these using an extension reaction with a single telomeric oligonucleotide primer (C-rich primer is shown, triangles). Telomeric sequences, short and clustered ITSs produce a smear of ssDNA products visible after hybridization with a radioactively (*) labeled probe (right, e.g., from Chlorela vulgaris, experiment performed as in [50][135]). Cloneable dsDNA products visible in an ethidium-bromide stained agarose gel (etd) are produced when ITSs occur in head-to-head orientation. When dGTP is omitted, bands are not produced by ssDNA or short ITSs, but ssDNA from a telomere is elongated until primer extension stops at the first G in the subtelomere. This reaction showed the Arabidopsis- and human-type telomere repeats are absent in Allium and Cestrum [50][56][77][135,140,161]. (c) Different patterns of ITSs amplified from four Cestrum species in single-primed PCR using C-rich and G-rich primers for the Arabidopsis-type telomeric repeat [50][73][135,157]. (d) The specific pattern of ITS-associated sequence BR23 (green) was visualized on Cestrum elegans chromosomes using FISH. The high-copy repeat BR23 shows dispersed and clustered signals (5S rDNA in red, counterstained with DAPI; adapted from [73][157]). (e) Allotetraploid Cardamine scutata, a hybrid of C. parviflora and C. amara with the parental origin of chromosomes visualized by GISH (left panel, GISH) and telomeric probe (TEL) that detects differing pericentromeric ITS clusters (adapted from [75][159]; modified). (f) FISH of the 180-bp centromeric satellite (CEN180), retroelement ATHILA and TEL on pachytene chromosomes of A. thaliana. Interstitial telomeric locus in the pericentromeric region of chromosome Ch1 is marked by an arrow (adapted from [76][160]; modified). (g) TRF (terminal restriction fragment) method visualizes telomeric and ITS fragments from A. thaliana after restriction digestion of gDNA with MseI. (h) Schema illustrating the effect of Bal31 nuclease digestion on telomeric, subtelomeric, ITS and internal genomic sequences. After DNA isolation, DNA is fragmented and Bal31 nuclease gradually shortens these fragments from the end. Bal31-digested samples can be used for specific telomere-subtelomere PCR (left, see below). Further restriction digestion (right, H) results in the visualization of TRF signal (h-tel) shortening and verification of the terminal position of a candidate sequence. (Left) PCR/qPCR investigation of genomes with short telomeres (e.g., A. thaliana, see results in (ik) adapted from [78][162]) proving subtelomeric position of candidate sequences (A,B). When the telomere is completely digested, PCR with a C-rich primer cannot amplify the product (tel-a, tel-b), and further digestion results in a loss of amplification signal from subtelomere regions proximal to telomeres (A) in contrast to ITS (C, pericentromeric ITS in A. thaliana, see schemas in (j)) or control sequences (D,E). Bal31 nuclease also degrades ssDNA (F) and some dsDNA sites with altered structures (G). (i) Dynamics of Bal31 digestion monitored by qPCR. Short gDNA exposure to Bal31 results in a sudden, seemingly non-specific decrease in gDNA amount followed by a gradual decrease over a prolonged time. (j) Bal31-sensitivity of specific subtelomeric sequences from chromosome arm 2R (pat and gal2) and the resistance of the centromeric ITS region to Bal31 digestion resolved by PCR. gDNA integrity was monitored by amplification of 5 kb-long fragments of the TERT gene. (k) qPCR analysis of specific subtelomere (gal2, pat, gal5), ITS and control sequences documented a decrease of subtelomeric sequences in relation to their position in the subtelomere. Relative DNA levels were calculated by the ΔCt method (i) or ΔΔCt method [79][163] using ubiquitine-10 as a reference gene relative to the nontreated DNA sample (k). Color coding is the same for (hk). Pictures were adapted by courtesy of Dr. Terezie Mandáková (e,f) and Prof. Andrew Leitch (d), scale bars are 10 µm.
When such clusters are big enough, these can be detected by FISH (Figure 12d–f) and distinguished from telomeres (e.g., [73][75][80][81][82][83][84][85][86][49,157,159,164,165,166,167,168,169]). If they are shorter than the detection limit of this method, they can still show a positive signal when investigated by Southern hybridization or primer extension (Figure 12g). The origin, evolution and function of ITSs are still subject to much discussion [35][71][86][87][88][89][90][120,155,169,170,171,172,173]. The massive areas of ITSs often found in pericentromeric regions can be explained as the result of mechanisms such as unequal gene conversion, crossing-over, DNA replication slippage and rolling circle replication of extrachromosomal circular DNA. Some ITSs co-localize with sites of chromosomal breakage and are described as remnants of ancient chromosomal rearrangements, such as during primate evolution [91][174]. A similar view holds for human ITSs arranged as head-to-head blocks of telomeric repeats that seem to result from the terminal fusion of ancestor chromosomes [41][92][126,175]. ResearchWers are still far from understanding the interplay of mechanisms that are activated during genome instability. It has long been considered that overall change in chromosome architecture can result from breakage-fusion-bridge cycles, a phenomenon first described in maize ([93][176], reviewed in [94][177]). The classic theory behind this is that a chromosome with one end broken during meiotic crossing-over can fuse with another such broken chromosome, leading to the formation of a “bridge“ conformation chromosome with two centromeres during the subsequent cycle of meiosis. This bridged chromosome is then ultimately cleaved into two daughter chromatids, but not necessarily at the site of the original breakage. This can lead to sequence deletion or replication on subsequently-healed daughter chromatids [93][176]. Experimental examination of this theory in Caenorhabditis elegans revealed evidence of such cycles, but also suggested more complex chromatin rearrangements can arise [95][178]. These more extensive rearrangements are proposed to arise from stalled replication events followed by template switching as may occur in areas with high-homology satellite sequences [95][178]. A simpler phenomenon is where non-reciprocal translocations can occur during break-induced DNA replication ([96][179], reviewed in [97][180]). Broken chromosomes are proposed to invade intact chromosomes with areas of homology during the G1 or G2 phase of the cell cycle, initiating DNA repair with the sequence from the other chromosome arm, possibly acquiring new genes and a telomere in the process [96][179]. Similar genome instability is also possible when telomeres are lost, making chromosome ends indistinguishable from double-strand breaks [98][181]. It is clear that telomerase and possible ITSs could have an important role in chromosome rearrangement. For example, when tobacco cells recovered to full cell viability after extensive chromatin fragmentation induced by cadmium stress, this was accompanied by a concomitant increase in telomerase activity [99][182]. Wheat chromosome end healing after gametocidal gene-induced breakage, efficient telomere healing by telomerase and stabilization of holocentric chromosomes in irradiated Luzula elegans plants were also previously reported [100][101][183,184]. Interestingly, when constructs containing telomeric arrays are introduced into mammalian or plant cells, the sites of integration become fragile, chromosomal breakage is induced and the new ends are stabilized [102][103][104][185,186,187]. Telomere-mediated chromosomal truncation has even been employed as a chromosome engineering technique [105][106][107][108][188,189,190,191]. All this supports the hypothesis that ITSs are preferred sites for breakage and that telomere-like repeats at a break site may favor chromosomal healing [87][170].

4. Telomere Proteins

Chromosomal DNA in cells associates with proteins that fold these long polymeric molecules into condensed, ordered forms. Most of the DNA sequence, including genes, subtelomeric satellites and the proximal sections of telomeres is folded into a series of compact but dynamic protein-DNA complexes called nucleosomes [109][269]. In 2001, Fajkus and Trifonov [110][270] proposed telomeric nucleosomes are packed in a variant, columnar chromatin structure. Recently, the formation of this structure was confirmed experimentally using cryoelectron microscopy [111][271]. The ends of telomeres associate with a more diverse set of proteins depending on organism that maintain a 3′ single-stranded overhang (aka G-overhang), recruiting enzymes to lengthen the 3′ strand and shorten the 5′ strand which induce and stabilize t-loop formation (reviewed in [9][112][9,272]). These mechanisms protect telomeric DNA, prevent aberrant DNA repair and mediate interactions with telomerase (see above, [112][272]). In Arabidopsis and Chlamydomonas some telomere ends are instead blunt, with no or little 3′ overhang, although it is unknown whether this is a special feature of these organisms or a more widespread characteristic [113][114][115][195,273,274]. Of principal interest to telomere researchers are the specialist proteins that interact with the distal sections of telomeres at the ends of chromosomes. Two major telomere protecting complexes have been described, CST and shelterin. These were initially thought to be alternative mutually exclusive systems, but the search for homologues revealed that many eukaryotes, including humans, had both systems able to work in parallel [116][117][118][119][275,276,277,278]. Continuing research focused on looking for homologues of human systems across all eukaryotes, however this approach has had only partial success (reviewed in [120][279]). The CST complex is largely conserved in eukaryotes [121][280] in terms of function, if not necessarily the sequence of its components [122][123][124][281,282,283]. CST binds ssDNA and recruits Pol1α primase for C-rich strand synthesis and also has a role in preventing stalled replication forks (for recent advancements see [119][278] and references herein). In comparison, shelterin (reviewed in [112][272]) coats telomeric DNA generally and interacts with telomerase for G-rich strand synthesis. Shelterin is not present in all eukaryotes, although most have an identifiable protein family that occupies the same role (Figure 25). In addition to these larger end-protection protein complexes, there is a highly conserved heterodimer of proteins called Ku70/Ku80 that is normally involved in non-homologous end-joining events, but which also has an enigmatic role in telomeres. This complex binds dsDNA ends non-specifically, but is known to interact with components of shelterin in mammals and telomerase RNA in yeast (reviewed in [125][126][284,285]).
Figure 25. Telomere protection by protein complexes. (a) The six core units of shelterin [112][272] form a complex coating distal telomeres, although stoichiometries of subunits may vary. TRF2 forms T-loops and binds the double-stranded vertebrate telomeric sequence. TRF1 assists this binding, TIN2 and TPP1 form the core of the complex and control other protein-protein interactions and POT1 can bind single-stranded telomeric repeats to stabilize the T-loop. (b) Fission yeast shelterin [127][286] is analogous to vertebrates but differs in stoichiometries of proteins. (cDrosophila terminin [128][95] has a similar function to shelterin although the precise roles of components that share little homology with shelterin components are speculative. (d) Budding yeast telosomes [129][287]. Rap1 binds telomeric DNA and can be complexed into dimers by Rif2 or tetramers by Rif1. The entire assembly is proposed to form a velcro-like coating of telomeres although to date structural studies of this complex are on dsDNA only, so any interaction with 3′ overhangs is speculative.
Shelterin has a dynamic composition and variant complexes bind the entire length of distal telomeres, there are six core protein components in humans which are more-or-less thought to be conserved in mammals [112][130][272,288]. Telomeric repeat binding factor 2 (TRF2) binds the telomeric DNA motif with nanomolar affinity via a SANT/Myb domain sometimes termed the telobox in older literature [131][289], not to be confused with interstitial telomeric motifs, which are also called teloboxes [89][172]. TRF2 binds dsDNA and homodimerization enhances this process. It is also proposed to have helicase-like activity where it can wrap dsDNA from near to the telomere end around itself causing steric torsion in the telomere end that encourages T-loop formation. Consistent with this, TRF2 is both necessary for T-loop formation by shelterin and capable of forming T-loops in the absence of any other shelterin components [132][290]. TRF1 is a highly homologous protein to TRF2 which only binds telomeric repeats and lacks T-loop forming ability. Both proteins (possibly as homodimers) bind TRF1-interacting protein (TIN2) to form the core dsDNA binding subunit of shelterin [133][291]. TIN2 binds TINT1/PIP1/PTOP1 (TPP1) which in turn binds protection of telomeres 1 (POT1), a protein with multiple OB-fold domains that can bind ssDNA and which is thought to be the main interactor with the 3′ overhang in the complete shelterin complex. TRF2 alone can also recruit repressor/activator protein 1 (RAP1) as the sixth member of core shelterin and the interactions between shelterin subunits can generally occur across multiple protein surfaces [133][291]. TPP1 in complex with POT1 interacts with telomerase as part of the coordination of telomerase and shelterin protein complexes [119][134][278,292]. Unsurprisingly, Drosophila has evolved a separate group of proteins in a complex called terminin to protect the retrotransposon-derived sequences at the ends of its chromosomes. Terminin was identified from the larval brain cells of mutant flies with end-fused chromosomes and consists of a core of heterochromatin protein 1/origin recognition complex-associated protein (HOAP), Modigliani (Moi) and an OB-fold protein called Verrocchio (Ver) [135][293]. Whilst fission yeast has a shelterin complex made from paralogues of human proteins [136][137][294,295], budding yeast, instead has a velcro-like network of proteins called the telosome. This consists of Rap1, a general transcription factor which coats double-stranded telomeric DNA, Rif1 which binds DNA via a Myb domain and Rif2 which binds DNA via an AAA+ domain. Rif 1 and Rif2 can bind four or two molecules of Rap1 respectively through binding domains attached to long disordered chains to form a dense protein network ([138][296], reviewed in [139][297]). The system in plants is not yet clear (reviewed in [140][298]).
Although plant proteins that share some sequence homology to human shelterin proteins have been identified (summarized and reviewed in [141][142][299,300]), including those with C-terminal Myb domains similar to TRF1 and TRF2, these do not have any obvious end-protection role [143][301]. The only definitive double-stranded telomeric DNA binding proteins so far characterized in plants are the telomere repeat binding proteins (TRB1–3) [144][145][146][302,303,304]. These proteins bind to Arabidopsis telomeres in vivo [146][147][304,305], and TRB1 colocalizes with telomeres when introduced to Nicotiana benthamiana in live cell imaging studies, suggesting a general role for these proteins at plant telomeres [148][149][306,307]. TRBs have histone-like domains that allow multimerization and binding to telobox-related DNA motifs in a multitude of chromosome sites and N-terminal Myb domains that specifically bind double stranded telomeric DNA [144][145][146][150][151][302,303,304,308,309]. Similar to TPP1 in human shelterin, TRBs can interact with telomerase and so together with DNA binding and multimerization it is easy to draw parallels with other end-protecting proteins [131][141][145][146][289,299,303,304]. It can be speculated that in addition to their other regulatory DNA-binding roles [150][151][152][308,309,310], TRBs could form some sort of end-protection framework, similar to the telosome in yeast. Alternatively, it could simply be that any end-protection proteins in plants are sufficiently variant from other organisms to have eluded discovery so far. One final quirk in plant telomere biology is the occurrence of blunt-ended telomeres. Some blunt DNA ends in Arabidopsis [153][311] are known to at least temporarily bind Ku70/80, a ubiquitous DNA end-protecting protein complex that is part of the normal double-strand break maintenance mechanism. Studies in budding yeast and human cells revealed that Ku can interact with telomeric chromatin either by directly binding to telomeric DNA or via interaction with telomere associated proteins, including the shelterin subunits such as TRF1, TRF2 and Rap1 [154][155][156][312,313,314]. Studies using mice revealed considerable telomere abnormalities where Ku is knocked out, but phenotypes are complex enough that a specific role is difficult to ascertain [157][158][315,316]. In yeast, Ku also binds the telomerase RNA TLC1 separately from telomere ends in a mutually exclusive fashion, and is required to maintain levels and nuclear localization of TLC1. YKu association with telomeres is independent of its association with TLC1 RNA and occurs throughout the cell cycle [159][160][317,318]. As with other eukaryotic systems, the Ku heterodimer in Arabidopsis forms a tube that slides onto and encircles the double-stranded telomere from one free end, providing simple end-protection without translocating inward [115][153][274,311]. It is so far unknown whether Ku-protected blunt ends in Arabidopsis and Chlamydomonas are unique to these organisms or whether a more widespread phenomenon is yet to be found in other eukaryotes. It is possible that these are an evolutionary step that limits the amount of work that telomerase has to conduct or provides cells without telomerase more stability during proliferation [161][319].

5. How to Find a Telomere Candidate

Experimental approaches which have been used successfully in the past to characterize telomeres de novo (summarized in [162][206]) comprise proof of the end-protection function of newly-discovered sequence in vivo [163][164][208,209], genomic DNA library screening with verification of terminal position by BAL31 digestion and Southern hybridization [16][17][62][80] [49,115,116,146,202,[165]205],[166] cloning of telomerase products [54][55][58][64][167][168][30,139,142,148,229,230] and a novel combination of genomic and transcriptomic studies with classical methods [62][162][166][169][146,204,205,206]. Today raw data or assembled contigs generated by researchers or from public NGS (next generation sequencing) datasets can be mined for repetitive sequences using, e.g., Tandem Repeats Finder [170][234] and/or RepeatExplorer [171][39]. New ways of in silico analysis in combination with experimental approaches for the identification and verification of novel telomere sequences were used e.g., in yeast Lachancea sp. [47][132], beetle Anoplotrupes stercorosus [67] [151], a plant with human-like telomere sequence Zostera marina (Alismatales) [172] and[251], also a plant with unusual telomere type A. cepa [62][146]. Moreover, comparative transcriptome study led to identification of telomerase RNA (TR) subunits and telomeric repeats across the entire land plant phylogeny [169][204]. Subsequently, a new bioinformatic approach based on prediction of TR subunits in combination with results from Tandem Repeats Finder resulted in a broad identification of telomere sequences in green algae, ciliates and Stramenopiles including novel types TATAGGG, TGTTAGGG, TGTAAGGG and demonstrated the deep evolutionary TR origin in the megagroup Diaphoretickes [252]. [173].

ScholarVision Creations