Genetic Markers for Metabarcoding of Freshwater Microalgae: History
Please note this is an old version of this entry, which may differ significantly from the current revision.

The metabarcoding approach is widely used for studying the diversity and distribution of freshwater microalgae and for routine biomonitoring. Due to microalgae being a phylogenetically diverse group, the choice of a genetic marker directly affects the metabarcoding results. Specific markers are good for identifying only concrete groups, while universal markers may miss classes or lack the variability necessary for differentiating taxa at the species and sometimes genus levels. An analysis of publications on the subject showed that metabarcoding studies of eukaryotic freshwater microalgae used 12 markers (different nuclear regions 18S and ITS and plastid regions rbcL, 23S and 16S). Studies that compared outcomes from different markers show that the resulting lists of taxa do not match. The plastid marker rbcL is widely used for diatom metabarcoding, as it differentiates taxa at the species and intraspecies levels, and there is a specific set of primers designed for identifying Eustigmatophyceae. The V9 18S region is more variable than V4 18S and provides more diversity at higher taxonomic levels (supergroup and phylum). The ITS1 and ITS2 regions are used rarely and may be underestimated. These barcodes amplify well with the standard primers and are variable enough to identify sequences at the species level. Plastid markers (23S and 16S rDNA) focused on the plastid-containing eukaryotic algae and Cyanobacteria, conserved regions, identify taxa to the genus level and higher. Using specialized curated databases for data interpretation significantly improves the quality of the results.

  • barcode
  • metabarcoding
  • ecological assessment
  • microalgae
  • genetic markers

1. Introduction

Currently eDNA metabarcoding is a popular method for studying the diversity and functioning of various communities, from microbes to mammals. Interest in this method grows every year, and the number of studies increases. For example, a query in the SCOPUS database with the keyword “metabarcoding” returns 2215 results; a query with the keyword “eDNA” returns 26,034 results (date of search 12 February 2023). 
Algae are a phylogenetically heterogeneous group of organisms that is very diverse in morphology and ecological preferences. In the eukaryotic tree of life, photosynthetic eukaryotes are spread across 12 separate phylogenetic lines at the level of phylum [1][2][3]. On a macrosystematic level, they belong to four to seven (according to different estimates) supergroups that also contain non-photosynthetic organisms in each clade [1][2][3][4][5][6]. This phylogenetical heterogeneity is connected with a gene locus “…which is variable enough to provide robust identification at the species level…”[7] (and references in it) “…and different markers are applied for species delimitation in different algal groups.” [8] (and references in it). For example, phylogenetic studies and species descriptions of diatoms and red algae do not use the rDNA ITS marker, whereas it is the main marker currently employed for DNA-based species of green microalgae, Dinoflagellates, Chrysophytes and Synurophytes [7][8] (and references in it). Well-documented nucleotide sequences are accumulated in databases, which are the basis of interpreting the metabarcoding data.
Thus, the choice of the barcode region and primer pairs, which can limit or bias the diversity of organisms observed, is a challenge with environmental metabarcoding studies [9][10]. The proportion of biodiversity covered by metabarcoding studies directly depends on the markers and primers used, so organisms that are not amplified by standard methods go undetected, even if they are common and play an important role in the ecosystem [11]. It is important for a “good barcode” to be taxonomically informative; it needs to be able to distinguish between species (i.e., the DNA region should mutate at the right rate), because most modern biomonitoring and biotic index programs require identification at the species level. At the same time, a barcode needs conserved primer binding areas, or degenerate primers, in order to be able to attach to the DNA of all the organisms in the sample [12]. The choice of primers also impacts the results of a biodiversity assessment of an ecosystem. Complete universality causes a loss of resolution and limits the depth of the biodiversity assessments of groups. Limiting the universality of the primers might, on the other hand, exclude important groups in the analysis and introduce biases, favoring some organisms or groups.
There is a lot of experience already gained in using next-generation sequencing (NGS) approaches for studying algae. One of [13]the high-priority research areas is the integration of metabarcoding into routine biomonitoring. Many methodological questions have been answered; bioinformatics pipelines have been assessed [14][15], sampling, DNA extraction methods and applications of global eDNA have been discussed [16][17][18][19][20][21], and recently, it has been shown that the preservation time and sample preservation methods have little effect on DNA metabarcoding results [22], the experience of integrating eDNA metabarcoding into routine freshwater biomonitoring has been summarized [23][24][25] and the terminology “eDNA” has been clarified [26]). 

2. Genetic Markers for Metabarcoding of Freshwater Microalgae

2.1. Gene Markers and Primer Sets for Freshwater Microalgae Metabarcoding

Twelve various genetic regions are used in the studies. The nuclear regions V3, V4, V7, V9 and V9-ITS1 are used for analyzing whole eukaryotic communities, as well as communities of microalgae, also focusing on individual groups of algae (dinoflagellates and diatoms [27]). The ITS2 region has been chosen for studying green algae s.l. (Viridiplantae) in a series of studies of the Antarctic region [28][29][30][31]. The plastid region rbcL is widely used for diatom metabarcoding, and also, primers for identifying Eustigmatophyceae have been designed and tested [32].
Among nuclear markers, the V4 18S rRNA region is used for analyses most often. Seven options of primer sets, the most used of which was Set 6 (TAReuk454FWD1/TAReukREV3), developed by Stoeck et al. [33]. This set is widely used in metabarcoding of both marine and freshwater eukaryotic plankton. Sets 1 (DIV4for/DIV4rev3) and 2 (M13F-D512/M13R-D978rev) are aimed at diatoms and used in seven and four studies, respectively. The remaining sets are all mentioned in only one publication each, apart from Set 7 (TAReuk454FWD1/V4r), which has been recently accepted as the standard for using environmental DNA in Finnish marine phytoplankton monitoring. The V9 18S region was chosen as a barcode in nine publications. The universal primer Set 1 (1391F/EukBr) was mostly used for the amplification of this region (in eight publications out of nine). 

2.2. Reference Databases for Sequence Interpretation

The scholars used different databases for taxonomic attributions of sequences. Studies on diatoms that use the rbcL region always use “Diat. barcode” (Rsyst:diatom database), a curated barcode library for diatoms [31] for sequence interpretation. Taxonomic attributions of sequences of various 18S rRNA (V3, V4, V7, V8, V9 and combinations) regions are usually carried out using GenBank, as well as quality-controlled databases of ribosomal RNA gene sequences such as “SILVA” [34]. The PR2 (Protist Ribosomal Reference) database—a catalog of unicellular eukaryote small subunit rRNA sequences with curated taxonomy—is used less often. In a series of studies on Antarctic green algae [28][35][36], the sequences were annotated using a recently established reference dataset PLANiTS, which included the sequences of Viridiplantae ITS1, ITS2 and entire ITS sequences, including both Chlorophyta and Streptophyta [37]. To classify the 16S reads of freshwater diatom biofilm [38], PhytoREF, a reference database of the plastid 16S rRNA gene of photosynthetic eukaryotes, was used [39].

2.3. First Works on Testing Genetic Markers on Monoclonal Microalgal Cultures Provide Insight on the Effectiveness of Amplification and the Resolution of Species Differentiation

The first studies that tested the resolution of genetic markers for species differentiation were carried out using large collections of monoclonal algal cultures. It allowed to determine the effectiveness of primers in amplifying certain regions, directly compare the variability of sequences and morphological features (including cryptic species) and establish the regions that are most suitable for further research. These studies became the basis of choosing the markers for next-generation sequencing.
One of the first tests of diatom ”barcode” genes (COI, rbcL, 18S and ITS rDNA) was done by Evans et al. in 2007 [40]. The study aimed to determine the effectiveness of markers in distinguishing cryptic species within the model “morphospecies” Sellaphora pupula agg. As a result of their analysis, the authors suggested the barcode region COI as a valuable phylogenetic marker. However, they also reported some difficulties with the amplification of this gene (a large primer set was used, sequences for Seminavis cf. robusta and for centric diatoms could not be obtained and only partial sequences were obtained for the araphid pennate diatom Tabularia sp.). According to the acquired data, the plastid gene rbcL is less variable than COI, but it supports all the phylogenetic lines of the latter. As for ITS, this barcode has a lot of variability in the length of the region, and there is also the problem of intraindividual variations.
Later, Moniz and Kaczmarska [41] tested as a barcode the small ribosomal subunit (SSU, 1600 bp), a 5′ end fragment of the cytochrome c oxidase subunit 1 (COI, 430 bp), and the second internal transcribed spacer region combined with the 5.8S gene (5.8S + ITS2, 300–400 bp) on 28 species from 22 genera of diatoms. COI showed the lowest rates of amplification (only 29% of good quality DNA amplified with COI, and of those, only 30% were sequenced successfully and found to be diatom DNA). For SSU, the authors noted the highest of all three success rates in amplification and easy alignment; however, a long fragment is required for species delimitation. 5.8S + ITS2 showed a higher rate of successful amplification and sequencing (79% and 84%, respectively), as it was the most variable of the three markers, but its secondary structure was needed to aid in alignment. As a result, the 5.8S + ITS2 fragment was proposed as the best candidate for a diatom DNA barcode. 
A search for a universal marker for diatoms was carried out by Hamsher et al. [42]. The authors assessed the following markers: ∼1400 bp of rbcL, 748 bp at the 3′ end of rbcL (rbcL-3P), LSU D2/D3 and UPA. As a result, rbcL-3P was suggested as the primary marker for diatom barcoding, since it had the power to distinguish all species and could be sequenced more easily. LSU D2/D3 could distinguish all but the most closely related species (96%). UPA showed low resolution, distinguishing only 20% of the species. Relying on the authors’ personal experiences (several copies were amplified, and the resulting sequences were different in length and unreadable), as well as the literary data, it was concluded that ITS is not a good barcode for diatoms.
The effectiveness of rbcL was discussed by M. MacGillivary and I. Kaczmarska [43]. A 540-bp fragment 417 bp downstream of the start codon of the rbcL gene was tested on a large selection of diatom taxa from classes Mediophyceae and Bacillariophyceae (381 sequences representing 66 genera and 245 species). This fragment was chosen after preliminary testing as the most variable. As a result, this fragment of rbcL correctly segregated 96% and 93% of the morphological congeners, respectively. 
The effectiveness of three markers (SSU rDNA, rbcL and COI) for metabarcoding was tested on a mock community of diatom algae (30 strains belonging to 21 species) by Kermarrec et al. [44]. These markers are the primary ones used for the molecular identification of diatoms. The markers ITS and LSU were not considered in this study because of their high interclonal variability and the lack of available data for the establishment of reference libraries. In order to interpret the acquired sequences, reference libraries were created for each marker. Sequences from the authors’ own collection and from GenBank were included in these libraries. Gene marker rbcL showed the best species composition assessment of the mock community, and SSU rDNA was next (it did not differentiate the complexes Nitzschia palea and Gomphonema parvulum at the intraspecific level). COI is variable and provides high resolution, but it was not recommended for routine metabarcoding due to difficulties in amplification and low representativity of the reference library.
A large work on assessing the utility of the gene markers COI, rbcL, ITS, tufA, UPA and 18S for freshwater green algae was done by Hall et al. [45]. They tested representatives of seven distantly related species groups from classes Chlorophyceae, Charophyceae and Zygnematophyceae (151 strains, 40 species total). As a result, the authors concluded that 18S, UPA and COI would be poor choices for a DNA barcode in green algae (18S and UPA proved insufficiently variable and COI difficult to amplify). ITS, rbcL and tufA were sufficiently variable to distinguish most species of Chlorophyceae, but additional primers were sometimes needed for amplification. For the charophytes, rbcL was noted as the most suitable primer but with a remark that it was impossible to differentiate species using this marker alone.

2.4. 18S—Choosing a Variable Barcode Region for Eukaryotes In Silico

The eukaryotic gene 18S-rRNA is used for species delimitation in almost all groups of freshwater algae [8]. It contains nine hypervariable regions (V1 to V9), each of which has been considered as a short barcode for species identification (with the exception of V6, because this region is more conserved in eukaryotes) [46] (and references in it). The question of using hypervariable regions as barcode markers for eukaryotes in silico has been discussed in several publications.
Based on an alignment of eukaryotes containing 24,793 positions from the SILVA database, the characterization of the 18S rRNA gene and the design of universal eukaryote specific primers were provided by Hadziavdic et al. [47]. To describe the nucleotide variation in the alignment, the authors used Shannon entropy values. The results suggested that the V2, V4 and V9 regions were best suited for biodiversity assessments (they yielded the highest taxonomic resolutions at cut-off values ranging 95–100% for the sequence identity). The V1 region is rather short (ca 100 nt) and contains a highly conserved core segment, and the V3 and V5 regions lack highly variable segments and are not very long. V7 has a highly variable core of approximately 20–25 nt. The V8 region is over 150 nucleotides long with variable and conserved positions interspersed across the region, with a conserved segment towards the 3′ end. 

2.5. 18S rRNA Gene Metabarcoding: V4 vs. V9

Bradley et al. [48] examined the effect of PCR/sequencing bias of the V4 and V8–V9 regions on community structure and membership using seven microalgal mock communities consisting of 12 algal species across five major divisions of eukaryotic marine and freshwater microalgae. The authors found a critical shortcoming of the V4 primer set as used in the literature [33] and described the failed sequencing runs. The V4 region failed to reliably capture 2 of the 12 mock community members (the haptophytes Prymnesium parvum and Isochrysis galbana), whereas the V8–V9 hypervariable region more accurately represented the mean relative abundance and alpha and beta diversity. Bradley et al. [49] found that degeneracies on the 3′ end of the current V4-specific primers impacted the read length and mean relative abundance. They modified the TAReukREV3 reverse primer and suggested the V4r primer without degeneracies on the 3′ end for the subsequent sequencing.
A comparative analysis of the V4 and V9 regions of 18S rDNA of the eukaryotic community of a pond [50] showed a remarkable discrepancy: the inventory of the major subdivision groups in the V9 region dataset did not correspond to that in the V4 region dataset. Eukaryotic OTUs for the V9 region were 20% more abundant than those for the V4 region at a 97% identity threshold. V9 also showed a larger diversity from the point of view of taxonomic coverage. The classes Karyorelictea, Prostomatea and Nassophorea in Ciliophora and the family Perkinsida (‘Alveolata’ group) were not detected using the V4 sequencing data, whereas they were detected using the V9 sequencing data. V4 missed Echinamoebida, Eumycetozoa and Euamoebida and green microalgae classes Chloropicophyceae, Pyramimonadophyceae and Mamiellophyceae. The authors noted “… the simultaneous application of two biomarkers may be suitable for understanding the molecular phylogenetic relationships”.
In an investigation dedicated to a eukaryotic community in anaerobic wastewater treatment systems [51], the V4 and V9 regions also detected different taxonomic groups. The authors suggested that commonly used V4 and V9 primer pairs could produce a bias in eukaryotic community analyses. The number of sequences of the amplicon library for the V9 region was almost two times larger than the number of sequences of the V4 amplicon library (340,054 vs. 180,678). The V4 region-specific primer pair showed that the dominant group was fungi.
A comparison of the 18S rRNA V4 and V9 regions for coastal phytoplankton communities with a focus on Chlorophyta [52] showed that the V9 region provided 20% more OTUs built at 97% identity than V4. Interestingly, the expectations were the opposite: the authors assumed that V4 as the longer region would detect more OTUs. The authors noted that both markers work “…equally well to describe global communities at different taxonomic levels from the division to the genus and provided similar Chlorophyta distribution patterns”. The authors concluded that V9 was the better choice for Chlorophyta, as it was more discriminating than V4. In the same cases for prasinophytes clade VII, V9 OTUs allowed to discriminate all subclades defined to date, while, in V4, several clades collapsed together. However, there was also an opposite example: “The V9 region of some Chlamydomonas is very similar to that of prasinophytes clade VII A5”. The authors emphasized the importance of the existence of reference sequences in databases, the absence of which, for instance, prevented the assessment of Dolichomastigales (Chlorophyta and Mamiellophyceae) diversity using V9. Similar results were demonstrated on marine picoeukaryotes [53], amoebae [54] and zoonotic trichomonads [55].

2.6. Internal Transcribed Spacer Ribosomal DNA (ITS) in Metabarcoding Researches

The ITS region is the accepted DNA barcode for fungi and a strong locus for delimiting or identifying species from different algal groups, such as Chlorophyta, Dinophyceae, Chrysophyceae, Xanthophyceae and Eustigmatophyceae [8][56]. Therefore, the usage of this region for metabarcoding has positive prospects, with a high probability of identifying nucleotide sequences at the species level.
The V9-ITS1 region of the 18S was chosen for the large-scale research of freshwater protists from 217 freshwater lakes across Europe [57][58][59]. The studies were aimed at identifying the diversity dynamic of the protist communities relative to the geographic distance and mountain range structures  [57], centers of endemism [59] and models of interactions between the protist community and bacteria [58]. In regard to algae, the diversity of the following groups was determined in these studies: Dinophyceae, Chrysophyceae, diatoms, Cryptophyta and Viridiplantae (green algae). 
The ITS2 gene region is the best marker for DNA barcoding of Chlorophyta. This marker resolves major green algae lineages (some with high bootstrap support), has a high resolution for taxonomic assessment (enables the most species to be distinguished) and a high level of universality (i.e., in primers for PCR) [60] (and references in it). This region was successfully used in the first studies of the diversity of Viridiplantae (including green microalgae) in the Antarctic using the metabarcoding approach in soil and rock surfaces samples [28][61], sediments from lakes [36] and glacial ice [35]. The interpretation of sequences was carried out using the PLANiTS2 database [62], and most of the taxa were identified to the species level.

2.7. Gene Markers for Diatoms

Diatoms are well-known ecological indicators of aquatic ecosystems and are widely used for routine monitoring. Indexes of the water quality in rivers and lakes have been developed on the basis of diatoms and are used in EU countries (the Water Framework Directive in Europe), the USA (the National Water Quality Assessment Program in the USA), Canada, Australia and New Zealand [31][63][64][65]. Therefore, adapting the metabarcoding method for use as a tool for ecological assessment is a relevant task of modern research.
Visco et al. [66] showed a strong similarity between the DI-CH (the Swiss Diatom Index) values inferred from microscopic and V4 18S NGS analyses of diatom communities. However, the authors noted that the interspecies variability of this barcode might change between different genera, and its effectiveness would depend on the taxonomic composition of the diatom community. The V4 resolution did not allow to unambiguously assign Navicula species, but it was sufficient to distinguish most of the species of Nitzschia and Gomphonema.
The rbcL gene marker has a wider application for studying diatom communities, and thanks to the establishment of a quality reference database Diat.barcode/R-syst:diatom [31], it can already be considered the standard for diatom metabarcoding.
The resolution of the rbcL 312 bp marker on the level of intraspecific and cryptic diversity was successfully demonstrated by Pérez-Burillo et al. [67]. Benthic diatom samples (n = 610) were studied with a special focus on several ecologically important diatom species that are also key for the Water Framework Directive monitoring of European rivers: Fistulifera saprophila, Achnanthidium minutissimum, Nitzschia inconspicua and Nitzschia soratensis. As a result, it was shown that intraspecific and cryptic diversity can be assessed and understood through the application of DNA metabarcoding. 
A longer rbcL region (331 bp) was suggested as a result of large-scale research (500 benthic samples from 250 sites in England) with the aim of adopting a metabarcoding approach for ecological status assessment using diatoms [17]. The choice of region was based on an analysis of 390 sequences from a database. Eleven conservative regions of the rbcL gene with >96% identity were identified. These regions were used for developing primers. Variable regions were also analyzed, and four of these showed good potential for species delimitation. 

2.8. Specific Primers Targeted to rbcL Region Detected a High Diversity of Eustigmatophyceae

A high diversity of Eustigmatophyceae was found in environmental DNA samples with the help of new specific primers targeted at the rbcL region [32]. The authors compared their results to previous studies concerning Eustigmatophyceae and concluded that diversity of this group was underestimated. The designed primers allowed to detect 184 ASV haplotypes that were either Eustigmatophyceae (179) or possibly Eustigmatophyceae (15), while, in previous works, representatives of this group were reported only as rare or single finds. The sensitivity of eustigmatophyte-directed rbcL primers was compared higher to universal eukaryotic 18S primers. The authors suggested that the employed techniques can be used for future studies of the population structure, ecology, distribution and diversity of this class.

2.9. Comparison of rbcL and 18S Markers for Freshwater Diatoms Biomonitoring

Inconclusive results were obtained in a study using the rbcL and V4 18S rRNA markers [68] for benthic diatoms biomonitoring in freshwater habitats of Northern Europe. The classes of ecological condition differed significantly depending on the used method: only48% of samples with the 18S marker and 37.5% of samples with the rbcL marker had the same ecological status as with the morphological analysis. The assessment of the ecological conditions gave different results using different markers.

2.10. A 23S rDNA Plastid Marker for Simultaneous Detection of Eukaryotic Algae and Cyanobacteria

The universal plastid amplicon (UPA) is the variable Domain V of the 23S plastid rRNA gene ∼330 bp in length. This region was proposed by Sherwood and Presting [69] as a marker for plastid-containing organisms, i.e., all lineages of eukaryotic algae and Cyanobacteria. In this research, a single pair of universal primers was designed, and it was indicated that these exact priming sequences are present only in cyanobacteria and plastids. However, comparisons with other markers showed the insufficient effectiveness of UPA.

2.11. The 16S rRNA Gene as a Marker for Simultaneous Detection of Prokaryotes and Eukaryotes

The 16S rRNA gene was first proposed as a metabarcoding marker by Eiler et al. в 2013 [70] on the basis of it being universally present in prokaryotes (including cyanobacteria), as well as in chloroplasts of eukaryotes. This enabled the simultaneous detection of prokaryotic and eukaryotic phytoplankton taxa. The authors analyzed the phytoplankton diversity from 49 lakes, including three seasonal surveys, and assessed the data using NGS and microscopy. The NGS approach detected 1.5–2 times more OTUs than there were taxa found by the microscopy approach. A more detailed comparison of taxonomic groups revealed that Heterokonta, Euglenophyta, Cryptophyta and Dinophyta were overrepresented in the microscopic biovolume dataset compared to the NGS data, whereas Cyanobacteria were proportionally overrepresented in the NGS dataset compared to microscopic biovolume data. The authors noted that Dinophyta, a major phylum in microscopic data, was poorly detected by NGS in some lakes. Discrepancies also included Euglenophyta and Heterokonta that were scarce in the NGS but were frequently detected by microscopy. The NGS approach detected a deep-branching taxonomically unclassified cluster that could not be linked to any group identified by microscopy.
Recently, Bonfantine et al. [38] explored the potential of a standard V4 515F-806RB primer pair in recovering diatom plastid 16SrRNA sequences. PhytoREF was used to classify the 16S reads from 72 freshwater biofilm samples. Based on the Clustal nucleotide alignment, the authors confirmed the differences between eukaryotic chloroplast and prokaryotic sequences. “The Ochrophyta, and other eukaryote reads, showed high sequence conservation with no 30 mismatches in the last 5 bases of both forward and reverse 16S v4-515F and V4-806R primers. Two mismatches to the E. coli 16S rRNA (GT vs. TA) wereobserved across all aligned non-E. coli 16S RNA sequences 15 bases upstream of the V4-806 primer-binding site.” More than 90% of the diatom reads in each stream biofilm sample were identified. The authors found significant beta-diversity in diatom assemblages and discrimination among river segments. In an example of the three Australian environmental 16S rRNA datasets selected from NCBI-SRA, it was shown that most of the diatom OTUs (67 out of 71) were detected in other Australian ecosystems. As a result, the authors concluded that diatom plastid 16S rRNA genes are readily amplified with the standard primer sets. “Therefore, the volume of existing 16S rRNA amplicon datasets initially generated for microbial community profiling can also be used to detect, characterize, and map diatom distribution to inform phylogeny and ecological health assessments, and can be extended into a range of ecological and industrial applications.”

2.12. Comparing Approaches: Metabarcoding vs. Morphological Identification (Congruency between Methods)

Comparing the results acquired by using LM and NGS allows to reveal discrepancies between these methods and the causes of these discrepancies to determine the efficiency of the amplification of the chosen genetic markers and to identify problems in bioinformatical processing and taxonomic attribution. Every single work that compared the morphological and molecular approaches for studying the diversity of algal communities indicated a significant difference in the resulting taxonomic lists. The number of taxa detected by both approaches falls between 7.4 and 25.7%.
Some studies have shown that diversity detected by NGS is much higher than that found in LM. In a research of diatom diversity, Zimmermann et al. [71] reported that about 2.5 times more taxa were found by the NGS approach (263 taxa vs. 102 taxa in LM). In an example of studying benthic diatoms, Bailet et al. [68] showed that the metabarcoding method using the 18S marker revealed 27% more taxa than the morphological method and 38% more taxa using the rbcL marker. 
The challenges in metabarcoding analysis:
  • Gap in the reference database [72][73][66][74][75][76][77][78][79][69], etc.
  • The natural intraspecific and intragenomic variabilities of the barcoding marker (single taxon has multiple genotypes at the barcoding region, and members of that taxon might cluster into different Molecular Operational Taxonomic Units (MOTUs)) [80].
  • Cryptic diversity—a single morphological species can represent different genetic groups (e.g., diatoms Sellaphora pupula, Pinnularia borealis, Hantzschia amphioxys and Nitzschia inconspicua and species of Stichococcus, Coccomyxa, Chlorokybus, Cryptomonas, etc.) [81][82][83][84][85][86][87].
  • MOTU richness can be artificially inflated through technical errors at different steps of sample processing during amplification and sequencing [80].
  • The MOTU delimitation approach influences the richness estimation and interpretation [80] (and references in it) (asses[85]sment of the bioinformatics pipelines provided in [88][15]).
  • Complete absence of amplification on the whole due to a mismatch of the primer set used. For example, Salmaso et al. [89] did not find any species belonging to the Euglenales in the HTS results (with universal eukaryotic primers (TAReuk454FWD1 and TAReukREV3) for V4 18S), although they were present in LM. Hanžek et al. [90] reported that the taxa that contributed most to the biomass (Actinotaenium/Mesotaenium sp. and the species Cosmarium tenue, Pantocsekiella comensis, Sphaerocystis schroeteri and Synedropsis roundii) were not identified by eDNA metabarcoding (V9 18S region was amplified using the universal primer pair 1391F and EukB). Proeschöld and Darienko  noted that, although Stichococcus-like organisms are widely distributed in almost all habitats, they are not recorded in environmental studies based on HTS approaches, because the V4 or V9 regions of the SSU contain introns that obstruct amplification. Groendahl et al. [91] reported that Monorhaphidium sp., Selenastrum sp. and Trachelomonas sp. detected using the morphology-based approach were not identified by the metabarcoding approach, despite the fact that all three genera are included in the reference database.
  • Uncertainties and lack of sensitivity of reference databases for the selected DNA markers [89].
The challenges related to morphological identification:
  • Diatom extracellular skeletons are counted in LM even if they come from dead cells. The valves of dead cells can be transported from locations other than the target assemblage. Metabarcoding will not detect these dead cells [92].
  • The proportion of live diatoms found in environmental samples varies greatly, ranging from 2 to 98% [92].
    • Small-celled species and pico-sized cells are often overlooked or underestimated by the morphological approach. For example, the valves of Fistulifera saprophila tend to dissolve during sample processing, which can explain why this species is often missed during morphological identification [81][93][78][69].
  • LM misidentification (false LM positives) [69][73][94].
  • Differences in the detection limits of the two methods: morphological and molecular approaches do not give the same insight into communities of algae and, therefore, do not have the same detection capacity for species [69][73][95].
  • The different sample volumes settled for microscopy and metabarcoding [96].
  • Underlying units used for microscopy (individual cells) and those used for metabarcoding (ASV sequences) are quite different, making direct comparisons imperfect [96][97].
  • A short barcode gene fragment may have limited the taxonomic resolution [96]. For example, the resolution of the V4 18S region does not allow to unambiguously identify some species of Navicula [66]. For the V7–9 18S marker, a lack of intergenus taxonomic resolution was found (the MOTUs matched multiple genera, e.g., Alexandrium pseudogonyaulax and A. hiranoi, Chaetoceros neogracile and C. curvisetus and Thalassiosira eccentrica and T. antarctica) [98]. In some Chlamydomonas, the V9 region is very similar to that of prasinophytes clade VII A5 [52].

3. Conclusions

Metabarcoding has already been accepted as an alternative (faster and more economical) method to the traditional microscopy method for the ecological assessment and monitoring of freshwater bodies of water, rivers and seas based on microalgae. Protocols, technical guidelines or standards for eDNA monitoring are developed and/or approved in many countries [99][100][101][102][103]. Currently, there is no one perfect marker for identifying microalgae across the whole diversity. Among the most popular genetic barcodes for freshwater metabarcoding, it can highlight the nuclear regions V4 and V9 18S rRNA (which allow to determine the composition of auto- and heterotrophic eukaryotes) and the region of the plastid gene rbcL (for diatoms). The regions ITS1 and ITS2 might be underestimated, but they show good potential for usage as a microalgae barcode; they can be easily amplified with the standard primers and are variable enough to identify sequences to the species level.
Below, the main advantages (+) and disadvantages (−) of markers that are used for freshwater microalgae metabarcoding are summarized.
rbcL.
“+” widely used for diatom metabarcoding, distinguishes between taxa at the species and intraspecies levels; a high-quality curated reference database for taxonomic attribution Diat.barcode [31].
“−” extremely heterogeneous in green algae; does not have a set of universal primers [104]; only diatoms are identified well.
Notes: the majority of studies use the region with the length of 263 bp (312 bp including primers) and a complicated set of primers (three forward primers and two reverse primers) suggested by Vasselon et al. [72][105] (bcL primers Set 1). However, recently, it has been shown that a longer region of 331 bp (common region 263 bp, proposed by Kelly et al. [17][18], primers set rbcL 646F-rbcL 998R) has a higher resolution for species and intraspecific variants [106].
V4 18S.
«+» widely used for metabarcoding of marine and freshwater eukaryotes; successfully amplified by a universal primer set. The V4 region (named pre-barcode) was designated as the starting point for the identification of protists in the International Barcode of Life Consortium Project (iBOL, http://www.ibol.org/  (accessed on 22 May 2023) and the Protist Working Group (ProWG) [107]. Provides an understanding of molecular phylogenetic relationships.
«-» compared with V9 18S, misses haptophytes [54], many groups of heterotrophs and green algae from the classes Chloropicophyceae, Pyramimonadophyceae and Mamiellophyceae [108]. The V4 region is less variable compared to the V9 region. Often does not differentiate species.
V9 18S.
«+» widely used for marine and freshwater eukaryotes metabarcoding; successfully amplified by a universal primer set; was chosen to amplify eukaryotes in the global project “The Earth Microbiome Project” (EMP; http://www.earthmicrobiome.org (accessed on 22 May 2023)); V9 is more variable than V4; provides more OTUs and diversity on higher level taxa (supergroup and phylum)
«-» a short region (96 bp–134 bp [109]; sometimes does not differentiate species.
Notes: the V4 and V9 regions detect different taxonomic profiles at the genus, family and macro-taxa levels. Both markers are recommended for a more complete understanding of community structures.
ITS (V9-ITS1 or ITS2 regions are used for metabarcoding).
«+» is a strong locus for some algae (Chlorophyta, Dinophyceae, Eustigmatophyceae and Xanthophyceae) [56]; sufficiently variable; allows to differentiate species; amplified by a universal primer set. A specialized curated reference dataset “PLANiTS” [62] exists for interpreting data, including ITS1, ITS2 and entire ITS sequences of Viridiplantae [62].
«-» for diatoms, this barcode has a great variability in the length of region and a problem in intraindividual variation [110][111].
23S (UPA).
«+»  algal-specific markers focused on plastid-containing eukaryotic algae and Cyanobacteria; sufficiently  amplified by a universal primer set.
«-» a conserved region; low resolution; identifies taxa only to the genus level or higher; is not a strong locus for algae.
16S (V3–V4 and V4 regions is used for metabarcoding).
«+» focused on the chloroplasts of eukaryotes and prokaryotes (Cyanobacteria); sufficiently amplified by a universal primer set; can be used to simultaneously detect prokaryotes and eukaryotic algae.
«-» biased towards bacteria in the community; might not accurately reflect the phytoplankton diversity due to the endosymbiotic origin of chloroplasts; is not a strong locus for algae. 
Notes: is used very rarely for the simultaneous complex analysis of prokaryotes and eukaryotic algae.
In general, it should be taken into account that every marker will demonstrate a different image of the community that depends on successful amplification of the chosen region.
The identification of the taxonomic composition and the level of taxonomic attributiondepends on the region variability and the quality of the reference databases. 

This entry is adapted from the peer-reviewed paper 10.3390/biology12071038

References

  1. Adl, S.M.; Simpson, A.G.B.; Farmer, M.A.; Andersen, R.A.; Anderson, O.R.; Barta, J.R.; Bowser, S.S.; Brugerolle, G.; Fensome, R.A.; Fredericq, S.; et al. The new higher level classification of eukaryotes with emphasis on the taxonomy of protists. J. Eukaryot. Microbiol. 2005, 52, 399–451.
  2. Adl, S.M.; Simpson, A.G.B.; Lane, C.E.; Lukeš, J.; Bass, D.; Bowser, S.S.; Brown, M.W.; Burki, F.; Dunthorn, M.; Hampl, V.; et al. The revised classification of eukaryotes. J. Eukaryot. Microbiol. 2012, 59, 429–514.
  3. Kim, K.M.; Park, J.H.; Bhattacharya, D.; Yoon, H.S. Applications of next-generation sequencing to unravelling the evolutionary history of algae. Int. J. Syst. Evol. Microbiol. 2014, 64, 333–345.
  4. Burki, F. The convoluted evolution of eukaryotes with complex plastids. In Secondary Endosymbioses; Hirakawa, Y., Ed.; Elsevier: Tsukuba, Japan, 2017; Volume 84, pp. 1–30.
  5. Burki, F.; Roger, A.J.; Brown, M.W.; Simpson, A.G. The new tree of eukaryotes. Trends Ecol. Evol. 2020, 35, 43–55.
  6. Gololobova, M.A.; Belyakova, G.A. Position of Algae on the Tree of Life. Dokl. Biol. Sci. 2022, 507, 312–326. [Google Scholar] [CrossRef]
  7. Fawley, M.W.; Fawley, K.P. Identification of Eukaryotic Microalgal Strains. J. Appl. Phycol. 2020, 32, 2699–2709.
  8. Leliaert, F.; Verbruggen, H.; Vanormelingen, P.; Steen, F.; López-Bautista, J.M.; Zuccarello, G.C.; De Clerck, O. DNA-based species delimitation in algae. Eur. J. Phycol. 2014, 49, 179–196.
  9. Leray, M.; Knowlton, N. Random sampling causes the low reproducibility of rare eukaryotic OTUs in Illumina COI metabarcoding. PeerJ 2017, 3, e3006.
  10. Cahoon, A.B.; Huffman, A.G.; Krager, M.M.; Crowell, R.M. A meta-barcoding census of freshwater planktonic protists in Appalachia—Natural Tunnel State Park, Virginia, USA. Metabarcoding Metagenom. 2018, 2, e26939.
  11. Marcelino, V.; Verbruggen, H. Multi-marker metabarcoding of coral skeletons reveals a rich microbiome and diverse evolutionary origins of endolithic algae. Sci. Rep. 2016, 6, 31508.
  12. van der Loos, L.M.; Nijland, R. Biases in bulk: DNA metabarcoding of marine communities and the methodology involved. Mol. Ecol. 2021, 30, 3270–3288.
  13. White, T.J.; Bruns, T.D.; Lee, S.B.; Taylor, J.W. Amplification and Direct Sequencing of Fungal Ribosomal RNA Genes for Phylogenetics. In PCR Protocols: A Guide to Methods and Applications; Innis, M.A., Gelfand, D.H., Sninsky, J.J., White, T.J., Eds.; Academic Press: New York, NY, USA, 1990; pp. 315–322.
  14. Bailet, B.; Apothéloz-Perret-Gentil, L.; Baričević, A.; Chonova, T.; Franc, A.; Frigerio, J.; Kelly, M.; Mora, D.; Pfannkuchen, M.; Proft, S.; et al. Diatom DNA metabarcoding for ecological assessment: Comparison among bioinformatics pipelines used in six European countries reveals the need for standardization. Sci. Total Environ. 2020, 745, 140948.
  15. Czech, L.; Stamatakis, A.; Dunthorn, M.; Barbera, P. Metagenomic Analysis Using Phylogenetic Placement-A Review of the First Decade. Front. Bioinform. 2022, 26, 871393.
  16. Vasselon, V.; Domaizon, I.; Rimet, F.; Kahlert, M.; Bouchez, A. Application of high-throughput sequencing (HTS) metabarcoding to diatom biomonitoring: Do DNA extraction methods matter? Freshw. Sci. 2017, 36, 162–177.
  17. Kelly, M.; Boonham, N.; Juggins, S.; Kille, P.; Mann, D.; Pass, D.; Sapp, M.; Sato, S.; Glover, R. A DNA based diatom metabarcoding approach for classification of rivers. In Science Report SC140024/R; Environment Agency: Bristol, UK, 2018; p. 157.
  18. Kelly, M.; Boonham, N.; Juggins, S.; Mann, D.; Glover, R. Further development of a DNA based metabarcoding approach to assess diatom communities in rivers. Chief Scientist’s Group report. In Version: SC160014/R; Environment Agency: Bristol, UK, 2020; p. 133.
  19. Ruppert, K.M.; Kline, R.J.; Rahman, M.S. Past, present, and future perspectives of environmental DNA (eDNA) metabarcoding: A systematic review in methods, monitoring, and applications of global eDNA. Glob. Ecol. Conserv. 2019, 17, e00547.
  20. Bruce, K.; Blackman, R.; Bourlat, S.J.; Hellström, A.M.; Bakker, J.; Bista, I.; Bohmann, K.; Bouchez, A.; Brys, R.; Clark, K.; et al. A Practical Guide to DNA-Based Methods for Biodiversity Assessment; Pensoft Advanced Books: Sofia, Bulgaria, 2021.
  21. Pawlowski, J.; Bruce, K.; Panksep, K.; Aguirre, F.; Amalfitano, S.; Apothéloz-Perret-Gentil, L.; Baussant, T.; Bouchez, A.; Carugati, L.; Cermakova, K.; et al. Environmental DNA metabarcoding for benthic monitoring: A review of sediment sampling and DNA extraction methods. Sci. Total Environ. 2022, 818, 151783.
  22. Baricevic, A.; Chardon, C.; Kahlert, M.; Karjalainen, S.M.; Pfannkuchen, D.M.; Pfannkuchen, M.; Rimet, F.; Tankovic, M.S.; Trobajo, R.; Vasselon, V.; et al. Recommendations for the preservation of environmental samples in diatom metabarcoding studies. Metabarcoding Metagenom. 2022, 6, e85844.
  23. Kelly, M.G.; Juggins, S.; Mann, D.G.; Sato, S.; Glover, R.; Boonham, N.; Sapp, M.; Lewis, E.; Hany, U.; Kille, P.; et al. Development of a novel metric for evaluating diatom assemblages in rivers using DNA metabarcoding. Ecol. Indic. 2020, 118, 106725.
  24. Pawlowski, J.; Kelly-Quinn, M.; Altermatt, F.; Apothéloz-Perret-Gentil, L.; Beja, P.; Boggero, A.; Borja, A.; Bouchez, A.; Cordier, T.; Domaizon, I.; et al. The future of biotic indices in the ecogenomic era: Integrating (e)DNA metabarcoding in biological assessment of aquatic ecosystems. Sci. Total. Environ. 2018, 637–638, 1295–1310.
  25. Sagova-Mareckova, M.; Boenigk, J.; Bouchez, A.; Cermakova, K.; Chonova, T.; Cordier, T.; Eisendle, U.; Elersek, T.; Fazi, S.; Fleituch, T.; et al. Expanding ecological assessment by integrating microorganisms into routine freshwater biomonitoring. Water Res. 2021, 191, 116767.
  26. Pawlowski, J.; Apothéloz-Perret-Gentil, L.; Altermatt, F. Environmental DNA: What’s behind the term? Clarifying the terminology and recommendations for its future use in biomonitoring. Mol. Ecol. 2020, 29, 4258–4264.
  27. Nolte, V.; Pandey, R.V.; Jost, S.; Medinger, R.; Ottenwälder, B.; Boenigk, J.; Schlötterer, C. Contrasting seasonal niche separation between rare and abundant taxa conceals the extent of protist diversity. Mol. Ecol. 2010, 19, 2908–2915.
  28. White, T.J.; Bruns, T.D.; Lee, S.B.; Taylor, J.W. Amplification and Direct Sequencing of Fungal Ribosomal RNA Genes for Phylogenetics. In PCR Protocols: A Guide to Methods and Applications; Innis, M.A., Gelfand, D.H., Sninsky, J.J., White, T.J., Eds.; Academic Press: New York, NY, USA, 1990; pp. 315–322.
  29. Câmara, P.E.; Menezes, G.C.; Pinto, O.H.; Silva, M.C.; Convey, P.; Rosa, L.H. Using metabarcoding to assess Viridiplantae sequence diversity present in Antarctic glacial ice. Annu. Acad. Bras. Cienc. 2022, 94, e20201736.
  30. Fonseca, B.M.; Câmara, P.E.A.S.; Ogaki, M.B.; Pinto, O.; Lirio, J.; Coria, S.; Vieira, R.; Carvalho-Silva, M.; Amorim, E.; Convey, P.; et al. Green algae (Viridiplantae) in sediments from three lakes on Vega Island, Antarctica, assessed using DNA metabarcoding. Mol. Biol. Rep. 2022, 49, 179–188.
  31. Rimet, F.; Gusev, E.; Kahlert, M.; Kelly, M.; Kulikovskiy, M.; Maltsev, Y.; Mann, D.; Pfannkuchen, M.; Trobajo, R.; Vasselon, V.; et al. Diat. barcode, an open-access curated barcode library for diatoms. Sci. Rep. 2019, 9, 1–12.
  32. Fawley, M.W.; Fawley, K.P.; Cahoon, A.B. Finding needles in a haystack—Extensive diversity in the eustigmatophyceae revealed by community metabarcode analysis targeting the rbcL gene using lineage-directed primers. J. Phycol. 2021, 57, 1636–1647.
  33. Stoeck, T.; Bass, D.; Nebel, M.; Christen, R.; Jones, M.D.; Breiner, H.W.; Richards, T.A. Multiple marker parallel tag environmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water. Mol. Ecol. 2010, 1, 21–31.
  34. Quast, C.; Pruesse, E.; Yilmaz, P.; Gerken, J.; Schweer, T.; Yarza, P.; Peplies, J.; Glöckner, F.O. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Res. 2013, 41, D590–D596.
  35. Câmara, P.E.; Menezes, G.C.; Pinto, O.H.; Silva, M.C.; Convey, P.; Rosa, L.H. Using metabarcoding to assess Viridiplantae sequence diversity present in Antarctic glacial ice. Annu. Acad. Bras. Cienc. 2022, 94, e20201736.
  36. Fonseca, B.M.; Câmara, P.E.A.S.; Ogaki, M.B.; Pinto, O.; Lirio, J.; Coria, S.; Vieira, R.; Carvalho-Silva, M.; Amorim, E.; Convey, P.; et al. Green algae (Viridiplantae) in sediments from three lakes on Vega Island, Antarctica, assessed using DNA metabarcoding. Mol. Biol. Rep. 2022, 49, 179–188.
  37. Banchi, E.; Ametrano, C.G.; Greco, S.; Stanković, D.; Muggia, L.; Pallavicini, A. PLANiTS: A curated sequence reference dataset for plant ITS DNA metabarcoding. Database 2020, 2020, baz155.
  38. Bonfantine, K.L.; Trevathan-Tackett, S.M.; Matthews, T.G.; Neckovic, A.; Gan, H.M. Dumpster diving for diatom plastid 16S rRNA genes. PeerJ. 2021, 9, e11576.
  39. Decelle, J.; Romac, S.; Stern, R.F.; Bendif, E.M.; Zingone, A.; Audic, S.; Guiry, M.; Guillou, L.; Tessier, D.; Le Gall, F.; et al. PhytoREF: A reference database of the plastidial 16S rRNA gene of photosynthetic eukaryotes with curated taxonomy. Mol. Ecol. Resour. 2015, 15, 1435–1445.
  40. Evans, K.M.; Wortley, A.H.; Mann, D.G. An assessment of potential diatom “barcode” genes (cox1, rbcL, 18S and ITS rDNA) and their effectiveness in determining relationships in Sellaphora (Bacillariophyta). Protist 2007, 158, 349–364.
  41. Moniz, M.B.J.; Kaczmarska, I. Barcoding diatoms: Is there a good marker? Mol. Ecol. Resour. 2009, 9, 65–74.
  42. Hamsher, S.E.; Evans, K.M.; Mann, D.G.; Poulickova, A.; Saunders, G.W. Barcoding diatoms: Exploring alternatives to COI-5P. Protist 2011, 162, 405–422.
  43. MacGillivary, M.L.; Kaczmarska, I. Survey of the Efficacy of a Short Fragment of the rbcL Gene as a Supplemental DNA Barcode for Diatoms. J. Eukaryot. Microbiol. 2011, 58, 529–536.
  44. Kermarrec, L.; Franc, A.; Rimet, F.; Chaumeil, P.; Humbert, J.F.; Bouchez, A. Next-generation sequencing to inventory taxonomic diversity in eukaryotic communities: A test for freshwater diatoms. Mol. Ecol. Resour. 2013, 13, 607–619.
  45. Hall, J.D.; Fučíková, K.; Lo, C.; Lewis, L.A.; Karol, K.G. An assessment of proposed DNA barcodes in freshwater green algae. Cryptogam. Algol. 2010, 31, 529–555.
  46. Tanabe, A.S.; Nagai, S.; Hida, K.; Yasuike, M.; Fujiwara, A.; Nakamura, Y.; Takano, Y.; Katakura, S. Comparative study of the validity of three regions of the 18S-rRNA gene for massively parallel sequencing-based monitoring of the planktonic eukaryote community. Mol. Ecol. Resour. 2016, 16, 402–414.
  47. Hadziavdic, K.; Lekang, K.; Lanzen, A.; Jonassen, I.; Thompson, E.M.; Troedsson, C. Characterization of the 18S rRNA Gene for Designing Universal Eukaryote Specific Primers. PLoS ONE 2014, 9, e87624.
  48. Bradley, I.M.; Pinto, A.J.; Guest, J.S. Design and evaluation of Illumina MiSeq-compatible, 18S rRNA gene-specific primers for improved characterization of mixed phototrophic communities. Appl. Environ. Microbiol. 2016, 82, 5878–5891.
  49. Bradley, I.M.; Pinto, A.J.; Guest, J.S. Design and evaluation of Illumina MiSeq-compatible, 18S rRNA gene-specific primers for improved characterization of mixed phototrophic communities. Appl. Environ. Microbiol. 2016, 82, 5878–5891.
  50. Choi, J.; Park, J.S. Comparative analyses of the V4 and V9 regions of 18S rDNA for the extant eukaryotic community using the Illumina platform. Sci. Rep. 2020, 10, 6519.
  51. Hirakata, Y.; Hatamoto, M.; Oshiki, M.; Watari, T.; Kuroda, K.; Araki, N.; Yamaguchi, T. Temporal variation of eukaryotic community structures in UASB reactor treating domestic sewage as revealed by 18S rRNA gene sequencing. Sci. Rep. 2019, 9, 12783.
  52. Tragin, M.; Zingone, A.; Vaulot, D. Comparison of coastal phytoplankton composition estimated from the V4 and V9 regions of the 18S rRNA gene with a focus on photosynthetic groups and especially Chlorophyta. Environ. Microbiol. 2018, 20, 506–520.
  53. Giner, C.R.; Forn, I.; Romac, S.; Logares, R.; de Vargas, C.; Massana, R. Environmental Sequencing Provides Reasonable Estimates of the Relative Abundance of Specific Picoeukaryotes. Appl. Environ. Microbiol. 2016, 82, 4757–4766.
  54. Maritz, J.M.; Rogers, K.H.; Rock, T.M.; Liu, N.; Joseph, S.; Land, K.; Carlton, J. An 18S rRNA Workflow for Characterizing Protists in Sewage, with a Focus on Zoonotic Trichomonads. Microb. Ecol. 2017, 74, 923–936.
  55. Zheng, X.; He, Z.; Wang, C.; Yan, Q.; Shu, L. Evaluation of different primers of the 18S rRNA gene to profile amoeba communities in environmental samples. Water Biol. Secur. 2022, 1, 100057.
  56. Fawley, M.W.; Fawley, K.P. Identification of Eukaryotic Microalgal Strains. J. Appl. Phycol. 2020, 32, 2699–2709.
  57. Boenigk, J.; Wodniok, S.; Bock, C.; Beisser, D.; Hempel, C.; Grossmann, L.; Lange, A.; Jensen, M. Geographic distance and mountain ranges structure freshwater protist communities on a European scale. Metabarcoding Metagenom. 2018, 2, e21519.
  58. Bock, C.; Jensen, M.; Forster, D.; Marks, S.; Nuy, J.; Psenner, R.; Beisser, D.; Boenigk, J. Factors shaping community patterns of protists and bacteria on a European scale. Environ. Microbiol. 2020, 22, 2243–2260.
  59. Olefeld, J.L.; Bock, C.; Jensen, M.; Vogt, J.; Sieber, G.; Albach, D.; Boenigk, J. Centers of endemism of freshwater protists deviate from pattern of taxon richness on a continental scale. Sci. Rep. 2020, 10, 14431.
  60. Buchheim, M.A.; Keller, A.; Koetschan, C.; Förster, F.; Merget, B.; Wolf, M. Internal Transcribed Spacer 2 (nu ITS2 rRNA) Sequence-Structure Phylogenetics: Towards an Automated Reconstruction of the Green Algal Tree of Life. PLoS ONE 2011, 6, e16931.
  61. Câmara, P.E.A.S.; de Menezes, G.C.A.; Oliveira, F.S.; Souza, C.D.; Amorim, E.T.; Schaefer, C.E.; Convey, P.; Pinto, O.H.; Carvalho-Silva, M.; Rosa, L.H. Diversity of Viridiplanta e DNA present on rock surfaces in the Ellsworth Mountains, continental Antarctica. Polar. Biol. 2022, 45, 637–646.
  62. Banchi, E.; Ametrano, C.G.; Greco, S.; Stanković, D.; Muggia, L.; Pallavicini, A. PLANiTS: A curated sequence reference dataset for plant ITS DNA metabarcoding. Database 2020, 2020, baz155.
  63. Wang, Y.-K.; Stevenson, R.J.; Metzmeier, L. Development and evaluation of a diatom-based Index of Biotic Integrity for the Interior Plateau Ecoregion, USA. J. North Am. Benthol. Soc. 2005, 24, 990–1008.
  64. Chessman, B.C.; Bate, N.; Gell, P.A.; Newall, P. A diatom species index for bioassessment of Australian rivers. Mar. Freshw. Res. 2007, 58, 542–557.
  65. Lavoie, I.; Hamilton, P.B.; Wang, Y.-K.; Dillon, P.J.; Campeau, S. A comparison of stream bioassessment in Québec (Canada) using six European and North American diatom-based indices. Nova Hedwig. 2009, 135, 37–56.
  66. Visco, J.A.; Apothéloz-Perret-Gentil, L.; Cordonier, A.; Esling, P.; Pillet, L.; Pawlowski, J. Environmental Monitoring: Inferring the Diatom Index from Next-Generation Sequencing Data. Environ. Sci. Technol. 2015, 49, 7597–7605.
  67. Pérez-Burillo, J.; Trobajo, R.; Leira, M.; Keck, F.; Rimet, F.; Sigró, J.; Mann, D.G. DNA metabarcoding reveals differences in distribution patterns and ecological preferences among genetic variants within some key freshwater diatom species. Sci. Total Environ. 2021, 798, 149029.
  68. Bailet, B.; Bouchez, A.; Franc, A.; Frigerio, J.-M.; Keck, F.; Karjalainen, S.-M.; Rimet, F.; Schneider, S.; Kahlert, M. Molecular versus morphological data for benthic diatoms biomonitoring in Northern Europe freshwater and consequences for ecological status. Metabarcoding Metagenom. 2019, 3, e34002.
  69. Brown, P.D.; Craine, J.M.; Richards, D.; Chapman, A.; Marden, B. DNA metabarcoding of the phytoplankton of Great Salt Lake’s Gilbert Bay: Spatiotemporal assemblage changes and comparisons to microscopy. J. Great Lakes Res. 2022, 48, 110–124.
  70. Eiler, A.; Drakare, S.; Bertilsson, S.; Pernthaler, J.; Peura, S.; Rofner, C.; Simek, K.; Yang, Y.; Znachor, P.; Lindström, E. Unveiling Distribution Patterns of Freshwater Phytoplankton by a Next Generation Sequencing Based Approach. PLoS ONE 2013, 8, e53516.
  71. Zimmermann, J.; Glöckner, G.; Jahn, R.; Enke, N.; Gemeinholzer, B. Metabarcoding vs. morphological identification to assess diatom diversity in environmental studies. Mol. Ecol. Resour. 2015, 15, 526–542.
  72. Vasselon, V.; Domaizon, I.; Rimet, F.; Kahlert, M.; Bouchez, A. Application of high-throughput sequencing (HTS) metabarcoding to diatom biomonitoring: Do DNA extraction methods matter? Freshw. Sci. 2017, 36, 162–177.
  73. Salmaso, N.; Vasselon, V.; Rimet, F.; Vautier, M.; Elersek, T.; Boscaini, A.; Donati, C.; Moretto, M.; Pindo, M.; Riccioni, G.; et al. DNA sequence and taxonomic gap analyses to quantify the coverage of aquatic cyanobacteria and eukaryotic microalgae in reference databases: Results of a survey in the Alpine region. Sci. Total. Environ. 2022, 834, 155175.
  74. Bailet, B.; Bouchez, A.; Franc, A.; Frigerio, J.-M.; Keck, F.; Karjalainen, S.-M.; Rimet, F.; Schneider, S.; Kahlert, M. Molecular versus morphological data for benthic diatoms biomonitoring in Northern Europe freshwater and consequences for ecological status. Metabarcoding Metagenom. 2019, 3, e34002.
  75. Zimmermann, J.; Abarca, N.; Enke, N.; Skibbe, O.; Kusber, W.-H.; Jahn, R. Taxonomic reference libraries for environmental barcoding: A best practice example from diatom research. PLoS ONE 2014, 9, e108793.
  76. Seymour, M.; Edwards, F.; Cosby, B.; Kelly, M.; Bruyn, M.; Carvalho, G.; Creer, S. Executing multi-taxa eDNA ecological assessment via traditional metrics and interactive networks. Sci. Total. Environ. 2020, 729, 138801.
  77. Abad, D.; Albaina, A.; Aguirre, M.; Laza-Martínez, A.; Uriarte, I.; Iriarte, A.; Villate, F.; Estonba, A. Is metabarcoding suitable for estuarine plankton monitoring? A comparative study with microscopy. Mar. Biol. 2016, 163, 149.
  78. Pissaridou, P.; Cantonati, M.; Bouchez, A.; Tziortzis, I.; Dörflinger, G.; Vasquez, M.I. How can integrated morphotaxonomy- and metabarcoding-based diatom assemblage analyses best contribute to the ecological assessment of streams? Metabarcoding Metagenom. 2021, 5, e68438.
  79. Vasselon, V.; Rimet, F.; Tapolczai, K.; Bouchez, A. Assessing ecological status with diatoms DNA metabarcoding: Scaling-up on a WFD monitoring network (Mayotte island, France). Ecol. Indic. 2017, 82, 1–12.
  80. Mora, D.; Abarca, N.; Proft, S.; Grau, J.H.; Enke, N.; Carmona, J.; Skibbe, O.; Jahn, R.; Zimmermann, J. Morphology and metabarcoding: A test with stream diatoms from Mexico highlights the complementarity of identification methods. Freshw. Sci. 2019, 38, 448–464.
  81. Rivera, S.F.; Vasselon, V.; Ballorain, K.; Carpentier, A.; Wetzel, C.; Ector, L.; Bouchez, A.; Rimet, F. DNA metabarcoding and microscopic analyses of sea turtles biofilms: Complementary to understand turtle behavior. PLoS ONE 2018, 13, e0195770.
  82. Mann, D.G.; Evans, K.M. The species concept and cryptic diversity, Moestrup, Ø, Eds. In Proceedings of the 12th International Conference on Harmful Algae, International Society for the Study of Harmful Algae and Intergovernmental Oceanographic Commission of UNESCO, Copenhagen, Sweden, 4–8 August 2008; pp. 262–268.
  83. Souffreau, C.; Vanormelingen, P.; Van de Vijver, B.; Isheva, T.; Verleyen, E.; Sabbe, K.; Vyverman, W. Molecular evidence for distinct Antarctic lineages in the cosmopolitan terrestrial diatoms Pinnularia borealis and Hantzschia amphioxys. Protist 2013, 164, 101–115.
  84. Darienko, T.; Gustavs, L.; Eggert, A.; Wolf, W.; Pröschold, T. Evaluating the Species Boundaries of Green Microalgae (Coccomyxa, Trebouxiophyceae, Chlorophyta) Using Integrative Taxonomy and DNA Barcoding with Further Implications for the Species Identification in Environmental Samples. PLoS ONE 2015, 10, e0127838.
  85. Proeschöld, T.; Darienko, T. The green puzzle Stichococcus (Trebouxiophyceae, Chlorophyta): New generic and species concept among this widely distributed genus. Phytotaxa 2020, 441, 113–142.
  86. Irisarri, I.; Darienko, T.; Pröschold, T.; Fürst-Jansen, J.M.R.; Jamy, M.; de Vries, J. Unexpected cryptic species among streptophyte algae most distant to land plants. Proc. Biol. Sci. 2021, 288, 1963.
  87. Hoef-Emden, K. Revision of the genus Chroomonas Hansgirg: The benefits of DNA-containing specimens. Protist 2018, 169, 662–681.
  88. Bailet, B.; Apothéloz-Perret-Gentil, L.; Baričević, A.; Chonova, T.; Franc, A.; Frigerio, J.; Kelly, M.; Mora, D.; Pfannkuchen, M.; Proft, S.; et al. Diatom DNA metabarcoding for ecological assessment: Comparison among bioinformatics pipelines used in six European countries reveals the need for standardization. Sci. Total Environ. 2020, 745, 140948.
  89. Salmaso, N.; Vasselon, V.; Rimet, F.; Vautier, M.; Elersek, T.; Boscaini, A.; Donati, C.; Moretto, M.; Pindo, M.; Riccioni, G.; et al. DNA sequence and taxonomic gap analyses to quantify the coverage of aquatic cyanobacteria and eukaryotic microalgae in reference databases: Results of a survey in the Alpine region. Sci. Total. Environ. 2022, 834, 155175.
  90. Hanžek, N.; Udovič, M.G.; Kajan, K.; Borics, G.; Várbíró, G.; Stoeck, T.; Žutinić, P.; Orlić, S.; Stanković, I. Assessing ecological status in karstic lakes through the integration of phytoplankton functional groups, morphological approach and environmental DNA metabarcoding. Ecol. Indic. 2021, 131, 108166.
  91. Groendahl, S.; Kahlert, M.; Fink, P. The best of both worlds: A combined approach for analyzing microalgal diversity via metabarcoding and morphology-based methods. PLoS ONE 2017, 12, e0172808.
  92. Mora, D.; Abarca, N.; Proft, S.; Grau, J.H.; Enke, N.; Carmona, J.; Skibbe, O.; Jahn, R.; Zimmermann, J. Morphology and metabarcoding: A test with stream diatoms from Mexico highlights the complementarity of identification methods. Freshw. Sci. 2019, 38, 448–464.
  93. Pérez-Burillo, J.; Trobajo, R.; Leira, M.; Keck, F.; Rimet, F.; Sigró, J.; Mann, D.G. DNA metabarcoding reveals differences in distribution patterns and ecological preferences among genetic variants within some key freshwater diatom species. Sci. Total Environ. 2021, 798, 149029.
  94. Rivera, S.F.; Vasselon, V.; Ballorain, K.; Carpentier, A.; Wetzel, C.; Ector, L.; Bouchez, A.; Rimet, F. DNA metabarcoding and microscopic analyses of sea turtles biofilms: Complementary to understand turtle behavior. PLoS ONE 2018, 13, e0195770.
  95. Vasselon, V.; Rimet, F.; Domaizon, I.; Monnier, O.; Reyjol, Y.; Bouchez, A. Assessing pollution of aquatic environments with diatoms’ DNA metabarcoding: Experience and developments from France Water Framework Directive networks. Metabarcoding Metagenom. 2019, 3, e39646.
  96. MacKeigan, P.W.; Garner, R.E.; Monchamp, M.È.; Walsh, D.; E Onana, V.; Kraemer, S.; Pick, F.; Beisner, B.; Agbeti, M.; Barbosa da Costa, N.; et al. Comparing microscopy and DNA metabarcoding techniques for identifying cyanobacteria assemblages across hundreds of lakes. Harmful Algae 2022, 113, 102187.
  97. Pawlowski, J.; Kelly-Quinn, M.; Altermatt, F.; Apothéloz-Perret-Gentil, L.; Beja, P.; Boggero, A.; Borja, A.; Bouchez, A.; Cordier, T.; Domaizon, I.; et al. The future of biotic indices in the ecogenomic era: Integrating (e)DNA metabarcoding in biological assessment of aquatic ecosystems. Sci. Total. Environ. 2018, 637–638, 1295–1310.
  98. Dzhembekova, N.; Moncheva, S.; Ivanova, P.; Slabakova, N.; Nagai, S. Biodiversity of phytoplankton cyst assemblages in surface sediments of the Black Sea based on metabarcoding. Biotechnol. Biotechnol. Equip. 2018, 32, 1507–1513.
  99. CEN/TR 17245:2018; Water Quality—Technical Report for the Routine Sampling of Benthic Diatoms from Rivers and Lakes Adapted for Metabarcoding Analyses. iTeh Standards: Newark, DE, USA, 2018.
  100. CEN/TR 17244:2018; Water Quality—Technical Report for the Management of Diatom Barcodes. iTeh Standards: Newark, DE, USA, 2018; CEN/TC 230/WG 23—Aquatic Macrophytes and Algae: 2018.
  101. ie, Y.; Giesy, J.P. UofS-ETL-EDNA-30 Metabarcoding of Cyanobacteria Assembly; University of Saskatchewan: Saskatoon, SK, Canada, 2018; Version 1.
  102. Yarimizu, K.; Fujiyoshi, S.; Kawai, M.; Norambuena-Subiabre, L.; Cascales, E.-K.; Rilling, J.-I.; Vilugrón, J.; Cameron, H.; Vergara, K.; Morón-López, J.; et al. Protocols for monitoring harmful algal blooms for sustainable aquaculture and coastal fisheries in Chile. Int. J. Env. Res. Pub. Health 2020, 17, 7642.
  103. Jerney, J.; Hällfors, H.; Oja, J.; Reunamo, A.; Suikkanen, S.; Lehtinen, S. Guidelines for using environmental DNA in Finnish marine phytoplankton monitoring–Improved biodiversity assessment through method complementation. Rep. Finn. Environ. Inst. 2022, 40, 69.
  104. Buchheim, M.A.; Keller, A.; Koetschan, C.; Förster, F.; Merget, B.; Wolf, M. Internal Transcribed Spacer 2 (nu ITS2 rRNA) Sequence-Structure Phylogenetics: Towards an Automated Reconstruction of the Green Algal Tree of Life. PLoS ONE 2011, 6, e16931.
  105. Vasselon, V.; Rimet, F.; Tapolczai, K.; Bouchez, A. Assessing ecological status with diatoms DNA metabarcoding: Scaling-up on a WFD monitoring network (Mayotte island, France). Ecol. Indic. 2017, 82, 1–12.
  106. Pérez-Burillo, J.; Mann, D.G.; Trobajo, R. Evaluation of two short overlapping rbcL markers for diatom metabarcoding of environmental samples: Effects on biomonitoring assessment and species resolution. Chemosphere 2022, 307, 135933.
  107. Pawlowski, J.; Audic, S.; Adl, S.; Bass, D.; Belbahri, L.; Berney, C.; Bowser, S.; Cepicka, I.; Decelle, J.; Dunthorn, M.; et al. CBOL protist working group: Barcoding eukaryotic richness beyond the animal, plant, and fungal kingdoms. PLoS Biol. 2012, 10, e1001419.
  108. Choi, J.; Park, J.S. Comparative analyses of the V4 and V9 regions of 18S rDNA for the extant eukaryotic community using the Illumina platform. Sci. Rep. 2020, 10, 6519.
  109. Kutty, S.N.; Loh, R.K.; Bannister, W.; Taylor, D. Evaluation of a diatom eDNA-based technique for assessing water quality variations in tropical lakes and reservoirs. Ecol. Indic. 2022, 141, 109108.
  110. Behnke, A.; Friedl, T.; Chepurnov, V.A.; Mann, D.G. Reproductive Compatibility and rDNA Sequence Analyses in the Sellaphora Pupula Species Complex (Bacillariophyta). J. Phycol. 2004, 40, 193–208.
  111. Hamsher, S.E.; Evans, K.M.; Mann, D.G.; Poulickova, A.; Saunders, G.W. Barcoding diatoms: Exploring alternatives to COI-5P. Protist 2011, 162, 405–422.
More
This entry is offline, you can click here to edit this entry!
Video Production Service