Functional Genomics Differentiate Inherent and Environmentally Influenced Traits in Dinoflagellate and Diatom Communities.

Dinoflagellates and diatoms are among the most prominent microeukaryotic plankton groups, and they have evolved different functional traits reflecting their roles within ecosystems. However, links between their metabolic processes and functional traits within different environmental contexts warrant further study. The functional biodiversity of dinoflagellates and diatoms was accessed with metatranscriptomics using Pfam protein domains as proxies for functional processes. Despite the overall geographic similarity of functional responses, abiotic (i.e., temperature and salinity; ~800 Pfam domains) and biotic (i.e., taxonomic group; ~1500 Pfam domains) factors influencing particular functional responses were identified. Salinity and temperature were identified as the main drivers of community composition. Higher temperatures were associated with an increase of Pfam domains involved in energy metabolism and a decrease of processes associated with translation and the sulfur cycle. Salinity changes were correlated with the biosynthesis of secondary metabolites (e.g., terpenoids and polyketides) and signal transduction processes, indicating an overall strong effect on the biota. The abundance of dinoflagellates was positively correlated with nitrogen metabolism, vesicular transport and signal transduction, highlighting their link to biotic interactions (more so than diatoms) and suggesting the central role of species interactions in the evolution of dinoflagellates. Diatoms were associated with metabolites (e.g., isoprenoids and carotenoids), as well as lysine degradation, which highlights their ecological role as important primary producers and indicates the physiological importance of these metabolic pathways for diatoms in their natural environment. These approaches and gathered information will support ecological questions concerning the marine ecosystem state and metabolic interactions in the marine environment.


Introduction
The marine environment covers more than 70% of the world's surface yet much remains to be described of its structure and function. The ocean is not structured by hard environmental barriers and is considered a highly connected system. Regarding marine microeukaryotes, this has led to a discussion about "everything is everywhere but the environments selects" [1]. However, oceanic provinces have been defined according to Longhurst [2] and confirmed by water chemistry [3] and biodiversity [4] patterns. According to Baas-Becking [1], local environmental processes mediate species diversity and molecular data show regional and global differences in biodiversity and corresponding community structures [5][6][7].
To improve our understanding of the interplay between taxonomic and functional diversity of a community relative to environmental influence, this study distinguishes functional processes that may be affected by changing environmental conditions from those functions that are affected by biodiversity (species identity). The overall aim of this study is to link the biodiversity and functioning of microeukaryotic communities. In particular, it focuses on: (1) the influence of environmental parameters, i.e., temperature, salinity and inorganic macronutrients, on the functional biodiversity of microeukaryotic communities in arctic and subarctic/temperate waters; (2) the identification of functional processes influenced by these environmental parameters; and (3) the identification of functional processes associated with diatoms and dinoflagellates, the latter group of which exhibits more diverse lifestyles and can have different roles in the marine ecosystem (autotrophic, mixotrophic, heterotrophic, symbiotic and parasitic).

Field Sampling for Plankton and Environmental Parameters
Sampling was performed during the MSM-21/3 ARCHEMHAB expedition [38]  may be affected by changing environmental conditions from those functions that are affected by biodiversity (species identity). The overall aim of this study is to link the biodiversity and functioning of microeukaryotic communities. In particular, it focuses on: (1) the influence of environmental parameters, i.e., temperature, salinity and inorganic macronutrients, on the functional biodiversity of microeukaryotic communities in arctic and subarctic/temperate waters; (2) the identification of functional processes influenced by these environmental parameters; and (3) the identification of functional processes associated with diatoms and dinoflagellates, the latter group of which exhibits more diverse lifestyles and can have different roles in the marine ecosystem (autotrophic, mixotrophic, heterotrophic, symbiotic and parasitic).

Field Sampling for Plankton and Environmental Parameters
Sampling was performed during the MSM-21/3 ARCHEMHAB expedition [38]   Samples for inorganic nutrient measurements (ammonium, nitrate, nitrite, phosphate and silicate) were directly taken from the Niskin bottles, transferred to polypropylene bottles (50 mL) and frozen immediately at −20 °C. Nutrient samples were analyzed with a continuous-flow autoanalyzer (Evolution III, Alliance Instruments, Freilassing, Germany)) based upon standard seawater analytical methods for determination of nitrate and nitrite [39], ammonium [40], silicate [41] and phosphate [42].
In situ chlorophyll-a concentrations were analyzed by filtering 1.0-2.0 L of seawater through glass microfiber filters (Whatman GF/F, Whatman, Maidstone, UK; nominal pore size: 0.7 µm). The Samples for inorganic nutrient measurements (ammonium, nitrate, nitrite, phosphate and silicate) were directly taken from the Niskin bottles, transferred to polypropylene bottles (50 mL) and frozen immediately at −20 • C. Nutrient samples were analyzed with a continuous-flow autoanalyzer (Evolution III, Alliance Instruments, Freilassing, Germany)) based upon standard seawater analytical methods for determination of nitrate and nitrite [39], ammonium [40], silicate [41] and phosphate [42].
In situ chlorophyll-a concentrations were analyzed by filtering 1.0-2.0 L of seawater through glass microfiber filters (Whatman GF/F, Whatman, Maidstone, UK; nominal pore size: 0.7 µm). The filtration unit was covered to exclude light, and sample filters were frozen immediately at −80 • C until laboratory analysis. Pigments were extracted by sonication of filters in 10 mL 90% acetone and then incubated overnight in darkness at 4 • C. The extract was centrifuged at 3020× g for 10 min, and the fluorescence of the supernatant was determined at 665 nm (TD-700 fluorometer, Turner Designs, Sunnyvale, USA). Extracted chlorophyll-a data are archived in the data repository PANGAEA [43].
Seawater for DNA extraction was integratively sampled from the surface (1 to 30 m) through a hose connected to a shipboard membrane pump (QBY-25-SS; AGB-Pumpen, Hamburg, Germany). In total, 60 L of seawater per sampling station was passed over a sequential filter tower with progressively decreasing nylon mesh-sizes (200, 50 and 20 µm) to obtain distinct plankton size-fractions. For this study, the microplankton (20-50 µm) size fraction was processed.
Retentates from the filter tower and membrane filters for molecular analyses were back-washed into centrifuge tubes with 0.2 µm-filtered seawater, adjusted to a total volume of 45 mL and split into three 15 mL aliquots. Cell suspensions were centrifuged for 15 min at 3600× g at 4 • C, after which the pellet aliquots were separated for subsequent DNA and RNA analysis. Pellets for DNA analysis were resuspended in a 400 µL AP1 lysis buffer (Qiagen, Hilden, Germany), whereas pellets for RNA analysis were resuspended in Tri Reagent (Sigma Aldrich, Darmstadt, Germany) and then each was transferred to a microcentrifuge vial with 0.2 µm glass beads (Sigma Aldrich, Darmstadt, Germany). Vials were shock-frozen in liquid nitrogen and kept at −20 • C (for DNA) or −80 • C (for RNA) until further processing.

Metabarcoding: rRNA Analysis and Taxonomic Assignment
In total, 20 integrated surface plankton samples were sequenced on a 454 FLX+ sequencer. Raw reads were deposited in the European Nucleotide Archive (ENA) via the GFBio portal under Accession No. PRJEB32397 (MSM21-3) and PRJEB37605 (HE-431).
Methodological errors in 454-sequencing and OTU clustering may lead to an over-or underestimation of diversity [44], but recent methodological advances in amplicon sequence denoising promise the capability to reconstruct the true biologic sequences of amplicon surveys [45]. These amplicon sequence variants (ASV) are thus expected to replace the common OTUs as taxa proxies because ASVs resolve single nucleotide differences and provide constant labels across studies [46]. The R package Dada2 [47] was used in this study to obtain such ASVs. Cutadapt [48] was used to remove the leading forward primer from the reads. Reads without detected forward primer sequences were discarded. These pre-processed sequences were further processed with R version 3.5.2 [49] using functions of the Dada2 package (version 1.11.0). Sequences were truncated to a length of 400 bp. Reads shorter than 400 bp, with the expected base error above 5-and those containing any base with a Phred quality score ≤10-were discarded. ASVs were constructed, cleaned of chimeras and taxonomically annotated following the guidelines of the Dada2 online tutorial (https://benjjneb.github.io/dada2/tutorial.html); settings were adapted to the recommendations for 454 data. The Parc version of the Silva 132 LSU dataset was used as reference for taxonomic assignment [50] and the taxonomic labels were adjusted as described in http://blog.mothur.org/2018/01/10/SILVA-v132-reference-files/.
The ASVs annotated as alveolates or stramenopiles were classified more accurately by PhyloAssigner version 6.166 [51] with a phylogenetic placement onto reference trees based on 18S/28S concatenated alignments (doi:10.5281/zenodo.3741720), according to Elferink et al. [27] Statistical analyses were run in the R environment version 3.0.2 [49] with the package 'vegan' [52] and 'plyr' [53]. The sequence numbers per sample were not subsampled to the smallest sequence number of any sample to avoid an increase in the number of potential false positives [54] by comparison to the relative abundance of counts in samples. Rarefaction and ASV (species) accumulation curves of each station were calculated in vegan using the function 'rarecurve' and created after quality control and clustering to represent the local species richness and to serve as a reference for the diversity captured in a sample. The final ASV table was Hellinger-transformed (double-square root transformation) as recommended for clustering or ordination of species abundance data [55,56] with the 'decostand' function. A principal coordinate analysis (PCoA) was applied to the Bray-Curtis dissimilarities matrix of ASVs. Environmental factors were Hellinger-transformed to normalize the dataset and standardized by the factors' maximum values. To compare environmental similarities among samples, principle component analysis (PCA) with calculated Euclidean distances was carried out (log10 transformation for normalization). The package 'treemap' was used to illustrate the taxonomic hierarchical structure among regions.
Transcriptomes were trimmed (length > 75 bp), 'De Novo' assembled in CLC Genomic Workbench 11 contigs blasted (blastn) against the MMETSP database [57] with BLAST to obtain taxonomic information. Raw reads from the obtained metatranscriptomes were mapped to the consensus sequence to obtain read counts per contig. Subsequently, read counts per contig were normalized with the R package 'limma' by applying the normalizeQuantile function [58]. Functional annotation of the contigs was conducted by Trinotate v3.0.2 and predicted from the top blastx hits (e-value ≤ 10 −9 ).
Variance partition analyses were done in the R environment with the package 'variancePartition' and 'DESeq.' The functional responses of temperature, salinity, total nitrogen, phosphate, silicate, chlorophyll-a, dinoflagellate abundance and diatom abundance were analyzed. From the variance partition model, the estimates for all parameters were used to calculate from which environmental parameter most influenced the respective Pfam. This was the case when the variance explained by a particular parameter (e.g., temperature) exceeded the variance explained by all other parameters considered, including the residual variance. The significance of one Pfam for a region was identified with permutation (times = 1000, function permatfull from 'vegan' package) and filtering (p value = 0.05/(17926*20 [stations]). To obtain the respective Pfam domains for dinoflagellates and diatoms, two separate variance partition models were conducted (each with the environmental parameter identified above), because of the strong negative correlation between groups in our data.
Further, KEGG Orthologies (KOs) were assigned to Pfam domains and the KO abundances counted because one Pfam domain can result in different KOs and different Pfam domains can result in the same KO. For visualization of the data, results were plotted for each environmental parameter separately with the web application FuncTree2 [59].

Results and Discussion
This study links the biodiversity and associated functional traits of microeukaryotic communities in arctic and subarctic/temperate waters, with emphasis on dinoflagellates and diatoms. The two taxonomic groups encompass different life strategies and niches within the marine environment, with diatoms being functionally similar, i.e., all silicifying and primarily photoautotrophic, whereas dinoflagellates comprise highly diverse nutritional modes (autotrophic, mixotrophic and heterotrophic) and lifestyles (free-living, symbiotic or parasitic). To improve our understanding of the interplay between biodiversity and functional diversity of a microeukaryotic community relative to environmental influences on the communities, this study distinguishes functional processes that are affected by changing environmental conditions from those functions which are more affected by the species identity. This was accomplished by variance partitioning analysis, i.e., aligning Pfam domain abundances with measured environmental parameters and species abundances. First the community is taxonomically and functionally described to then identify the influence of environmental parameters, i.e., temperature, salinity and inorganic macronutrients, on the diversity and functional diversity of microeukaryotic communities. Second, we identify and discuss the functional processes significantly influenced by these environmental parameters, and finally we then elucidate functional processes significantly influenced within the diatoms and dinoflagellates with their contrasting life strategies and niches.

Taxonomic and Functional Description of the Microplankton Communities
Quality processing of rRNA sequencing reads and the removal of singletons, doubletons and potential fungal and metazoan sequences (defined by dada2) led to an average of 11,714 reads (range: 4565 to 36,266, Table S1, an average of 279 ASVs per sample (range: 29 to 279). Among the total 506 ASVs, 22 ASVs were present in all four regions, and the number of unique ASVs varied across the sampling regions (11 for Sweden to 57 for Iceland ASVs) ( Figure S1). The assembly of the metatranscriptome resulted in 4,217,269 contigs, from which 970,582 contigs (23%) had a taxonomic assignment (MMETSP); these contigs were annotated based on the Pfam database of protein families, which resulted in 17,926 Pfam domains with taxonomic and functional assignment. The Pfam domains represent the conserved functional part of a protein; the metatranscriptome Pfam annotations were used as a proxy to further analyze and compare functional biodiversity based on expressed cellular processes.
The taxonomic assignment of ASVs (rRNA) and contigs (mRNA) resulted in a similar overall distribution of the supergroups alveolates and stramenopiles ( Figure S2), but also reached high similarity at a lower taxonomic level (Supplementary ASV Table). This corresponding pattern of taxonomic diversity based on rRNA and mRNA sequence comparison allowed a valid separation of functions into different taxonomic groups based on Pfam annotations. Among the alveolates, 85% ASV and 92% contig reads were assigned to dinoflagellates, and among the stramenopile, 88% ASV and 97% contig reads were assigned to diatoms.
The communities of Icelandic and Swedish coastal water samples were dominated by alveolates, especially dinoflagellates, as revealed by their rRNA and mRNA sequences ( Figures S2 and S3). This indicates a seasonal succession from diatoms, which still dominated the microeukaryotic communities in Greenland and Norway, to dinoflagellate-dominated communities. Swedish rRNA samples were dominated by Lingulodinium (Gonyaulacales) and Protoperidinium (Peridinales) (~48% of assigned reads) and mRNA samples matched mainly Lingulodinium (65% of assigned reads). Protoperidinium was not detected based on the mRNA, but~4% of the mRNA reads were assigned to its higher order Peridiniales. Among rRNA samples from Iceland coastal water, Dinophysis (Dinophysiales) and Tripos (Gonyaulacales) (~39% of assigned reads) were identified and among mRNA samples, Dinophysis and Neoceratium (Gonyaulacales) were identified (~61% of assigned reads), but Tripos was not. The observed discrepancies for Protoperidinium and Tripos may be explained by the lack of references to these groups in the respective databases.
The communities in Greenlandic and Norwegian water samples were dominated by stramenopiles, as revealed by their rRNA and mRNA sequences. Greenlandic rRNA and mRNA samples mainly consisted of Detonula, Thalassiosira (both Thalassiosirales) and Chaetoceros (Chaetocerales) (63% and 74% of assigned rRNA and mRNA reads, respectively) and the Norwegian rRNA and mRNA samples dominated by Corethron (Corethrales) and Chaetoceros (Chaetocerales) genera (79% and 64% of assigned reads). Species of the genus Chaetoceros were also present in Swedish coastal rRNA water samples (21%) but only accounted for 0.5% of the mRNA signal. This discrepancy could be the result of a PCR bias towards this species group during rRNA amplification or, more likely, may be the result of the insufficient representation of environmentally significant mRNA sequences in the reference database. Overall, the taxonomic assignment deduced from the rRNA and the mRNA was similar. Discrepancies between the taxonomic resolution of the rRNA and mRNA sequences can be expected due to the differences in the completeness of the corresponding reference databases, namely Silva 132 LSU for rRNA [50] and MMETSP for mRNA [57]. For example, some species may be completely absent from one of the databases or only relatives may be represented. Alternatively, different and changing taxonomic affiliations can complicate a clear classification, e.g., the change from Ceratium to Tripos.

Regional Environmental Conditions and Differences in Taxonomic and Functional Biodiversity
Principle component analysis of environmental parameters ( Figure S4) revealed differences between the regions with the most distinguishable environmental conditions between Swedish and Greenland/Norway coastal waters. The clustering of the samples represents the water bodies characterized by the arctic, subarctic or temperate region. Surface water temperatures ( Figure S5) were highest in Swedish coastal waters (15-18 • C) and coldest in Greenland (3-7 • C). Mean surface salinity was highest for Iceland and Norway (~34.5) and lowest in Sweden (~24.5) coastal water due to the influence of Baltic Sea water masses. In Greenland coastal water, salinity was generally lower (between 31 and 32) due to arctic freshwater input of melting glaciers and sea-ice but reached levels of 34 at two stations. Generally, low nitrite, nitrate and ammonium concentrations (~0.01-1.16 µM) were measured at all stations and regions; however, overall nitrite concentrations were highest at two Swedish stations (0.3-0.45 µg L −1 ) and total nitrogen concentration was highest in Norway (1.3-4.3 µg L −1 ). Phosphate concentrations tracked silicate concentrations but were lower overall than silicate concentrations (≤0.3 µg L −1 ).
Community composition based on mRNA and rRNA reflected the ecosystem from which the samples originated, namely the arctic coastal waters of Greenland, the subarctic coastal waters of Norway and Iceland and the temperate Swedish coastal waters ( Figure 1 and Figure S3). Sampling took place in more or less the same season (27 July-6 September); however, differences in the plankton communities are expected due to successional patterns, i.e., summer plankton succession is further advanced in temperate regions compared to subarctic or arctic regions during the same period. The successional differences between the regions, along with regional differences in prevailing environmental conditions, are well reflected in both taxonomic and functional diversity ( Figure 2). The taxonomic diversity shows a clear partitioning between the four regions, with each region located in a separate quadrant of the Principle Coordinate Analysis (PCoA). Accordingly, like the ASV PCoA (Figure 2a), the Pfam domain composition and abundance deduced from mRNA sequences showed partitioning between the four regions ( Figure 2b), capturing 37.5% of the variance. However, the samples from the subarctic coastal waters of Norway and Iceland seem to be more similar on a functional level than on a taxonomic level. This is consistent with the results of the water chemistry analysis, which revealed a higher similarity between Iceland and Norway's coastal water samples ( Figures S4 and S5) than between the other two regions. Given the fact that the plankton communities from Norwegian and Icelandic samples are dominated by different taxonomic groups, i.e., diatoms and dinoflagellates, respectively, their functional similarity seems largely influenced by water chemistry and not by taxonomic identity. Because the two groups are dominant among Greenlandic and Norwegian coastal water samples (i.e., diatoms) or Sweden and Icelandic coastal water samples (i.e., dinoflagellates) functional differences should be noticeable since the groups have generally different roles within the ecosystem [13].

Metatranscriptomic analysis-Functional Biodiversity and Functional Processes Deduced from Metatranscriptomics
KEGG Orthologies (KOs) were assigned to Pfam domains based on the presence of a respective protein domain within a KO gene representative. Although this approach can lead to the loss of information when Pfam domains cannot be matched to KO identifiers, it nonetheless offered the benefit of providing a functional classification of 22% of expressed genes. In sum, from a total of 17,926 Pfam domains, 8135 could be mapped and classified with the KO identifier annotations and subsequently plotted with the FuncTree2 tool for detailed interpretation.
The abundance of all Pfam domains in Swedish, Icelandic, Greenlandic and Norwegian coastal water samples clustered well with the geographic origin of the sample (Figure 2b), e.g., Icelandic samples close to the Norwegian samples. Pfam domain occurrences were plotted as presence/absence data with FuncTree2, which revealed that the metatranscriptomes of all four sampling regions are functionally similar ( Figure S6). Hence, based on their functional potential, the communities do not differ greatly, but they differ based on their gene expression profiles (Figure 2b). This implies that it is not the functional diversity stored in a community transcriptome that explains cellular processes associated with certain environmental settings, it is how this functional diversity is manifest, i.e., the genes expressed. The question remains as to what the reasons for this different expression of functional processes may be. For example, it could be the biodiversity within a community or species interactions (e.g., protection against grazing or the secretion of allelochemical molecules). Various environmental parameters influence the transcriptome, as species need to adapt to environmental circumstances.
As environmental parameters often correlate and interact, it is generally difficult to pinpoint one driver of a particular effect. Generally, the abundance of transcripts/sequence reads of a gene reflects the sum of all biotic and abiotic factors acting at the moment of sampling. Also, other stochastic and environmentally independent factors can influence the abundance of a gene in a community metatranscriptome analysis, such as e.g., grazing, parasitism and population size. To gain insight into how certain environmental settings alter functional diversity, we analyzed which parts of the variance of the Pfam domain abundances can be explained by broadly defined biotic and abiotic parameters. In our study, many environmental parameters were correlated ( Figure S7); thus, we only associated Pfam domains to a certain environmental parameter if the variance explained by the respective parameter exceeded the sum of variances explained by the other parameters (including the residual unexplained variance).
With this rather conservative approach, we identified 801 Pfam domains out of a total of 17,926 domains that can be explained by the environmental variation, and 1533 Pfam domains that explain

Metatranscriptomic Analysis-Functional Biodiversity and Functional Processes Deduced from Metatranscriptomics
KEGG Orthologies (KOs) were assigned to Pfam domains based on the presence of a respective protein domain within a KO gene representative. Although this approach can lead to the loss of information when Pfam domains cannot be matched to KO identifiers, it nonetheless offered the benefit of providing a functional classification of 22% of expressed genes. In sum, from a total of 17,926 Pfam domains, 8135 could be mapped and classified with the KO identifier annotations and subsequently plotted with the FuncTree2 tool for detailed interpretation.
The abundance of all Pfam domains in Swedish, Icelandic, Greenlandic and Norwegian coastal water samples clustered well with the geographic origin of the sample (Figure 2b), e.g., Icelandic samples close to the Norwegian samples. Pfam domain occurrences were plotted as presence/absence data with FuncTree2, which revealed that the metatranscriptomes of all four sampling regions are functionally similar ( Figure S6). Hence, based on their functional potential, the communities do not differ greatly, but they differ based on their gene expression profiles (Figure 2b). This implies that it is not the functional diversity stored in a community transcriptome that explains cellular processes associated with certain environmental settings, it is how this functional diversity is manifest, i.e., the genes expressed. The question remains as to what the reasons for this different expression of functional processes may be. For example, it could be the biodiversity within a community or species interactions (e.g., protection against grazing or the secretion of allelochemical molecules). Various environmental parameters influence the transcriptome, as species need to adapt to environmental circumstances.
As environmental parameters often correlate and interact, it is generally difficult to pinpoint one driver of a particular effect. Generally, the abundance of transcripts/sequence reads of a gene reflects the sum of all biotic and abiotic factors acting at the moment of sampling. Also, other stochastic and environmentally independent factors can influence the abundance of a gene in a community metatranscriptome analysis, such as e.g., grazing, parasitism and population size. To gain insight into how certain environmental settings alter functional diversity, we analyzed which parts of the variance of the Pfam domain abundances can be explained by broadly defined biotic and abiotic parameters. In our study, many environmental parameters were correlated ( Figure S7); thus, we only associated Pfam domains to a certain environmental parameter if the variance explained by the respective parameter exceeded the sum of variances explained by the other parameters (including the residual unexplained variance).
With this rather conservative approach, we identified 801 Pfam domains out of a total of 17,926 domains that can be explained by the environmental variation, and 1533 Pfam domains that explain variation among the community as determined based on the percentage of dinoflagellates and diatoms in the samples. The variation partitioning analysis with dinoflagellates revealed that 1497 Pfam domains are potentially associated with changes in dinoflagellate abundance independent of environmental influences. The same analysis with diatoms revealed that 838 Pfam domains are associated with changes in diatom abundance and are independent of environmental influences. Of these dinoflagellate and diatom Pfam domains, 802 domains are shared (Figure 3). In total 1533 Pfam domains account for dinoflagellates, diatoms or both of these classes. variation among the community as determined based on the percentage of dinoflagellates and diatoms in the samples. The variation partitioning analysis with dinoflagellates revealed that 1497 Pfam domains are potentially associated with changes in dinoflagellate abundance independent of environmental influences. The same analysis with diatoms revealed that 838 Pfam domains are associated with changes in diatom abundance and are independent of environmental influences. Of these dinoflagellate and diatom Pfam domains, 802 domains are shared (Figure 3). In total 1533 Pfam domains account for dinoflagellates, diatoms or both of these classes. Figure 3. Overview of the Pfam domain distribution as a Venn diagram: numbers represent Pfam domains for which variability is most likely related to changes in environmental parameters (chlorophyll-a, phosphate, salinity, silicate or temperature), relative dinoflagellate abundance, relative diatom abundance or both relative dinoflagellate and diatom abundance.

Functional Response to Environmental Factors
In general, all processes involved in functional pathways play a vital role in species and community interactions. Some functional pathways may be expressed from all species and thus form a general signal, whereas others may be affected by environmental parameters. Here, several Pfam domains (no. = 801) for which the majority of the variance could be attributed to certain environmental factors (temperature, salinity, silicate, phosphate or chlorophyll-a) were identified. Most Pfam domains were associated with temperature and salinity; hence, corresponding functions are discussed with an emphasis on these two parameters. The KO terms were assigned to the 801 Pfam domains, but given the situation that not all Pfam domains had a corresponding KO term and some Pfam domains corresponded to multiple KO terms, the environmental parameters silicate, phosphate and chlorophyll-a contributed little information ( Figure S8) and are not further discussed here (e.g., silicate had only one KO term). A full list of associated Pfam domains can be viewed in the Supplemental Material. The influences of temperature and salinity on functional processes are highlighted and discussed in the following paragraphs.

Temperature
The shifts of species distributions towards polar regions due to an increase in temperature at lower latitudes [8] lead to adaptation of local species or the introduction of new species, either of which can have cascading effects in a community. Temperature effects on marine plankton are often described in the context of two fundamental concepts, namely, the metabolic theory of ecology [60,61] and the hypothesis of temperature-dependent physiology [62,63]. Although these two concepts are often studied separately, they are directly linked to each other by the influence of temperature on chemical reactions and metabolic rates, i.e., the sum of all enzyme-driven chemical reactions within cells [64]. Pfam domains for which variability is most likely related to changes in environmental parameters (chlorophyll-a, phosphate, salinity, silicate or temperature), relative dinoflagellate abundance, relative diatom abundance or both relative dinoflagellate and diatom abundance.

Functional Response to Environmental Factors
In general, all processes involved in functional pathways play a vital role in species and community interactions. Some functional pathways may be expressed from all species and thus form a general signal, whereas others may be affected by environmental parameters. Here, several Pfam domains (no. = 801) for which the majority of the variance could be attributed to certain environmental factors (temperature, salinity, silicate, phosphate or chlorophyll-a) were identified. Most Pfam domains were associated with temperature and salinity; hence, corresponding functions are discussed with an emphasis on these two parameters. The KO terms were assigned to the 801 Pfam domains, but given the situation that not all Pfam domains had a corresponding KO term and some Pfam domains corresponded to multiple KO terms, the environmental parameters silicate, phosphate and chlorophyll-a contributed little information ( Figure S8) and are not further discussed here (e.g., silicate had only one KO term). A full list of associated Pfam domains can be viewed in the Supplemental Material. The influences of temperature and salinity on functional processes are highlighted and discussed in the following paragraphs.

Temperature
The shifts of species distributions towards polar regions due to an increase in temperature at lower latitudes [8] lead to adaptation of local species or the introduction of new species, either of which can have cascading effects in a community. Temperature effects on marine plankton are often described in the context of two fundamental concepts, namely, the metabolic theory of ecology [60,61] and the hypothesis of temperature-dependent physiology [62,63]. Although these two concepts are often studied separately, they are directly linked to each other by the influence of temperature on chemical reactions and metabolic rates, i.e., the sum of all enzyme-driven chemical reactions within cells [64].
Increases in temperature translate into increased rates of cellular and metabolic processes, most of which require ATP. The increased cellular demands result in an increased production of ATP, which is derived from respiration and photosynthesis [61,64,65]. Both processes are affected differently by temperature increases, with respiration possessing a relatively higher activation energy and hence respiration rates should increase more rapidly compared with photosynthesis [61,64,65]. Additionally, temperature is a key determinant of phytoplankton stoichiometry [63,64,66,67], and changes in temperature have been shown to alter the optimal allocation to C, N and P pools in phytoplankton via phenotypic plasticity [63,66,67]. Temperature has been shown to positively correlate with C:P and N:P ratios, but not with C:N. These shifts occur because fewer P-rich ribosomes are required at warmer temperatures due to their increased use efficiency [62,63,67,68]. However, elevated C and N content and a decrease of P have also been attributed to observed stoichiometric changes in warm-acclimated phytoplankton cells [69]. Hence, more than one process potentially causes temperature-dependent shifts in elemental ratios.
Consistent with a general increase in metabolic activity with temperature, and an associated higher demand for ATP, we found that Pfam domains assignable to the KEGG group carbohydrate metabolism (especially the citrate cycle, glycolysis/gluconeogenesis and pyruvate metabolism) to be positively correlated with temperature ( Figure 4a). We also found increased abundances of Pfam domains involved in oxidative phosphorylation. Additionally, we found a positive correlation within the KEGG group metabolism of other amino acids and herein the pathways of glutathione and taurine/hypotaurine metabolism. Glutathione and taurine control the redox gradient across the mitochondrial inner-membrane and act as a pH buffer in the mitochondrial matrix, thereby ensuring performance under high loads of electrons [70]. Further, we found Pfam domains attributable to sulfur metabolism within the KEGG group energy metabolism that positively correlated with temperature. Hence, temperature may not only affect the stoichiometry of C:N:P but also cellular sulfur contents, which can have cascading effects through the marine sulfur cycle.
We also identified patterns that are potentially linked to changes in cellular stoichiometry as proposed by the temperature-dependent physiology hypothesis [62,63]. Consistent with Toseland et al. [67], we found that temperature negatively correlates with the abundance of Pfam domains in the KEGG group translation (ribosome, and biogenesis of ribosomes) and nucleotide metabolism (Figure 4a). Both processes are functionally interlinked, as most substrates from purine and pyrimidine metabolism are used for rRNA synthesis [67,71]. These patterns confirm the molecular explanation for observed declines in cellular P contents with increasing temperature [63,67,68]. Furthermore, parts of our data also include cellular processes that can be attributed to increases in C and N content. These processes include a positive association between temperature and Pfam domain abundances involved in lipid metabolism (one of the major cellular sinks for C allocation) and amino acid metabolism (cyanoamino acids, which represents an N-sink). With increasing temperature and possibly more metabolic activity, it can be further assumed that pathways involved in the growth and turnover of microeukaryotic cells are stimulated. Positive correlations were observed among the KEGG category cellular processes, especially in the group cell growth and death (Figure 4a).
In sum, our results at the genetic level demonstrate higher metabolic activity and respiration with increasing temperatures, as predicted by the metabolic theory of ecology [60,61]. Although we did not measure the C:N:P ratio, our genetic data are consistent with other published results mentioned here, i.e., metabolic processes related to C-, N-and P-allocation change along the temperature gradient (Figure 4a), largely independent of the underlying broad taxonomic composition. This is consistent with what Toseland et al. [67] found at the genetic level and Yvon-Durocher et al. [68] at the community level, namely that there is a highly conserved response of elemental stoichiometry to temperature across multiple spatial, temporal and organizational scales.
In addition to the above-discussed mechanisms, we also observed further indications for higher cellular activity and an associated turnover of proteins due to a positive correlation of Pfam domains involved in the KEGG category genetic information processing, in particular within the groups folding, sorting and degradation and replication and repair. In the category environmental information processing, only a few positive and negative correlations for signal transduction pathways were observed, suggesting that signal transduction may be less influenced by temperature. folding, sorting and degradation and replication and repair. In the category environmental information processing, only a few positive and negative correlations for signal transduction pathways were observed, suggesting that signal transduction may be less influenced by temperature.

Salinity
Pfam domains (595) for which most of the variance could be explained by changes in salinity showed rather diverse effects on metabolic processes, with negative and positive associations in various categories (Figure 4b). Salinity influences an organism's capacity for ion homeostasis. This is associated with salinity having a strong effect on community composition, i.e., indicating that species-specific responses determine fitness variation among community members. We identified both negative and positive correlations between salinity and Pfam domain abundances among the KEGG groups amino acid metabolism, nucleotide metabolism and carbohydrate metabolism ( Figure  4b). A more unidirectional negative correlation with salinity was observed for Pfam domains assigned to secondary metabolisms, i.e., biosynthesis of other secondary metabolites (e.g., stilbenoid, diarylheptanoid and gingerol biosynthesis) and metabolism of terpenoids and polyketides (e.g., diterpenoid biosynthesis and biosynthesis of ansamycins). Such effects are well documented in the literature for several plankton groups, e.g., Prymnesium [72,73], diatoms [74] and dinoflagellates [75]. Cellular concentrations of many secondary metabolites change at lower salinities, although the direct physiological mechanisms and metabolic linkages remain unknown. Another unidirectional response was observed for the KEGG category environmental information processing at lower salinities, which seems to exert a direct effect on the expression of several signal transduction processes (Figure 4b). This higher expression of signal transduction processes may be a factor underlying the variable expression of metabolic processes in low salinity regimes and further

Salinity
Pfam domains (595) for which most of the variance could be explained by changes in salinity showed rather diverse effects on metabolic processes, with negative and positive associations in various categories (Figure 4b). Salinity influences an organism's capacity for ion homeostasis. This is associated with salinity having a strong effect on community composition, i.e., indicating that species-specific responses determine fitness variation among community members. We identified both negative and positive correlations between salinity and Pfam domain abundances among the KEGG groups amino acid metabolism, nucleotide metabolism and carbohydrate metabolism (Figure 4b). A more unidirectional negative correlation with salinity was observed for Pfam domains assigned to secondary metabolisms, i.e., biosynthesis of other secondary metabolites (e.g., stilbenoid, diarylheptanoid and gingerol biosynthesis) and metabolism of terpenoids and polyketides (e.g., diterpenoid biosynthesis and biosynthesis of ansamycins). Such effects are well documented in the literature for several plankton groups, e.g., Prymnesium [72,73], diatoms [74] and dinoflagellates [75]. Cellular concentrations of many secondary metabolites change at lower salinities, although the direct physiological mechanisms and metabolic linkages remain unknown. Another unidirectional response was observed for the KEGG category environmental information processing at lower salinities, which seems to exert a direct effect on the expression of several signal transduction processes (Figure 4b). This higher expression of signal transduction processes may be a factor underlying the variable expression of metabolic processes in low salinity regimes and further indicates that salinity has an overall strong effect (in comparison to temperature) on the community (also observed among the taxonomic diversity).

Functional Community Response of Diatoms and Dinoflagellates
Diatoms and dinoflagellates have different life strategies and niches within the marine environment. Diatoms are photosynthetic autotrophs (except for some obligate heterotrophic species of the genus Nitzschia [76,77]) with a strong dependency on inorganic nutrients, whereas dinoflagellates can have different trophic niches and can be more variable in their trophic roles. In this study, the different cellular community functions of diatoms and dinoflagellates were assessed by using the abundance of Pfam domains as a proxy. The KO terms were assigned to Pfam domains for which most of the variance could be attributed to changes in the relative abundance of diatoms and dinoflagellates and were plotted with FuncTree2 ( Figure 5). As a result of the strong negative correlation of dinoflagellate and diatom abundance in our data set ( Figure S9), we applied variance partitioning to each separately and combined the result afterward (see Material and Methods). This means, that for most of the data, most of the variance could either be explained by an increase in dinoflagellate abundance and a decrease in diatom abundance or vice versa ( Figure S9). Any described positive association with diatom abundance hereafter refers to processes that increase with diatom abundance and decrease with dinoflagellate abundance. Therefore, only the positive correlation of the Pfam domain variability is plotted ( Figure 5). Almost half of the Pfam domains, for which most of the variance was explained by shifting taxa abundances, were only associated with changes in dinoflagellate abundance. Hence, some Pfam domain abundances fit well with the dinoflagellate abundance alone and were less influenced by the other tested parameters.
In general, cellular functions and metabolic pathways associated with diatoms and dinoflagellates abundance relate to the same processes, but they are differentially expressed ( Figure 5). Both groups are phylogenetically and evolutionary distinct [78], which has resulted (among other things) in differences in gene structure and regulation. The distinct genetic diversity between the two groups together with interspecific interactions may additionally affect the gene regulation as response to e.g., protection against grazing or the secretion of allelochemical active molecules (e.g., toxins). Differentially evolved regulations of cellular and physiological functions may promote different lifestyles. Similar environmental signals lead to taxa-specific differential regulation of the same pathways, hence creating different cellular responses and trade-offs. Microorganisms 2020, 8, x FOR PEER REVIEW 13 of 22 , biosynthesis of other secondary metabolites (boosm) and xenobiotics biodegradation and metabolism (xbam); genetic information processing (2): replication and repair (rar), folding, sorting and degradation (fsad), transcription (tc) and translation (tl); environmental information processing (3): signaling molecules and interaction (smai), signal transduction (str) and membrane transport (mrp); and cellular processes (4): cell motility (cm), cellular community-prokaryote (ccp)/eukaryotes (cce), cell growth and death (cgd) and transport and catabolism (tac). KO abundance was counted and plotted (size = relative abundance of KOs in each plot). The Venn diagram shows the overlapping KOs between diatoms and dinoflagellates. Some KOs mainly appear in dinoflagellates (e.g., environmental information processing), whereas for genetic information processing some KOs can be found only in diatoms.
In general, cellular functions and metabolic pathways associated with diatoms and dinoflagellates abundance relate to the same processes, but they are differentially expressed ( Figure  5). Both groups are phylogenetically and evolutionary distinct [78], which has resulted (among other things) in differences in gene structure and regulation. The distinct genetic diversity between the two groups together with interspecific interactions may additionally affect the gene regulation as response to e.g., protection against grazing or the secretion of allelochemical active molecules (e.g., toxins). Differentially evolved regulations of cellular and physiological functions may promote different lifestyles. Similar environmental signals lead to taxa-specific differential regulation of the same pathways, hence creating different cellular responses and trade-offs. replication and repair (rar), folding, sorting and degradation (fsad), transcription (tc) and translation (tl); environmental information processing (3): signaling molecules and interaction (smai), signal transduction (str) and membrane transport (mrp); and cellular processes (4): cell motility (cm), cellular community-prokaryote (ccp)/eukaryotes (cce), cell growth and death (cgd) and transport and catabolism (tac). KO abundance was counted and plotted (size = relative abundance of KOs in each plot). The Venn diagram shows the overlapping KOs between diatoms and dinoflagellates. Some KOs mainly appear in dinoflagellates (e.g., environmental information processing), whereas for genetic information processing some KOs can be found only in diatoms.

The Metabolic Associations with Diatoms and Dinoflagellates
In the biologic KEGG category metabolism, KOs of dinoflagellates and diatoms were positively associated, especially with pathways of carbohydrate metabolism (e.g., amino sugar and nucleotide sugar metabolism, ascorbate and aldarate metabolism, the citrate cycle and pentose phosphate pathway), energy metabolism (e.g., carbon fixation in photosynthetic organisms, methane metabolism, photosynthesis), nucleotide metabolism (purine and pyrimidine metabolism) and amino acid metabolism (e.g., alanine, aspartate and glutamate metabolism, arginine and proline metabolism, cysteine and methionine metabolism, glycine, serine and threonine metabolism and valine leucine and isoleucine degradation) ( Figure 5). Upregulated genes involved in carbohydrate metabolism are associated with the dinoflagellate Prorocentrum donghaiense and with diatoms [25]. Furthermore, positive correlations were observed for the dinoflagellate abundance with the catabolic process beta-oxidation of lipid metabolism. Diatom abundance correlated positively with fatty acid synthesis (KEGG pathway fatty acid biosynthesis, elongation), which is an anabolic process. This indicates that dinoflagellates preferably use the catabolic and diatoms, as autotrophs, the anabolic part of fatty acid metabolism. Additionally, diatoms can accumulate lipids, which are more lightweight than other chemical constituents and thus regulate their buoyancy [79]. This extra buoyancy (compared with other organisms) may counteract the effect of their relatively heavy silica cell walls.
Among the metabolic processes, diatom abundance correlated positively with lysine metabolism (in KEGG group amino acid metabolism), especially lysine degradation, with fatty acid synthesis (fatty acid biosynthesis, elongation) and with the C5 isoprenoid biosynthesis, mevalonate (MVA) pathway. The degradation of lysine delivers Acetyl-CoA, a universal metabolic precursor for several pathways, including fatty acid biosynthesis and C5 isoprenoid biosynthesis, MVA pathway. Lysine was shown to be actively taken up from media by the model diatom species, Phaeodactylum tricornutum to complement N and C demand [80,81]. The ability to take up lysine may not be present in all diatom species and the contribution and availability of lysine to overall C and N demand remains to be determined. However, the positive association of diatoms with lysine degradation observed here indicates the physiological importance of this metabolic pathway for diatoms in their natural environment.
The positive association of diatoms with fatty acid synthesis and with the C5 isoprenoid biosynthesis, the MVA pathway can have several, non-mutually exclusive reasons. Both, the fatty acid synthesis and the C5 isoprenoid biosynthesis, MVA pathway are the starting points for a rich suite of metabolites, including carotenoids, sterols, isoprenoids and hydrocarbons. The presence of these compounds in diatoms has put them on the research agenda as a source of natural products and as a proxy for climate reconstruction [79,82]. Our result, therefore, stresses the ecological relevance of these compounds in diatoms and hence the importance of diatoms as a producer of these compounds for other trophic levels. Carotenoids, for example, are known to be produced in large quantities in diatoms [83] and serve several functions at higher trophic levels, e.g., as antioxidants due to their effects on the immune system and involvement in vision or body coloration [84]. Diatoms are further known to contribute a major proportion of marine volatile isoprene, which can have regional effects on oceanic climate [85,86]. The positive association of diatoms with the metabolic pathway genes of these compounds in our study is not surprising but has not yet been described before using metatranscriptomics data from field samples. Our results show that genes of metabolic pathways, for which diatoms have already been described as dominant producers, can be traced back to genetic activity patterns in field samples. This illustrates the ecological relevance of these metabolic pathways and compounds, as they are linked to the success of diatoms. Accordingly, these pathways are relevant for other trophic levels and ecosystem functions via diatoms.
For dinoflagellates, the positive association with nitrogen metabolism (within the KEGG category energy metabolism) was particularly notable among the metabolic processes. This group contained genes coding for nitrate/nitrite transporters, nitrate reductase, ammonium assimilating enzymes (glutamate dehydrogenase, glutamate synthase) and genes coding for cyanate lyases. These results correspond well with the results of a metatranscriptome study by Zhuang et al. [87], who identified the same nitrogen acquisition genes associated with a dinoflagellate bloom (Alexandrium fundyense, now A. catenella [88]). Particularly striking is the congruent identification of the positive association of the cyanate lyase gene with the abundance of dinoflagellates, which provides further evidence that the use of cyanate as an organic N-source is an ecologically important trait in dinoflagellates [87]. Dinoflagellates mainly dominate plankton communities at times when macronutrients are limiting. A positive association of dinoflagellates with N-acquisition genes was linked to growth promotion, with the exploitation of various N-sources (DIN, DON) likely conferring a competitive advantage [87].

Other Functional Associations with Diatoms and Dinoflagellates
In the biologic KEGG category cellular processes, among cell growth and death (i.e., cell cycle, meiosis), positive correlations were shared with dinoflagellates and diatoms, but most other Pfam domains in this category were positively correlated with dinoflagellates. As expected, since planktonic diatoms are not mobile, no KO terms of cell motility processes correlated positively with this group. The overall positive correlations of dinoflagellates with several cellular processes are most likely related to the broad functional diversity of this group. Transport and catabolism processes were positively correlated with dinoflagellates, especially to autophagy and endocytosis. These two processes are closely linked to vesicular transport. For the biologic KEGG category genetic information processing (folding, sorting and degradation), dinoflagellates highly correlated to SNARE interactions in vesicular transport (SNARE: SNP-Receptor, Soluble N-ethylmaleimide sensitive fusion attachment protein receptor). Pathways related to the uptake and secretion of substances (intracellular trafficking, secretion and vesicular transport; KOG-Category 'U') were also identified in a metatranscriptomics study by Wohlrab et al. [23] as particularly important for the persistence of dinoflagellate communities. Also, important are processes associated with the excretion or uptake of substances that can have profound effects on community members and biogeochemical processes (e.g., the secretion of allelochemically active molecules).
The biologic KEGG category genetic information processing was split into two groups: a group of pathways with more KOs positively correlating with the variation in diatoms compared to dinoflagellates and the converse for the other group. Diatom abundance correlated in pathways of replication and repair, as well as folding, sorting and degradation, probably reflecting their characteristic behavior as r-strategists with high growth rates [89]. Dinoflagellate abundance correlated with pathways of transcription and (to a lesser degree) translation. Also, dinoflagellates were positively associated with the biologic category of environmental information processing and especially signal transduction pathways. These positive associations reflect the transcriptomic characteristics of dinoflagellates, which possess genes coding for protein kinases, EF-hand domain containing proteins (both involved in signal transduction) and RNA recognition factors (involved in transcription) [90].
Compared to other biologic categories and pathways, the differences between the dinoflagellates and diatoms were most evident within the environmental information processing category. The broader diversity of lifestyles of dinoflagellates compared with diatoms strongly reflects biotic interactions [91,92]. Signal transduction processes are thereby a prerequisite for the interaction between eukaryotic microalgae and other components of the marine food web, thereby affecting energy flow, food web dynamics, and community structure [93]. Dinoflagellates, and in particular toxigenic species, can use different (species-specific) forms of biologically active secondary metabolites, which, once excreted, interact with cell membranes and affect signal transduction at receptor sites of target organisms [93]. Species interactions, therefore, have played a central role in the evolution of dinoflagellates and have shaped their gene content, as demonstrated by the results of metatranscriptomics analysis in this study and several previously reported several transcriptomic studies.

Conclusions
This study aimed to improve understanding of the interplay of community composition, functional traits and environmental parameters to strengthen the hypothesis that marine communities and ultimately higher trophic levels are affected by the environmental conditions associated with climate change. Variance analyses of Pfam domains deduced from the metatranscriptomes elucidated functional cellular and metabolic processes affected by changing environmental conditions or community structure, reflecting aspects of the ecological niches of dinoflagellates and diatoms ( Figure 6). functional traits and environmental parameters to strengthen the hypothesis that marine communities and ultimately higher trophic levels are affected by the environmental conditions associated with climate change. Variance analyses of Pfam domains deduced from the metatranscriptomes elucidated functional cellular and metabolic processes affected by changing environmental conditions or community structure, reflecting aspects of the ecological niches of dinoflagellates and diatoms ( Figure 6). Temperature and salinity had the most influence on the functional biodiversity of microeukaryotic communities in the arctic and subarctic/temperate waters. However, the clear partitioning of taxonomic diversity between the four regions (especially Norwegian and Icelandic samples), is not consistent with their functional similarity, which seems largely to be influenced by water chemistry and not by taxonomic identity. Thus, despite the overall geographic similarity of functional responses, the identified abiotic (i.e., temperature and salinity) and biotic (i.e., taxonomic group, namely diatoms and dinoflagellates) factors influenced functional responses. Hence, cellular processes associated with certain environmental settings originate less from the functional diversity stored in a community's transcriptome than how this functional diversity is expressed.
Functional processes were significantly influenced by temperature and salinity. Temperature is a primary driver of metabolic activities and growth, while salinity influences an organisms' capacity for ion homeostasis. The general increase in metabolic activity with temperature and an associated higher demand for ATP was associated with metabolic processes related to C-, N-and P-allocation shifts along the temperature gradient, largely independent of the underlying diverse taxonomic composition (i.e., positive correlations with carbohydrate metabolism, especially the citrate cycle, glycolysis/gluconeogenesis and pyruvate metabolism). Further, the tendency for higher growth was evidenced by a positive correlation between temperature and the category cellular processes, especially in the group of cell growth and death. Along the salinity gradient, differences in the expression of most metabolic functions were identified: secondary metabolism correlated positively, and signal transduction correlated negatively with increased salinity. Hence, salinity had an overall strong effect (in comparison with temperature) on the community, as well as on taxonomic diversity. Cellular functions and metabolic pathways of diatoms and dinoflagellates were associated with the same genetically based structure, but genes were differentially expressed. Differentially evolved Figure 6. Summary of the main influences of temperature and salinity on the microeukaryotic community, and main functional pathways correlated with diatom or dinoflagellate abundance.
Temperature and salinity had the most influence on the functional biodiversity of microeukaryotic communities in the arctic and subarctic/temperate waters. However, the clear partitioning of taxonomic diversity between the four regions (especially Norwegian and Icelandic samples), is not consistent with their functional similarity, which seems largely to be influenced by water chemistry and not by taxonomic identity. Thus, despite the overall geographic similarity of functional responses, the identified abiotic (i.e., temperature and salinity) and biotic (i.e., taxonomic group, namely diatoms and dinoflagellates) factors influenced functional responses. Hence, cellular processes associated with certain environmental settings originate less from the functional diversity stored in a community's transcriptome than how this functional diversity is expressed.
Functional processes were significantly influenced by temperature and salinity. Temperature is a primary driver of metabolic activities and growth, while salinity influences an organisms' capacity for ion homeostasis. The general increase in metabolic activity with temperature and an associated higher demand for ATP was associated with metabolic processes related to C-, N-and P-allocation shifts along the temperature gradient, largely independent of the underlying diverse taxonomic composition (i.e., positive correlations with carbohydrate metabolism, especially the citrate cycle, glycolysis/gluconeogenesis and pyruvate metabolism). Further, the tendency for higher growth was evidenced by a positive correlation between temperature and the category cellular processes, especially in the group of cell growth and death. Along the salinity gradient, differences in the expression of most metabolic functions were identified: secondary metabolism correlated positively, and signal transduction correlated negatively with increased salinity. Hence, salinity had an overall strong effect (in comparison with temperature) on the community, as well as on taxonomic diversity. Cellular functions and metabolic pathways of diatoms and dinoflagellates were associated with the same genetically based structure, but genes were differentially expressed. Differentially evolved regulations of cellular and physiological functions may promote different life-history strategies. Similar environmental signals possibly lead to taxa-specific differential regulations of the same pathways, hence creating different cellular responses and trade-offs. For dinoflagellates, the importance of nitrogen metabolism, vesicular transport, and signal transduction highlights the role of biotic interactions in the evolution of dinoflagellates. Further, this study shows that the expression of various N-acquisition genes in dinoflagellates is correlated with their abundance; hence, the exploitation of various N-sources likely confers a competitive advantage on this group. Diatoms were especially associated with metabolites, like isoprenoids and carotenoids, which highlights their critical autotrophic role. Diatoms were also linked with lysine degradation, indicating the physiological importance of this metabolic pathway for diatoms in their natural environment.
Both, the influence of the environment on the microeukaryotic community, as well as functional processes influenced by the diversity, impact the overall marine diversity and higher trophic levels; however, the amount of impact has to be examined. The information gathered here can support ecological questions concerning the marine ecosystem state and metabolic interactions in the marine environment.
Supplementary Materials: The following are available online at http://www.mdpi.com/2076-2607/8/4/567/s1, Figure S1: Venn diagram of ASVs among the microeukaryotes in the four coastal sampling regions, Figure S2: Bar chart of the relative rRNA and mRNA diversity of the microeukaryotes, Figure S3: Distribution of taxonomic groups in the four coastal sampling regions, Figure S4: Principle Component analysis (PCA) of Euclidean distances of environmental parameters, Figure S5: Distribution of environmental parameters of the sampling regions, Figure S6: FuncTree2 Plots of all Pfam domains corresponding to KEGG Orthologies (KO) terms for each region, Figure S7: Heat map of correlation between environmental parameters and relative abundance of dinoflagellates and diatoms, Figure S8: FuncTree 2 Plot of correlations with Pfam domains and what the variability is most likely related to, including chlorophyll-a, phosphate, and silicate, Figure S9: The negative correlation between dinoflagellates and diatom Pfam abundancetitle, Table S1: Overview of 454-pyrosequencing reads and number of ASVs, Table S2: Details to supplementary Figure