Introduction

The islands and archipelagos in the Eastern Indo-Pacific host a multitude of diverse terrestrial and marine environments, often still largely undisturbed by humans. Their faunal and floral assemblages are highly diverse, originating both from Australasia and the Americas, with ocean currents, geological history and recent human influence shaping the unique faunal composition of the different islands (Gillespie et al. 2008; Neall and Trewick 2008; Aswani and Allen 2009; Hall et al. 2013).

The 118 islands and atolls of French Polynesia are among the most isolated regions on Earth, and with only about half of them being inhabited, large parts of their natural environments remain pristine (Lecchini et al. 2021). The 2000 km region spanning French Overseas Collectivity is divided into the five main archipelagos of Society (Windward & Leeward Islands), Tuamotu, Marquesas, Austral, and Gambier. The region hosts a wide variety of endemic terrestrial and marine lifeforms (e.g., Delrieu-Trottin et al. 2015; Petek et al. 2017; Ramage 2017) (Fig. 1). As defined by the Marine Ecoregions of the World (MEOW, Spalding et al. 2007), a hierarchical classification of marine regions into realms, provinces and ecoregions, these island systems form the two Marine Provinces (“MP”s) Southeast Polynesia and the MP Marquesas within the Eastern Indo-Pacific realm.

Fig. 1
figure 1

Modified from UN OCHA/ReliefWeb

Map of French Polynesia (Eastern Indo-Pacific realm), covering MPs Southeast Polynesia and Marquesas (highlighted in red), displaying major island groups and other notable islands. Border colors reflect positions of map and inset on miniature globe on the top right. Inset: Detail of Wallis and Futuna islands with the adjacent Samoa archipelago in the MP Central Polynesia (highlighted in blue).

The islands of Wallis and Futuna constitute a further French Overseas Collectivity in the Eastern Indo-Pacific realm. They are located west of Samoa (Fig. 1), as part of the MP Central Polynesia. Wallis and Futuna adjoin the border between the Central Indo-Pacific realm and the Eastern Indo-Pacific realm, making them a junction for faunal exchange in the region and emphasizing their significance for biodiversity studies.

While the biodiversity of many reef ecosystems in the Eastern Indo-Pacific realm has been surveyed in several independent studies, most of the research has focused on molluscs (Legendre and Salvat 2015), corals (Pratchett et al. 2013), and fishes (Galzin et al. 1994). Despite other formerly underrepresented organismal groups with important roles in the maintenance of reef ecology, like sponges (Phylum Porifera), having been explored more thoroughly in the past decade, they still remain understudied in the Eastern Indo-Pacific realm in comparison (e.g., Van Soest et al. 2012; Hall et al. 2013; Petek et al. 2017). The ability of sponges to cycle abundant dissolved organic matter into particulate organic matter, which then becomes available to higher trophic levels, is a key component of the nutrient cycle in oligotrophic coral reef ecosystems (de Goeij et al. 2013; Pawlik and McMurray 2020). Likewise, their capacity to substantially alter different types of marine substrates contributes to the shaping of reefs, while also acting as important providers of micro- and macrohabitats (Bell 2008; Maldonado et al. 2015; Folkers and Rombouts 2020). Sophisticated biochemical defense mechanisms present in many sponges constitute an important source of marine bioactive compounds with a large potential for novel pharmaceutical applications, making bioprospecting for these compounds another major source of data on global sponge distribution and biodiversity (e.g., Amade et al. 1982; El-Demerdash et al. 2019; Galitz et al. 2021). One of the most challenging aspects of chemo- and biodiversity studies on sponges is frequently uncertain or unreliable identification of the sponges due to phenotypic plasticity and convergent characters (see Erpenbeck et al. 2006; Andreakis et al. 2012; Galitz et al. 2021). In most cases molecular data are indispensable for the correct identification and delineation of ambiguous sponge taxa, although major taxonomic challenges still apply in certain orders, e.g., Haplosclerida (López-Legentil et al. 2010; DeBiasse and Hellberg 2015; Vicente et al. 2019).

Hall et al. (2013) analyzed the biodiversity of sponges of the Society and Marquesas Islands based on 75 morphospecies, but data on the other island groups of French Polynesia remain comparatively sparse, with the Tuamotu archipelago, Austral Islands and Gambier Islands in particular being understudied (see also: Moorea, Society Islands: Freeman and Easson 2016; Calcarea only: Klautau et al. 2020; Society Islands, Marquesas (submarine caves): Schuster et al. 2021). To date, only 16 valid sponge species have been confirmed for MP Marquesas, with an additional 39 species for the remainder of the numerous French Polynesian islands in MP Southeast Polynesia, and another 14 species known from MP Central Polynesia (see de Voogd et al. 2023). These numbers appear to be exceptionally low, when compared to the recorded sponge species biodiversity in some of the marine provinces of the Central Indo-Pacific realm (Tropical Southwestern Pacific: 412 species; Sahul Shelf: 321 species; Northeast Australian Shelf: 321 species). This vast and remote geographic area of French Polynesia with its multitude of different reef ecosystems might be prone to biases, such as incomplete sampling or the omission of cryptic (i.e., reef matrix) habitats. Consequently, additional data are needed to obtain a more comprehensive understanding of the sponge fauna in the Eastern Indo-Pacific realm and of the extent of sponge biodiversity in this region (Van Soest et al. 2012).

In this study, we report on the sponge fauna of Tuamotu and other major French Polynesian archipelagos, as collected during different campaigns in the area between 2009 and 2013, and for the first time, with an emphasis on molecular taxonomy. We aim to investigate the molecular biodiversity of demosponges (Class Demospongiae) in the Eastern Indo-Pacific realm, understand the connectivity of its marine provinces to those of the adjacent Central Indo-Pacific realm, and discuss the results in relation to previous findings from these regions. This study aims to fill gaps in the knowledge of biodiversity and species distribution of Indo-Pacific sponges, and to provide information necessary for the adoption of appropriate measures to increase effectivity of species conservation and bioprospecting (Glover et al. 2018).

Materials and methods

The sponge material processed in this study was collected in the course of multiple scientific expeditions to the French Polynesian archipelagos, and had been morphologically described and classified into Queensland Museum morpho-OTUs (QM####) (here also referred to as “morpho-IDs”) in Hall et al. (2013) and Petek et al. (2017). The present study aimed to generate independent corresponding molecular taxonomic units (referred to as MOTUs in the following text) from the available sponge material, for the use in biodiversity and phylogenetic analyses, which were not necessarily restricted to the previously assigned morpho-IDs. Specimens with conflicting results between morpho-ID and subsequent MOTU underwent additional morphological re-identification, where possible.

Sponge material

In total, 371 samples (Supplementary Table S1) were collected by SCUBA in the course of different collecting campaigns in the Eastern Indo-Pacific. The campaigns aimed to assess the bio- and chemodiversity of marine invertebrates around the Polynesian archipelagos and islands. The applied sampling strategy focussed on the collection of macroscopic sponges on the reef surface, consequently cryptic spaces, such as overhangs, caves or internal reef matrix, were not explicitly sampled in the course of these expeditions.

Surveys of the Marquesas Islands were conducted in 2009 (BSM-PF1 expedition Leg 2, Debitus et al. 2009) and 2011 (Pakaihi I Te Moana cruise Leg 2), while during the Tuam 2011 expedition, Tuamotu and a part of the Society archipelago were sampled (Debitus et al. 2011). Note that for samples and data from Marquesas Islands primarily sponge material with relevancy for biochemical studies was to our disposal, consequently deviating from the sampling scheme of other expeditions, and resulting in a specimen bias of a numerical overrepresentation of a small number of species (see Supplementary Table S1). Further samples from the Society Archipelago were obtained during the Tahiti Iti expedition (Debitus et al. 2013b), during surveys for Hall et al. (2013), and Leg 1 of the BSM-PF1 expedition (Debitus et al. 2009). Additional specimens from the Austral and Gambier islands were collected during the Coralspot and Tuhaa Pae (Debitus et al. 2013a) surveys respectively. Most of the sponge specimens collected in the course of these expeditions and used in this study are also depicted in Petek et al. (2017), with relevant metadata, classification into (morphological) OTUs, and, for some, morphological descriptions. Comparative sequences of sponge specimens from MP Central Polynesia (Wallis and Futuna), as collected in 2016 during the Tara Pacific expedition (Futuna, see Planes et al. 2019) and the WALLIS 2018 expedition (Petek et al. 2018), were added to this study. The aim of these scientific expeditions was to investigate the chemo- and biodiversity of those islands, which will be covered in greater detail in a following study.

DNA extraction, amplification, and sequencing

For DNA extraction, the protocol for PALL plate extraction was used, which allows for fast processing of large amounts of samples (Vargas et al. 2012). Widely used barcoding primers for fragments of the C-region of the nuclear ribosomal large subunit (28S) (Chombard et al. 1998) and the cytochrome oxidase subunit 1 (CO1) standard barcoding fragment (Meyer and Paulay 2005) were used in the PCR amplification process of the samples (Table 1). The combination of these two markers (ribosomal and mitochondrial) has been successfully employed in previous biodiversity studies on the sponge communities of other Indo-Pacific sponge faunas (Erpenbeck et al. 2016, 2020). In this study we primarily focused on the use of 28S sequences for MOTU generation, biodiversity analyses, and supplemental phylogenetic tree reconstruction, due to their higher phylogenetic resolution in most sponge taxa. The comparatively more conservative mitochondrial CO1 marker was employed to confirm the generated 28S sequences and identify potential ambiguities.

Table 1 List of primers used in this study

PCR reactions were performed in volumes of 25 µL, comprising 5 µL 5× green GoTaq ® PCR Buffer (Promega Corp, Madison, WI), 4 µL 25 mM MgCl2 (Promega Corp, Madison, WI), 2 µL 10 mM dNTPs, 1 µL of primer (5 µM), 2 µL BSA (100 µg/ml), 7.8 µL water, 0.2 µL GoTaq® DNA polymerase (5u/μL, Promega Corp, Madison, WI), and 2 µL DNA template per reaction.

Amplification followed established temperature profiles of initial denaturation (3 min/95 °C), followed by 35 cycles of 95 °C/30 s (denaturation), followed by 51 °C (28S) or 40 °C (CO1) for 30 s (annealing) and 72 °C/1 min (extension), concluded by a final extension step at 72 °C for 5 min (Erpenbeck et al. 2016). The examination of PCR products was conducted on a 1% TAE agarose gel stained with peqGREEN (peqlab) fluorescence dye. The processed products were subsequently sequenced with a BigDye® Terminator v3.1 (Applied Biosystems®) following the manufacturer’s protocol at the Sequencing Service of the Department Biology, LMU—Genomics Service Unit (Martinsried, Munich, Germany) on an ABI 3730 capillary sequencing machine. Raw sequences were basecalled and trimmed in CodonCode Aligner v9.0.4 (www.codoncode.com), and assembled in Geneious Prime 2019 (v2019.2.3) (www.geneious.com). Individual sequences were checked for intragenomic polymorphisms, which were subsequently corrected to corresponding IUPAC ambiguity codes and disregarded in the subsequent biodiversity analyses. To check for contamination and to pre-classify unidentified samples, sequences were compared against the databases of the European Nucleotide Archive (ENA) using BLASTn (Altschul et al. 1990), and against NCBI Genbank. Verified Porifera sequences were further processed and corrected in Geneious. Final sequences are deposited in the European Nucleotide Archive (ENA) database under the accession number range OX451516-OX451734 and the Sponge Barcoding Project (www.spongebarcoding.org) (Wörheide and Erpenbeck 2007) (SBD#2345-2565).

Phylogenetic analyses and taxonomic evaluation

Sequences were aligned using the MAFFT v7.450 (Katoh and Standley 2013) plugin for Geneious Prime® (v2019.2.5) and applying the FFT-NS-i x2 alignment algorithm. Comparative sponge sequences from other marine provinces were added from the Sponge Barcoding Project. These included MP Central Polynesia (Galitz et al. unpublished), and three further MPs of the adjacent Central-Indo Pacific realm (MPs Tropical Southwestern Pacific, Sahul Shelf, and Northeast Australian Shelf). Maximum-likelihood reconstructions were generated with RAxML 8 (Stamatakis 2014) as implemented in Geneious Prime® 2019.2.5 under the GTR GAMMA I model as suggested by Modeltest 2.1.10 (Darriba et al. 2012) and 1000 rapid bootstrap replicates.

Molecular Biodiversity analyses

For the biodiversity analysis, 28S sequences, which comprised the largest set of Central Indo-Pacific sequences for comparison, were divided into their respective MPs sensu Spalding et al. (2007) using QGIS 3.10. (QGIS Development Team 2019, https://qgis.org/). While CO1 sequences were generated in addition to 28S, the smaller size of the available mitochondrial datasets from the studied MPs, and lower resolution compared to the nuclear marker, made the CO1 sequences less suitable for biodiversity analyses. Consequently, in the following “MOTU” will always refer to 28S sequences as their origin. Sequences longer than 350 bp were aligned with ClustalW (Thompson et al. 1994) as implemented in the msa package for R (Bodenhofer et al. 2015). DECIPHER 2.0 (Wright 2016) was used to cluster sequences into respective MOTUs. MOTUs will be used as the molecular equivalent to the previously assigned morpho-IDs, and in this context as baseline for molecular biodiversity assessments. Note that MOTUs are inherently independent from their original morpho-IDs, with conflicting results, however, being further investigated. To delineate MOTUs, we used the unweighted pair group method with arithmetic mean (UPGMA) algorithm (see Cowman et al. 2017; Hadiyanto et al. 2021), which has been shown to have the best performance in hierarchical cluster analyses (Kreft and Jetz 2010). The threshold for 28S MOTU differentiation was set to 0.3% base pairs difference over a minimal sequence length of 350 bp in this fragment, equal to a genetic difference of ≤ 1 bp, which follows Erpenbeck et al. (2016, 2020), but accounts for additional, undetected intragenomic polymorphisms. Since false MOTUs may occur due to undetected sequencing errors, despite careful basecalling and sequence quality control, we also repeated the analysis with single-specimen-MOTUs (= singletons) omitted for additional quality control.

Biodiversity analyses were performed in R v4.1.1 using the packages VEGAN (v4.2.4), picante (v1.8.2), and iNEXT (Dixon 2003; Kembel et al. 2010; Chao et al. 2016; R Core Team 2021). Visualization of MOTU overlap or uniqueness between regions was conducted using the package UpSetR (Conway et al. 2017). For biodiversity analyses within and between regions commonly used indices were computed, with additional Chao1 species richness and Pielou’s evenness estimates (Pielou 1966; Chao 1984): Local (alpha) biodiversity is represented by Shannon–Wiener and Fisher’s alpha indices, although differences in sampling strategy, sample size and sampling biases might make Shannon values less suitable for comparing one area’s regional biodiversity to another. Fisher’s alpha should be less sensitive in this regard (Beck and Schwanghart 2010). Beta diversity between the selected marine provinces was calculated by the Jaccard dissimilarity index (Jaccard 1912). Beta diversity estimates can be sensitive to differences in sample size. Among the plethora of beta-diversity indices, the Jaccard index is discussed as comparatively invulnerable to errors of taxonomy and enumeration, and with low error rates in geographic undersampling (see e.g., Williams 1949; Pos et al. 2014; Schroeder and Jenkins 2018; Hadiyanto et al. 2021). The significance of the calculated beta diversity indices was subsequently verified by applying a pairwise PERMANOVA (permutational multivariate analysis of variance) using a custom variation of the adonis function of VEGAN for pairwise comparison of regions, with 999 permutations and adjustments for false discovery rate (FDR) taken into account (Anderson 2001, 2017). In addition to the computation of biodiversity indices, rarefaction analyses were conducted to assess and extrapolate both the species richness and the sampling completeness of the respective provinces. These were computed and visualized with iNEXT (Chao et al. 2016).

Morphological species identification

For supplemental morphological re-identification of conflicting results between prior morphological analyses and molecular genetic data generated in the course of this study, identical analytical procedures to Hall et al. (2013) and Petek et al. (2017) were conducted where necessary:

For morphological analyses and subsequent categorization into morphospecies and morpho-IDs of the collected Queensland Museum specimens, the fresh sponge material was immediately preserved in 70% ethanol. For light microscopy analyses, thin sections through the ectosome and choanosome of the tissue were cut with a scalpel. The sections were cleared in phenol-xylene overnight and embedded in Fluka Durcopan™ (Sigma-Aldrich Co., St. Louis, MO, USA). Additional preparations of the spicules were made for light microscopy by digesting small portions of sponge in nitric acid; the sponge-acid mix was heated over a flame and the remaining spicules mounted in Canada balsam.

Separate preparations of fibers were conducted by the addition of 12.5% sodium hypochlorite to remove soft tissue. The dissolution of tissue was monitored and facilitated by the removal of excess collagen with forceps, with subsequent neutralization of the reaction with distilled water, and finally two-fold rinsing in 70% ethanol and then 100% ethanol. Fibers were examined using an Olympus SZ60 dissection microscope with a Tucsen 3.0 camera. The fixed section and spicule microscope slides were examined using an Olympus BH2 with an optical stage micrometer and photographed with a Nikon CoolPix 5400 mounted camera. Preserved specimens were photographed with a Canon G5X. Surface characteristics were examined using an Olympus SZ60 dissection microscope with a Tucsen 3.0 camera.

Spicule preparations for Scanning Electron Microscope (SEM) were made by dissolving the tissue in 12.5% sodium hypochlorite for the removal of soft tissue, followed by neutralization in distilled water, rinsing twice in 70% ethanol, and finally rinsing twice in 98% ethanol and air-drying the preparations. SEM preparations were sputter-coated in gold to improve resolution. The scanning electron micrograph photos and measurements were made using a Hitachi TM-1000 SEM.

Results

Sequencing and species identification

From a total of 371 available sponge specimens from the Eastern Indo-Pacific MP Marquesas and MP Southeast Polynesia, 242 28S (65.2%) and 196 CO1 (52.8%) sequences were successfully obtained, with 137 specimens being covered by both fragments (36.9%). Maximum length of complete sequences was 536 bp for 28S (shortest fragmented: 264 bp) and 682 bp for CO1 (shortest fragmented: 288 bp). Of the initial 371 sponge samples, 327 specimens (88.1%) were morphologically identified to species, or at least to genus level, corresponding to 130 different morpho-IDs and 104 MOTUs respectively.

Rarefaction analyses of the MP Southeast Polynesia samples indicated high species diversity and high potential for yet undiscovered species (Fig. 2). The MP Marquesas was not included in the final rarefaction analysis, as part of the MP Marquesas specimen data were primarily forwarded for biochemical studies, which deviates from the sampling scheme of the other regions included in this study, which is also reflected in the low evenness value for MP Marquesas (Table 2). Due to this specimen bias of a numerical overrepresentation of a small number of species (i.e., Ptilocaulis sp. (QM1640), Monanchora sp. (QM4696), Suberea sp. (QM2093), Suberea ianthelliformis; see Supplementary Table S1) in the available Marquesas data subsets, they were unsuitable for further biodiversity evaluation (see “Discussion”).

Fig. 2
figure 2

a MP Southeast Polynesia rarefaction and extrapolation of biodiversity based on sample-size; b Sample completeness curve

Table 2 Distribution of specimens and MOTUs per region, local rate of endemism, alpha biodiversity indices for Shannon–Wiener and Fisher’s alpha, and estimates for Chao1 species richness and Pielou’s evenness; based on 28S sequence data

Biodiversity analyses

After the addition of comparative material published in the Sponge Barcoding Project (430 28S sequences) from the adjacent MPs Northeast Australian Shelf, Tropical Southwestern Pacific, Sahul Shelf, and Central Polynesia, the final dataset comprised a total of 966 sequences, of which 894 sequences exceeded the predefined threshold of 350 bp and were included in the following analysis steps. The total for unique MOTUs across the entire dataset amounted to 388, of which singletons accounted for 61.3% (238) (Table 2, see Table S1 in supplementary material for a complete list of MOTUs). The number of MOTUs per province strongly differed between 13 MOTUs from MP Marquesas to up to 104 MOTUs from MP Central Polynesia. The ratio of MOTUs unshared with other provinces (= endemic in the dataset) ranged between 63.9% (MP Tropical Southwestern Pacific) and 86.5% (MP Sahul Shelf) (Table 2). 179 MOTUs from the dataset are exclusive to the Eastern Indo-Pacific realm (MP Central Polynesia = 84, MP Southeast Polynesia = 76, MP Marquesas = 9). They comprised 46.1% of the total 388 unique MOTUs in this study.

Within the Eastern Indo-Pacific realm, the largest MOTU overlap between marine provinces was between MPs Southeast Polynesia and Central Polynesia (7 MOTUs), followed by MPs Southeast Polynesia and Marquesas (2 MOTUs), and no shared MOTUs between MPs Central Polynesia and Marquesas. Only one MOTU was shared between all three provinces (Figs. 3, 4). The remaining 169 MOTUs were not shared with any other province of the Eastern Indo-Pacific realm. The MOTUs restricted to the Central Indo-Pacific realm amounted to 193 (49.7%) (Fig. 4). In total, only 16 (4.1%) MOTUs were shared between the Central Indo-Pacific realm (= MPs Northeast Australian Shelf + Tropical Southwestern Pacific + Sahul Shelf) and the Eastern Indo-Pacific realm (= MPs Central Polynesia + Southeast Polynesia + Marquesas).

Fig. 3
figure 3

MOTU count (incl. singletons) per marine province and overlap between provinces. Histogram on the left: Number of MOTUs per MP. To its right: Black dots without connecting lines represent the number of unique MOTUs in a MP, while dots connected by lines indicate the number of MOTUs present in two or more MPs. Respective numbers of MOTUs for each scenario depicted with histogram on the top. Abbreviated marine province names (MPs) adopted from Spalding et al. (2007): Marquesas (MAR), Southeast Polynesia (SEP), Central Polynesia (CEP), Tropical Southwestern Pacific (TSP), Sahul Shelf (SAH), Northeast Australian Shelf (NEA)

Fig. 4
figure 4

a Number of shared and unshared MOTUs as unique to the MPs Central Polynesia, Marquesas and Southeast Polynesia within the Eastern Indo-Pacific realm. Graphical overlap between Marquesas and Central Polynesia removed due to the absence of shared MOTUs; b Left: MOTU overlap between the Central Indo-Pacific realm and Eastern Indo-Pacific MPs Central Polynesia, Southeast Polynesia + Marquesas shown separately. Right: MOTU overlap between Central and Eastern Indo-Pacific realms only. Circle sizes relative to absolute MOTU numbers. Created with BioVenn (Hulsen et al. 2008)

The Eastern Indo-Pacific realm MP with the highest Central Indo-Pacific realm MOTU overlap was MP Central Polynesia (Agelasida: 3; Dictyoceratida: 2; Scopalinida: 1; Tetractinellida: 1; Verongiida: 1; Biemnida: 1), compared to MP Southeast Polynesia (Agelasida: 1; Biemnida: 1; Haplosclerida: 1) and MP Marquesas (Verongiida: 1) (Figs. 3, 4). These respective MOTUs were not shared with another MP of the Eastern Indo-Pacific realm, and consequently constitute independent faunal links to the Central Indo-Pacific realm, with the genera Agelas (4 MOTUs) and Suberea (2 MOTUs) being the only ones representing their order with more than one species.

The alpha biodiversity analysis results of the studied MPs displayed comparatively high values for the Shannon–Wiener index, largely coinciding with the respective estimated Chao1 species richness, with the lowest in the MP with the smallest total MOTU numbers (MP Tropical Southwestern Pacific, Table 2). Fisher’s alpha analysis results indicate generally high biodiversity, but show strong variations between provinces, with no clear relationship to location, richness estimates, or numbers of specimens or MOTUs. Despite the varying results of the alpha diversity analyses, evenness across the studied MPs appears high, displaying values > 0.9 for all datasets. Indices for MP Marquesas are excluded in this assessment due to the above-mentioned specimen bias (See “Materials and methods”).

The beta diversity analyses recovered Jaccard dissimilarity indices between the studied provinces mostly ranging between 0.95 and 1.0, coinciding with the low number of MOTU overlaps between the MPs of the study area and high endemism. All p values (FDR adjusted) of Jaccard indices were recovered as being highly significant (Fig. 5). In a few instances, dissimilarity values were below 0.95: between MPs Central Polynesia and MP Southeast Polynesia in the Eastern Indo-Pacific realm (11 MOTUs), and also between MPs Northeast Australian Shelf and Tropical Southwestern Pacific in the Central Indo-Pacific realm (10 MOTUs) (Fig. 3), reflecting a relatively higher overlap compared to the other regions (see also Figs. 5, 6). The majority of overlapping OTUs belonged to sponges of the orders Verongiida, Dictyoceratida and Haplosclerida in both cases, however no genera stood out as prominently represented for their respective orders. Higher values of similarity between MPs are in line with the biogeographical classification on the realm level, with highest congruence between the faunas in the MPs of the Eastern Indo-Pacific realm and of the Central Indo-Pacific realm respectively (Fig. 5). Similarity values on the province level may not (always) reflect the affiliation of an MP with their respective realm, as geographical distance (generally) correlated well with lower Jaccard indices.

Fig. 5
figure 5

Values on bottom right of the pairwise comparison diagram indicate 28S gene sequence Jaccard dissimilarity (in percent) of the studied marine provinces; higher values correspond to lower faunistic similarity between MPs, with 100 representing no overlap of MOTUs; graduated in 1.5% increments. Values on top right of the pairwise comparison diagram indicate false discovery rate (FDR) adjusted p values of respective 28S Jaccard indices generated through pairwise PERMANOVA. Values for MP Marquesas under reserve due to potential distortion by specimen bias. The dendrogram on the left depicts faunal similarity between two given regions based on their respective Jaccard indices; numbers on branches are bootstrap replicates. Background color corresponds to Central Indo-Pacific marine realms (yellow) and Eastern Indo-Pacific marine realms (green). Abbreviated MP names: Marquesas (MAR), Southeast Polynesia (SEP), Central Polynesia (CEP), Tropical Southwestern Pacific (TSP), Sahul Shelf (SAH), Northeast Australian Shelf (NEA)

Fig. 6
figure 6

Geographical plot of Jaccard dissimilarities according to values in Fig. 5. Coloured lines display the degree of dissimilarity between the MPs as denoted in the legend; higher Jaccard-indices correspond to lower faunistic similarity between MPs, with 100 representing no overlap of MOTUs; graduated in 1.5% increments. Marine realms of the Central Indo-Pacific (yellow) and Eastern Indo-Pacific (green) are highlighted. Abbreviated MP names: Marquesas (MAR), Southeast Polynesia (SEP), Central Polynesia (CEP), Tropical Southwestern Pacific (TSP), Sahul Shelf (SAH), Northeast Australian Shelf (NEA). Dashed lines to MP Marquesas (MAR) indicating a data/specimen bias, and consequently a potential ambiguity of the respective results (see “Discussion” for details)

Discussion

Our results reveal high biodiversity and high levels of endemism among the previously largely unknown sponge faunas of the investigated Eastern Indo-Pacific MPs, but also highlight the substantial differences between marine provinces regarding species richness and composition. Most of these differences appear to be independent of geographical distance, but rather are influenced by factors like local geography, sea surface currents, and animal biology which shape the regional biodiversity (for similar results of other marine taxa see also Briggs and Bowen 2012, 2013; Kulbicki et al. 2013; Veron et al. 2015; Crandall et al. 2019).

Molecular taxonomic diversity within MPs Southeast Polynesia and Marquesas

MP Southeast Polynesia: Our MP Southeast Polynesia data (92 MOTUs) exceeds the study of Hall et al. (2013, 41 (morpho-)species), by inclusion of Austral, Gambier, and Tuamotu samples. While Hall et al. (2013) exclusively discussed samples from the Society archipelago, our MP Southeast Polynesia data comprises a wider geographic range, in which now 53% originated from French Polynesian archipelagos other than Society Islands (77 of 146).

Our data display similarities in taxonomic composition of the MP Southeast Polynesia sponge fauna to those of Hall et al. for Society Islands. Dictyoceratida (36 MOTUs) constitute the dominant sponge order, followed by Haplosclerida (18) and Verongiida (10). Such dominance of Dictyoceratida corroborates taxonomic studies across the Indo-Pacific, including the Western Indian Ocean (e.g., Erpenbeck et al. 2016, 2020), and can therefore be regarded as a general pattern for shallow water tropical reefs. It is interpreted as the consequence of a complex interaction of biotic and abiotic factors favoring the occasionally photosymbiotic Dictyoceratida, such as types of substrate, exposure to light and ocean currents, as well as individual reproductive and dispersal potential over short and long distances (see for details Wilkinson 1988; Duckworth et al. 2008; Wulff 2012).

MP Marquesas: In contrast to the number of morphospecies (38; incl. 15 verified to species level) from MP Marquesas reported in Hall et al. (2013), the data of our molecular study recovers a lower number of MOTUs (13 MOTUs). The taxonomic composition of our MP Marquesas MOTUs differs from Hall et al. (2013) in that Poecilosclerida, Verongiida and Haplosclerida (4 each) account for the majority of MOTUs. This large disparity is mainly caused by an apparent specimen bias of the available data subsets from the respective Marquesas expeditions, with a numerical overrepresentation of a small number of species (e.g., BSM-PF1) (see “Materials and methods”). The bias is reflected in the skewed ratio of molecular MOTUs among the three Eastern Indo-Pacific realm MPs Marquesas [~ 6.7% of the total 195 MOTUs (including MOTUs shared with all MPs)], Southeast Polynesia (~ 47.2%), and Central Polynesia (~ 53.3%), despite comprising 15.6% of the total specimen numbers from this realm.

Our molecular phylogenetic analyses initially also recovered several morphospecies (sensu Petek et al. 2017) as potential cryptic species based on non-monophyletic occurrences in the tree. After careful re-examination of the original material, it was possible to assign new morpho-IDs to the majority of these assumed cryptic species based on minute differences of morphological characters. The ambiguous species complexes in question comprise a divergent clade of Monanchora sp. (QM4696) in Poecilosclerida, now regarded as Poecilosclerida indet. (SNSB-BSPG.GW9929, GW9930, GW9933), due to morphological ambiguities. Likewise, a seemingly cryptic complex of Spongia sp. (QM1983) in Dictyoceratida could be resolved into Spongia sp. (QM4490) and Hyattella sinuosa. Lastly, a single verongiid Suberea sp. (QM2093) exhibited molecular identity to Suberea sp. (QM2121), grouping apart from the majority of other Suberea sp. (QM2093) specimens. Initial morphological re-identification yet did not recover morphological differences but will be continued (See Supplementary Table S1, Fig. S2). In this respect, molecular data repeatedly had revealed cryptic speciation in sponges of the Indo-Pacific (e.g., Wörheide et al. 2002, 2008; Andreakis et al. 2012) and particularly morpho-OTU based taxonomic assessments (i.e., prior to the full description of a species, e.g., Hall et al. 2013) can be prone to overlooked cryptic species (Pöppe et al. 2010).

An important, but frequently ignored component of biodiversity of benthic organisms in tropical reefs is the cryptofauna living within the reef matrix, which sponges are major occupants of (Richter et al. 2001). Novel metabarcoding approaches on its cryptobenthic community in Hawaiʻi complemented traditional collection and barcoding efforts and significantly increased the taxonomic knowledge on Eastern-Indo Pacific cryptofauna and its differences to the benthic epifauna (Vicente et al. 2021; Timmers et al. 2022). These studies on cryptofaunal diversity highlighted once more the vast taxonomic potential that is hidden within the reef, which traditional sampling efforts, as applied in our study, cannot assess.

Evaluation of differences in local (alpha) biodiversity

The studied MPs of the Central and Eastern Indo-Pacific realms display high local biodiversity, as implied by their alpha diversity indices and richness estimates, with values comparable to other studies investigating the diversity of Indo-Pacific sponge faunas (e.g., Powell et al. 2014; Fromont et al. 2016; Rovellini et al. 2019). Shannon–Wiener indices for most MPs exceed values of 4, with MP Tropical Southwestern Pacific displaying the lowest value of ~ 3.5, and MP Marquesas exhibiting a specimen bias (Table 2). The notable drop for MP Tropical Southwestern Pacific coincides with the dataset for this province being the smallest, which can pose a bias for the Shannon–Wiener index calculation, despite the relative evenness calculated for all unbiased MPs. Alpha biodiversity values of the Fisher’s index display larger variations between MPs, with values ranging from ~ 57.8 (MP Central Polynesia) up to 113.4 (MP Sahul Shelf), and MP Marquesas again being a distinct outlier. The Fisher’s alpha values locally deviate from the predicted Shannon–Wiener and Chao1 indices, suggesting a comparatively lower biodiversity for MPs Central Polynesia and Northeast Australian Shelf instead (Table 2). The large provincial variations, however, also imply the possibility of index bias due to differences in the respective dataset structures, as the different collections were not initially conducted with detailed local or large scale biodiversity estimation in mind.

Indices measuring the alpha biodiversity can be beneficial to determine an estimation of the local species richness of a given region, but they can also have some major drawbacks and variance depending on the index used and the type and quality of data provided for the respective locality. Shannon–Wiener indices in particular have to be interpreted carefully, as coupling of species richness and evenness is discussed as an increasingly inadequate measure of actual biodiversity (Strong 2016; MacDonald et al. 2017). Notable outliers in our results of Shannon–Wiener and Fisher’s alpha indices as assessed for MP Marquesas (Table 2) are due to the high number of specimens with concurrently low taxonomic diversity, with a single genus (Suberea) making up almost half of the total specimens. Considering the respective biodiversity indices of MP Southeast Polynesia and other MPs (Table 2), these values are unlikely to represent accurate biodiversity, but can be attributed to the large proportion of mono-specific specimens present in the available data subset of the BSM-PF1 expedition, while sponges of the AMP-Marquises expedition only represent 10% of the total specimen count from Marquesas.

Conclusively, the large variations in specimen and MOTU counts between the regions make an objective comparison of alpha biodiversity difficult. Despite high evenness for the majority of datasets, the variances of both Shannon–Wiener and Fisher’s alpha biodiversity indices may signify a lack of sampling completeness, which is supported by rarefaction projections and richness estimations of the respective MPs (Table 2).

Sponge beta diversity in Central- and French Polynesia

The Jaccard dissimilarity values indicate low MOTU overlap between the marine provinces studied, with most provinces displaying dissimilarity values > 95 and consequently emphasizing high provincial endemism (Fig. 5). Similar levels of endemism in sponge faunas have been found during molecular biodiversity assessments of the Red Sea and Persian Gulf sponge fauna (Erpenbeck et al. 2016, 2020); these studies corroborated restricted species distribution in sponges as earlier detected in taxon-level studies (Wörheide et al. 2008; Pöppe et al. 2010; Xavier et al. 2010; Reveillaud et al. 2011; Setiawan et al. 2015; Erpenbeck et al. 2017).

Despite the Jaccard index being less prone to certain types of common biases (see “Materials and methods”), it is not invulnerable to biased sampling or datasets, as seen in the results for MP Marquesas. With MP Marquesas disregarded, we find Jaccard dissimilarity values among MPs of the same marine realm mostly lower than inter-realm (e.g., MPs Central Polynesia to Southeast Polynesia; MP Northeast Australian Shelf to Tropical Southwestern Pacific or Sahul Shelf, Figs. 5, 6), which supports the biogeographic classification of Spalding et al. (2007). Our findings also support the morphospecies multivariate clustering of Hall et al. (2013), that found the sponge faunas of the Eastern Indo-Pacific MPs Southeast Polynesia and Marquesas closer to each other compared to Central Indo-Pacific provinces.

We recover a comparatively strong molecular overlap for Eastern Indo-Pacific's MP Central Polynesia with Central Indo-Pacific realm provinces with shared MOTUs unshared with any other MP of the Eastern Indo-Pacific realm (See Fig. 4). Orders Agelasida (Agelas spp.) and Dictyoceratida constitute the majority of shared MOTUs. The apparent abundance of Agelasida MOTUs in both MP Central Polynesia and the Central Indo-Pacific realm is notable, given that the genus Agelas is yet unreported for MP Southeast Polynesia (Petek et al. 2017).

Consequently, our results indicate that the sponge fauna of MP Central Polynesia, situated as a geographic midpoint between the realms of Central and Eastern Indo-Pacific, is comparatively strongly influenced by the sponge faunas of adjacent MPs of both realms, like Southeast Polynesia and Tropical Southwestern Pacific, turning it into a small-scale “melting pot” of regional sponge species biodiversity.

Influence of ocean surface currents and geography on sponge dispersal and distribution

Geographical proximity between marine provinces generally correlates well with faunal similarity according to our data. However, our results also show that the similarity of sponge faunal assemblages between different biogeographical regions in the Central and Eastern Indo-Pacific realms is dependent on the scale of the spatial biogeographical units examined. Computed relatedness of MPs generally corresponds well to their respective marine realm (Fig. 5), however in some cases similarities between singular MPs may be higher across realms than within them, e.g. for MPs Tropical Southwestern Pacific, Northeast Australian Shelf and Central Polynesia, when compared to their connectivity to Sahul Shelf and, under reserve, lower faunal similarity between Marquesas and Central Polynesia, when compared to Tropical Southwestern Pacific (Figs. 5, 6).

Understanding biodiversity and connectivity of biota between isolated regions like the French Polynesian islands, MP Central Polynesia, and the ecoregions of the Central Indo-Pacific realm, thus requires the consideration of multiple factors besides geographical distance (Fig. 6): These factors include (a) ocean surface currents as a primary factor influencing the dispersal range, directions, speed and seasonal fluctuations of animal larvae, and (b) larval movement speed, motility, and pelagic larval duration (Maldonado 2006).

Ocean surface currents

The principal current system regulating the regional ocean movements in the Southern Pacific is the South Pacific Gyre, with the comparatively slow, westward flowing South Equatorial Current to its North (10° S) and the extensions of the South Pacific Current in the South (~ 25° S). This gyre also includes the East Australian Current in the West, which is one of the main drivers of dispersal of tropical marine fauna in the Central Indo-Pacific realm (Fig. 7) (e.g., James and Scandol 1992; Booth et al. 2007; Condie et al. 2011).

Fig. 7
figure 7

Marine provinces of the Central Indo-Pacific marine realm (yellow background) and Eastern Indo-Pacific marine realm (green background) after Spalding et al. (2007) in the southwestern Pacific. The shape of the marine realms has been simplified. The major sea surface currents are indicated by arrows, redrawn after satellite data which was produced by the OSCAR system (NASA/NOAA) (Dohan and Maximenko 2010). Abbreviated points of interest: Fi Fiji, NC New Caledonia, Sa Samoa, So Society Islands, Tu Tuamotu Islands, Wa Wallis and Futuna

The relative isolation of the marine sponge fauna of the Marquesas Islands, as discussed in Hall et al. (2013) and corroborated in this study, can be attributed to the comparatively strong influence of the South Equatorial Current in this region, with a constant WSW flow directed towards the North of the French Polynesian Islands (Fig. 7). This current system largely limits faunal exchange in one direction and only peripherally links to the northern islands of MP Southeast Polynesia, after a travel distance of 500–1500 km. Exchange between MP Marquesas and Hawaii or the Eastern Pacific (Americas) is further extremely limited due the strong Equatorial Countercurrent, both North and South Equatorial Currents, and the Eastern Pacific Barrier, an uninterrupted > 6000 km stretch of open ocean (Romero-Torres et al. 2018; Crandall et al. 2019).

Our data reveal higher faunal similarities between the distant MPs Southeast Polynesia, Central Polynesia (3000 km), and the provinces of the Central Indo-Pacific realm, as examined here, are, compared to the two adjacent French Polynesian provinces MP Southeast Polynesia and MP Marquesas (min. 500 km) (Fig. 6). Still, the total number of shared species is low, largely coinciding with the marine barrier hypotheses as proposed by Vermeij (1987). Geographic distance, and to a minor degree geologic history, appear to have the largest impact on gene flow within and between the continental and uplifted islands of the Central Indo‐Pacific realm and the volcanic islands of the Eastern Indo-Pacific realm (Crandall et al. 2019).

The main promoters of specimen exchange within and between the marine realms of Central and Eastern Indo-Pacific are the South Equatorial Current and the stronger South Pacific Current. The South Equatorial Current in particular is an important stimulus for faunal connectivity and dispersal of sponges between MP Central Polynesia and MP Southeast Polynesia (distance: ~ 3000 km). Facilitated by the comparatively stronger South Pacific Current, a number of sponge species from the Central Indo-Pacific realm may have also bypassed the MP Central Polynesia, and partially MP Tropical Southwestern Pacific, towards MP Southeast Polynesia, in rare cases of long-distance travel and settlement. The South Pacific Current and its numerous small gyres and turbulences to the South of Fiji and Tonga extend up to the islands of Wallis and Futuna and Samoa (MP Central Polynesia). These current systems are likewise in part responsible for the higher faunal overlap of MP Central Polynesia (Eastern Indo-Pacific realm) with the MPs in the Central Indo-Pacific realm, compared to the number of MOTUs shared with the French Polynesian provinces (Eastern Indo-Pacific realm) (Fig. 4). Another possible factor further impacting the biodiversity in MP Central Polynesia is the seasonal variation of the Pacific currents, causing strong eastward directed flows from the Solomon Islands and Papua New Guinea towards Wallis and Futuna during the warm El Niño events of the El Niño-Southern Oscillation (ENSO), which are not present during neutral or cold (La Niña) phases (Steele et al. 2010).

Pelagic larval duration

While ocean currents are the main forces driving larval dispersal, the survival time of animal larvae in the water column, Pelagic Larval Duration (PLD), plays a further pivotal role. A high PLD does not always guarantee a high larval distribution range, but it does dictate the effective range of animal larvae in combination with marine current systems (Alzate et al. 2019). This is particularly important for the lecithotrophic sponge larvae, as they have one of the shortest PLD among other benthic and pelagic reef organisms, ranging between few minutes and up to 20 days (van der Molen et al. 2018). Unlike planktotrophic larvae, sponge larvae are limited in their energy supply by the amount of yolk available, which consequently also limits their potential PLD (Maldonado 2006). The disadvantage of short PLDs is partially offset by the unusually high swimming speeds of sponge larvae, which are significantly faster than those of other marine organisms (Montgomery et al. 2019, 2020; Lanna and Riesgo 2020). Despite the combination of short PLD with high mobility, the high levels of endemism in the Southern Pacific realms, as recovered in our study and in Hall et al. (2013), highlight the poor connectivity among most of the ecoregions in the Central and Eastern Indo-Pacific realms, due to the comparatively long travel times and subsequent high levels of species filtration (Mora et al. 2012).

Anthropogenic influences on sponge dispersal

A further current-independent factor with potential impact on the dispersal capabilities of sponges is anthropogenic influence (e.g., ballast water), which can facilitate the crossing of larger distances or naturally impossible ranges, and thus contribute to the introduction of novel invasive species into foreign ecosystems (Hutchings et al. 2002; Carballo et al. 2013; Seebens et al. 2013). The influence of anthropogenic organism dispersal on sponge species in the Central and Eastern Indo-Pacific is yet insufficiently investigated. However, previous studies from other maritime regions and different taxa found this mode of transportation to also be a plausible factor for artificial dispersal: Ballast water and hull fouling of maritime vessels can aid sessile organisms to cross otherwise insurmountable marine barriers (Godwin 2003; Carballo et al. 2013), with long-range trade of aquacultures and related commercial goods also being a potential vector for the dispersal of invasive species (Henkel and Janussen 2011). Anthropogenic influence on species dispersal can also occur as a consequence of natural events, such as storms or tsunamis, creating floating debris of man-made structures as a hard substrate for the settlement of sessile organisms, and consequently expanding their potential dispersal range (Carlton et al. 2017).

Dispersal capabilities of other Indo-Pacific reef organisms

The dispersal capacities and biodiversity of other shallow water marine taxa in the Indo-Pacific can show a greater degree of variation compared to sponges, expressed in a variety of differing distribution patterns, which not always seem to directly translate from their effective potential dispersal ranges:

The Crown-of-Thorns starfish (Acanthaster planci Linnaeus), despite being widespread with a high dispersal range, tends to adhere to localized populations, with long-range expansions being comparatively uncommon (Timmers et al. 2012; Vogler et al. 2013; Pratchett et al. 2014). Populations in the South Pacific also show patterns of genetic isolation by distance between the Central and Eastern Indo-Pacific realms (Yasuda et al. 2009). Dispersal across the East Pacific Barrier appears to be possible according to molecular analyses, however sufficient data for this mode of migration is not yet available (Vogler et al. 2008; Haszprunar et al. 2017).

Scleractinian corals show similar distribution patterns to sponges in regard to the affinity to specific ecoregions, but their levels of endemism are magnitudes lower in comparison, while geographical ranges of coral species at the same time are vastly higher (Veron et al. 2015). While endemism of coral species in the Central and Eastern Indo-Pacific realms is low, Veron et al. (2015) also emphasize the clear ecological differentiation of both realms, which is further highlighted in other studies (e.g., Oury et al. 2021), but also note the lack of available data for the Eastern Indo-Pacific realm in particular. Coral communities of the Central and Eastern Indo-Pacific realms are generally isolated from the Tropical Eastern Pacific realm by the East Pacific Barrier, but with the possibility of rare long-range dispersal and settlement events (Romero-Torres et al. 2018).

Indo-Pacific reef fish show distributions very similar to those of corals, in that they display equally high or even higher levels of both species diversity and taxon overlap between ecoregions across the Central and Eastern Indo-Pacific realms. Maximum endemism levels of ~ 6% occur in those realms, supporting a hypothesis of high dispersal and colonization capability (Briggs and Bowen 2012, 2013; Kulbicki et al. 2013). Connectivity of shore fish species across the East Pacific Barrier is elevated in both directions in comparison to the low dispersal capabilities of benthic organisms across this barrier (Robertson et al. 2004).

In contrast to the observed distributions for fishes and corals, Meyer et al. (2005) showed, by using the example of marine gastropods, that fine-scale endemism of morphological species complexes on archipelago or even island scale may not be an uncommon occurrence in Indo-Pacific reef taxa, especially among those with limited larval dispersal capabilities. This appears to be the case for sponges in particular, as confirmed by the high rates of endemism and low numbers of shared species between regions found in this study. This finding is in agreement with earlier investigations of the sponge biogeography in the Indo-Pacific (Hooper and Lévi 1994), and is also evidenced in regional barcoding studies (Pöppe et al. 2010).

Conclusion

The sponge fauna of the Eastern Indo-Pacific realm is unique in displaying both high species diversity and high endemism. Species diversity appears to be increasing from French Polynesia westwards, with Marquesas being the most isolated and least diverse ecoregion. Wallis and Futuna, on the border to the Central Indo-Pacific ecoregions, displays among the highest levels of species richness and taxa shared with the adjacent realm, especially when taking in account the size of the regions investigated. A detailed look at the marine provinces also reveals differences in their biodiversity with some sponge orders being dominant in one region, but nearly absent in the others. These differences can be traced back to the local reef and island structure, the geographic distances between the studied regions, as well as the complex and seasonally impacted current systems. These factors have a large influence on the dispersal and settlement abilities of sponge taxa and are therefore responsible for the high rates of endemism and low numbers of shared taxa between marine provinces in this region, compared to other marine organisms. The Central Polynesian border region (e.g., Wallis and Futuna) plays a critical role in this regard, as it has a larger variety of those factors impacting biodiversity compared to the islands of French Polynesia, making it a “melting pot” of sponge species diversity between the two major marine realms of the Central and Eastern Indo-Pacific.