Optimising Sampling Strategies in Coral Reefs Using Large-Area Mosaics

: Adequately sampling benthic cover in marine ecosystems is a challenge with most methods encompassing only a small portion of the area for which cover is estimated. Recent advances in photogrammetric techniques are providing opportunity to map expansive areas of reef. This study aimed to evaluate the adequate level of sampling for traditional quadrat-based designs at ecologically relevant scales. We used large-area (~250 m 2 ), high-resolution (0.911 ± 0.143 mm / pixel) mosaics to assess the precision and reproducibility of quadrat-based benthic sampling and identify the most e ﬃ cient strategy (size and number of quadrats). There was a strong relationship between the percent cover of benthic classes and the level of sampling e ﬀ ort required to adequately sample them. As expected, larger quadrats were found to be more e ﬃ cient when sampling e ﬀ ort was expressed in number of quadrats. This study aims to identify the optimal level of sampling (least e ﬀ ort that would result in a given target precision) to characterize coral reef benthic communities (whatever they are) within each site. As such, the sites selected were intentionally very di ﬀ erent and together represented the broad scale of heterogeneity found in shallow coral reef communities. Abundance data can be used in combination with the relationships presented here to determine the optimal sampling protocols for management approaches to coral reef monitoring.


Introduction and Background
Coral reefs are receding globally [1]. This decline is largely driven by anthropogenic threats and climate change [2] and includes cascading effects such as habitat loss for associated species [3], shifts in assemblage composition-for instance from coral-dominated to algal-dominated reefs [4]-and changes to structural complexity [5] and coral cover. For these reasons, monitoring activities capable of sufficient precision have become critical to assess the impact of disturbances and detect changes in coral reef trajectories. Marine habitats must be rigorously quantified over ecologically relevant spatial scales to provide the data necessary to understand ecological processes and inform management actions [6,7] Effective, rapid, and robust quantitative techniques to assess benthic communities are of importance to marine ecologists as they provide the basis for ongoing monitoring and assessment of community dynamics. However, a common issue with sampling methods is the degree to which the sampling strategy delivers estimates that are adequate (accurate and precise) and representative (captures taxonomic diversity) assessments of the sample area. For instance, in the AIMS long-term monitoring program protocols, three sites per reef are surveyed with a sampling effort of five transects per site and 40 photographs per transect which is equivalent to 600 photographs per reef [8]. If a single photograph is considered to cover 0.25 m 2 (with the camera type used, focal length 28mm, photos taken 40cm above substrate), then it means 150 m 2 of reef are surveyed per reef. Given the high diversity and often non-random distribution of coral taxa across large areas of reef, representative sampling is a considerable challenge. Large-area surveys are then essential to gather information and to identify general patterns for monitoring ecosystems at relevant spatial scales [7,9]. Recent improvements in photogrammetry algorithms and computing power have notably increased the resolution of information that can be efficiently gathered over large areas of habitat. Recent studies have used structure from motion (SfM) software to build three-dimensional (3D) models of large reef areas [10][11][12] and a few have assessed their accuracy and precision [13][14][15]. Additionally, it is possible to extract a two-dimensional projection from 3D models as photomosaics or orthomosaics (when spatially referenced).
Photo-based and in-situ sampling are two key strategies for surveying the seafloor. The former approach is now more commonly used as it minimises time underwater, keeps a permanent record, removes the need for taxonomic expertise in the field, and can be used for automated image annotation [16]. However, even though the process of image collection is fast, post-processing associated with photo-based technique can be very time consuming. Time spent in post-collection image scoring/annotation varies depending on the identification method (e.g., point-intercept or total area estimates), photo resolution, habitat spatial heterogeneity, and the taxonomic level required [17].
One of the most common outputs of marine benthic imagery in coral reef monitoring is the estimation of the percent coverage of sessile, habitat-forming organisms and features [18,19]. Different sampling protocols have been implemented to convey representative and precise estimates of percent cover. However, in most studies, the selection process of such protocols is often unclear and/or statistically unsupported [7,9,20]. In most cases, sampling scale and density are based on previous studies, convention or arbitrary choices, despite the fact that scale is known to have a direct influence on benthic habitat mapping [21,22]. Some studies have investigated the effect of sampling design, particularly image sampling, on seascape metric estimations. For instance, Palma [9] tested two sampling strategies (nested and random quadrats) across a range of sampling scales (0.5 × 0.5 m, 2 × 2 m, 5 × 5 m, and 7 × 7 m) and densities (1-100 quadrats) within a 1655 m 2 coral reef area in Mozambique. They found that most key seascape metrics were more accurately identified by larger sample scales (≤ 5 × 5 m quadrats) and densities (≤ 30 quadrats). Perkins [18] used simulated data of benthic cover and provided guidance about the sampling effort required (number of images per transect and number of points per image) to minimise estimation bias. They found that increased precision was best achieved by using more images rather than more points per image. However, there is still a need to set standard guidelines to optimise virtual or field sampling strategies. We need a detailed and reproducible framework to adequately capture specific proportions of cover estimates with a given level of precision.
This study presents an objective and consistent method for setting a high-level of precision of cover estimates based on common sampling strategies. We specifically investigated how percent cover of a given benthic organism affects the corresponding sampling effort at a given level of precision. This question is addressed at an ecologically relevant scale by basing assessments from fully annotated mosaics of~250 m 2 areas of reef. The aim of the study is to identify the optimal level of sampling (least effort that would result in a given target precision) to characterise coral reef benthic communities (whatever they are) within each site sampled. As such, we intentionally selected sites that differed and together represented the broad scale of heterogeneity found in shallow coral reef communities. This study provides novelty in examining how differences in sampling strategies influence the accuracy and precision of percent cover estimates derived from in-situ data (i.e. not simulated).

Study Site and Image Acquisition
Data were collected in January 2016, at eight reef slopes around Heron Reef (23.26 S, 151.54 E), Great Barrier Reef, Australia: Big Tub (BT), Harry's Bommie (HB), Canyons (CA), Heron Bommie (HE), Pams Point (PP), Junction (JT), Blue Pools (BP), and Blue Pools East (BE) (Figure 1). Images were collected along one transect per site at depths of 5-10 m with transects following depth contours. Thirty-meter transects were mapped and modelled as per the methodology of Figueira et al. (2015) [15]. First, three visual references (30 cm-long uniquely coded targets) and two scale bars (50 cm PVC cylinders marked at 10 cm intervals) were deployed along each transect. These references are later used to scale the 3D models of the transects in the laboratory. Then three overlapping passes were done by SCUBA-divers with a custom mapping system flown 2-3m off the bottom. The mapping system consisted of three cameras (GoPro Hero4 Black, 12MP, Wide FOV, 4000 × 3000 pixels) equidistant apart (1 m), mounted on a 2 m long by 30 cm deep aluminium frames set to capture one image each per second. This protocol enabled a fast and non-destructive sampling of a 300 m 2 area of reef in approximately 7-10 min. This generated 1300-1800 images per transect from the three cameras, with an image footprint of approximately 5 × 4 m and 80% overlap in imagery [15]. Because only one transect was analysed per site, only the term 'site' will be used from this point. We aimed to identify optimal levels of sampling to characterise coral reef benthic communities within each site and not at reef level. As such, we intentionally selected sites that were very different and together represented the broad scale of heterogeneity found in shallow coral reef communities.
Remote Sens. 2019, 11, x FOR PEER REVIEW 4 of 22 generated 1300-1800 images per transect from the three cameras, with an image footprint of approximately 5 × 4 m and 80% overlap in imagery [15]. Because only one transect was analysed per site, only the term 'site' will be used from this point. We aimed to identify optimal levels of sampling to characterise coral reef benthic communities within each site and not at reef level. As such, we intentionally selected sites that were very different and together represented the broad scale of heterogeneity found in shallow coral reef communities.

Model and Mosaics Generation
Images were imported and processed with the software package Photoscan Professional (v. 1.3) [24] which uses structure from motion algorithms to determine the relative position of all cameras from which a high-resolution photo mosaic of the whole area can be built. The high-resolution large-area mosaics [11] were created using a process adapted from Figueira [15] with the specific parameter values summarised in Table 1. The resulting models were clipped to remove poorly resolved edges and scaled using the five scale features mentioned above. High

Model and Mosaics Generation
Images were imported and processed with the software package Photoscan Professional (v. 1.3) [24] which uses structure from motion algorithms to determine the relative position of all cameras from which a high-resolution photo mosaic of the whole area can be built. The high-resolution large-area mosaics [11] were created using a process adapted from Figueira [15] with the specific parameter  Table 1. The resulting models were clipped to remove poorly resolved edges and scaled using the five scale features mentioned above. High resolution (0.911 ± 0.143 mm/pixel) 2D mosaics were then built and exported in TIFF format processing using geospatial software.

Digital Annotation of Benthic Organisms
All eight mosaics were standardised to a total area of 250 m 2 (25 × 10 m) ( Figure 2A) except for one which was smaller (230 m 2 ) due to defects around edges. Organisms were manually digitised by delineating them with polygons using editor tools in ArcGIS (ArcMap 10.3) [25]. Each polygon was stored in a shapefile or annotation layer, per site, and assigned to a specific subclass ( Figure 2B). Cover area (m 2 ) was automatically calculated for each polygon within ArcMap. The national Australian standard classification scheme Collaborative Annotation Tools for Analysis of Marine Imagery and Video (CATAMI) [26], with modifications to fit this study, was used to describe the benthos. 'Reef matrix' was used as an umbrella class to encompass organisms smaller than 10 cm, aggregates of dead organisms, rock, and rubble. Organisms smaller than 10 cm in diameter were excluded; ignoring small benthic organisms did not affect the overall result, because these were typically smaller versions of the same benthic class and thus overall diversity would not be greatly affected. This approach would not address specific questions with respect to small benthic organisms such as coral recruits. Only sessile organisms were accounted for. This procedure enabled the classification and the calculation of surface cover area for each benthic class. The total area annotation process of all organisms within one mosaic required 4-24 h to complete, depending on benthos heterogeneity and diversity.

Testing Different Sampling Efforts
To determine how many quadrats or how much area must be sampled to adequately estimate percent cover within each site, we first generated virtual quadrats and then resampled the dataset for each site across a range of sample densities. We tested three sampling quadrat sizes, 0.5 × 0.5 m, 1 × 1 m, and 2 × 2 m, representing a commonly used range of sampling scales [8,9,18,19,27,28]. Virtual quadrats were created from three grids (fishnets) of cell sizes corresponding to each quadrat size placed on top of each annotated mosaic ( Figure 2C). Information about benthic organisms contained within each grid cell was then extracted by uniting the grids and the annotated mosaics (Analysis Toolbox > Union Tool). Datasets containing only full quadrats were loaded in R (v.3.3.3.) [29] and Remote Sens. 2019, 11, 2907 6 of 17 random virtual subsampling was undertaken with sample size set to the number of quadrats of each size required to sample 10, 20, 30, 40, 50, 60, 70, 80, 90, and 100% of the total area covered by the mesh (Table 2). For each mosaic and each quadrat size, this process was repeated 1000 times by selecting subsamples with replacement for each level of sample size. Subsampling with replacement was used as doing so without replacement creates a dependency amongst the datapoints (co-variance > 0) as the probability of selecting each becomes a function of the number of previously drawn points [30]. Relative standard error (RSE) amongst the 1000 samples was used to determine the precision of the cover area estimates for each class and each level of sample size. RSE was calculated as followed using SE and X, respectively standard error and mean cover of each benthic class Table 2. Correspondence between number of quadrats (NQuad) and total area sampled (m 2 ) by each quadrat sizes for each site. Only quadrats that fitted completely on the mosaics were included in the analyses and thus, the total area sampled is lower for larger quadrat sizes as it was more likely they would include some non-mosaic components.

Precision Level of Cover Estimates
For each combination of site, benthic class and quadrate size, the relationship between precision of cover estimates (relative standard error, RSE) and sample size (total area of site sampled) was evaluated to estimate the optimal sampling strategy. In all cases, this relationship followed a power function which approached an asymptotic RSE value (scaled by a) at a rate dictated by b The a and b coefficients were estimated in Statistica (v.12.1, [31]) for each combination of quadrat size and benthic cover class using the non-linear estimation tool with a least squares loss function. In all cases, the power curves were fit with very high significance (p < 0.0001) and parameter estimates with very high precision (5% RSE at maximum and 0.62% on average). To evaluate the best sampling strategy, it is first necessary to define some criteria related to the acceptable level of RSE. Here, we used the approach of identifying a point on the diminishing returns curve where the improvement in RSE indicated that extra sampling was not warranted. This was done by assessing the value of sample size (on the x-axis) at which the slope (first derivative of y) of the power function exceeded a specific criterion (i.e., it got too shallow). The exact threshold value for this slope was determined by evaluating the first derivative curves of all relationships and choosing a slope threshold that was as close to zero as possible but which would intersect all curves within the 0-100% range (see Figure 3 for Remote Sens. 2019, 11, 2907 7 of 17 an example). In this case, the value −0.001 was chosen and this represents the target change in RSE per % area where further sampling would not result in valuable gain of precision. It is important to note that one could decide on a different value for this slope or even a different criterion altogether (sample effort required to achieve a specific RSE for instance). What is critical is that a common criterion is used against which to evaluate all sites, benthic classes, and quadrat size combinations.

Effect of Quadrat Type, Benthic Class, and Cover on Target Sample Size
For all the following analyses, data were pooled across all sites. Three analyses were conducted to assess the factors that drive differences in percent area (target-%Area) and number of quadrats (target-NQuad). Even though the metric 'number of quadrats' seems redundant with area covered, and although it is expected that larger quadrats provide higher precision when number of quadrats is used as the metric for sampling effort [32], this has never been tested for coral reef monitoring programs and with in-situ data. Therefore, in this study, sampling effort is expressed in both percent area and number of quadrats. First, the effect of benthic class and quadrat size on the target-%Area was evaluated using a one-factor (quadrat size) ANOVA for each benthic class separately. This process was then repeated using target-NQuad as the response variable. Note, we did not attempt to include benthic class in a two-factor analysis given the very high number of levels and the extreme imbalance this would bring to the design. Only benthic classes which occurred in at least two of the eight sites for a given quadrat size were included. Assumptions of Based on the methodology above, we calculated the proportion of the total area that needs to be sampled to hit the target change in RSE for every combination of site, benthic class, and quadrat size. We converted this into number of quadrats based on the size of the quadrat and the mosaic involved. These values, both represent the target sample size and are hereafter referred to as target-%Area and target-NQuad respectively.

Effect of Quadrat Type, Benthic Class, and Cover on Target Sample Size
For all the following analyses, data were pooled across all sites. Three analyses were conducted to assess the factors that drive differences in percent area (target-%Area) and number of quadrats (target-NQuad). Even though the metric 'number of quadrats' seems redundant with area covered, and although it is expected that larger quadrats provide higher precision when number of quadrats is used as the metric for sampling effort [32], this has never been tested for coral reef monitoring programs and with in-situ data. Therefore, in this study, sampling effort is expressed in both percent area and number of quadrats. First, the effect of benthic class and quadrat size on the target-%Area was evaluated using a one-factor (quadrat size) ANOVA for each benthic class separately. This process was then repeated using target-NQuad as the response variable. Note, we did not attempt to include benthic class in a two-factor analysis given the very high number of levels and the extreme imbalance this would bring to the design. Only benthic classes which occurred in at least two of the eight sites for a given quadrat size were included. Assumptions of normality and homoscedasticity were assessed visually by plotting residuals and tested with Shapiro test and Levene's test, respectively. Where a significant difference between quadrat size was found, Tukey's HSD post-hoc test was run to determine which quadrat sizes were significantly different from each other.
Secondly, we tested the hypothesis that differences in target-%Area and target-NQuad may have more to do with the percent cover of a given benthic class rather than the identity of the benthic class itself. This was evaluated using a generalised linear model (in Statistica v12.1; [31]) with either target-%Area or target-NQuad as the response variable and two predictor variables, benthic class cover (continuous) and quadrat size (categorical, fixed, three-levels). Normality was checked visually, and homogeneity of variance assessed using Cochran's test. Tukey's HSD post-hoc test was used to determine which quadrat sizes were significantly different from each other.
Lastly, we investigated how the best sampling strategy proposed in this study affects how much of the taxonomic richness of an area would be captured reliably. This was evaluated by plotting the target sample size (area sampled for each quadrat size) against the proportion of benthic classes that would be adequately (according to the precision criteria given above) captured by following this strategy.

Results
Eight mosaics, with resolution of 0.911 ± 0.143 mm/pixel, were built, annotated within a standardised area of 250 m 2 and used to quantify cover estimates of benthic classes larger than 10 cm in diameter. The aim of this study was to identify optimal levels of sampling to characterise coral reef benthic communities within each site (and not at reef level). As such, the study sites were different and together represented the broad scale of heterogeneity found in shallow coral reef communities.

Percent Cover and Precision Target
A total of 42 benthic classes were identified and organised into eight coarse categories: abiotic, hard coral, soft coral, macroalgae, molluscs, dead coral, bleached coral, and unknown ( Table 3). The coarse category 'abiotic' describes sandy areas, the rocky matrix and cement platforms or eroded surfaces that cannot be distinguished as dead organisms. The term 'macroalgae' encompasses algae patches larger than 10 cm in diameter and does not include turf algae. 'Dead coral' represents coral that recently died and are still morphologically identifiable with higher complexity than the rocky matrix. Four out of 42 benthic classes; reef matrix, staghorn, bleached staghorn, and dead staghorn; had the highest mean percent covers from 40.6% to 15.5%. Ten benthic classes showed medium to low mean percent cover from 5.4% to 1%, and 28 were low (<1%) ( Table 3).

Effect of Sampling Strategy and Benthic Class on Target Sample Size
When sample size was expressed as percent area, there was a trend for the smallest quadrat size to correspond to the smallest sampling size required for all benthic classes. However, variability in percent cover was high and this was only significant for the benthic classes staghorn, tabulate, dead mix, and dead staghorn (Table A1, Figure 4). Tukey's HSD post hoc test revealed that 2 × 2 m quadrats were always significantly different from 0.5 × 0.5 m for staghorn (p = 0.038), tabulate (p = 0.031), dead mix (p = 0.004) and dead staghorn (p = 0.007); but none was different from 1 × 1 m for those classes (p > 0.05). Within plot with n(classes)>2, classes are arranged by mean total percent cover, from lowest on the left to highest on the right. The boxplots show the first, second (median) and third quartiles. The upper whisker extends from the hinge to the largest value no further than 1.5 × IQR from the hinge (where IQR is the inter-quartile range, or distance between the first and third quartiles). The lower whisker extends from the hinge to the smallest value at most 1.5 × IQR of the hinge. Circles and crosses represent outliers and means, respectively.
When sample size was expressed as number of quadrats, there was a significant effect of quadrat size for all benthic classes except bleached staghorn (Table A1, Figure 5). The number of quadrats decreased when quadrat sizes increased for all classes, though Tukey's HSD post hoc test revealed the differences were only significant between quadrat size 0.5 × 0.5 m and 1 × 1 m and between 0.5 × 0.5 m and 2 × 2 m. There was no difference between 1 × 1 m and 2 × 2 m for all benthic classes.
Apparent patterns are highlighted in Figure 5 which shows differences in target sampling effort between the different classes. Although differences were not statistically evaluated, there were clear patterns where certain classes had a low target effort while it was higher for others. For instance, reaching the target precision required a low sampling effort (<10% Area) for reef matrix (Reef), staghorn (Stag) and dead staghorn (Stag(D)) Figure 5A-C), regardless of quadrat size. Intermediate sampling effort (20-50% Area) was required for most classes and high sampling Within plot with n(classes)>2, classes are arranged by mean total percent cover, from lowest on the left to highest on the right. The boxplots show the first, second (median) and third quartiles. The upper whisker extends from the hinge to the largest value no further than 1.5 × IQR from the hinge (where IQR is the inter-quartile range, or distance between the first and third quartiles). The lower whisker extends from the hinge to the smallest value at most 1.5 × IQR of the hinge. Circles and crosses represent outliers and means, respectively.
When sample size was expressed as number of quadrats, there was a significant effect of quadrat size for all benthic classes except bleached staghorn (Table A1, Figure 5). The number of quadrats decreased when quadrat sizes increased for all classes, though Tukey's HSD post hoc test revealed the differences were only significant between quadrat size 0.5 × 0.5 m and 1 × 1 m and between 0.5 × 0.5 m and 2 × 2 m. There was no difference between 1 × 1 m and 2 × 2 m for all benthic classes.
Apparent patterns are highlighted in Figure 5 which shows differences in target sampling effort between the different classes. Although differences were not statistically evaluated, there were clear patterns where certain classes had a low target effort while it was higher for others. For instance, reaching the target precision required a low sampling effort (<10% Area) for reef matrix (Reef), staghorn (Stag) and dead staghorn (Stag(D)) Figure 5A-C), regardless of quadrat size. Intermediate sampling effort (20-50% Area) was required for most classes and high sampling effort (>50% Area) was required to capture shell (Shel), algae (Alga), and dead bottlebrush (Botb(D)) ( Figure 5B,C). effort (>50% Area) was required to capture shell (Shel), algae (Alga), and dead bottlebrush (Botb(D)) ( Figure 5B, C). Within plot with n(classes)>2, classes are arranged by mean total percent cover, from lowest on the left to highest on the right. The boxplots show the first, second (median) and third quartiles. The upper whisker extends from the hinge to the largest value no further than 1.5 × IQR from the hinge (where IQR is the inter-quartile range, or distance between the first and third quartiles). The lower whisker extends from the hinge to the smallest value at most 1.5 × IQR of the hinge. Circles and crosses represent outliers and means, respectively.

Effect of Sampling Strategy and Cover on Target Sample Size
Overall, percent cover had a greater effect on sampling density than the identity of the benthic class itself. Indeed, low cover classes (i.e., rare, <1% mean cover) required a rather high level of sampling, from 40% up to 65% of the total area sampled, to be adequately represented. In contrast, organisms that had high coverage across the study area (i.e., 15-40% mean cover) only required a level of sampling ≤30% of the total area to be adequately captured (Figures 5 and 6). Such conspicuous organisms would act as good candidates for long-term monitoring as the sampling effort required to gain accurate and precise estimates of their coverage Within plot with n(classes)>2, classes are arranged by mean total percent cover, from lowest on the left to highest on the right. The boxplots show the first, second (median) and third quartiles. The upper whisker extends from the hinge to the largest value no further than 1.5 × IQR from the hinge (where IQR is the inter-quartile range, or distance between the first and third quartiles). The lower whisker extends from the hinge to the smallest value at most 1.5 × IQR of the hinge. Circles and crosses represent outliers and means, respectively.

Effect of Sampling Strategy and Cover on Target Sample Size
Overall, percent cover had a greater effect on sampling density than the identity of the benthic class itself. Indeed, low cover classes (i.e., rare, <1% mean cover) required a rather high level of sampling, from 40% up to 65% of the total area sampled, to be adequately represented. In contrast, organisms that had high coverage across the study area (i.e., 15-40% mean cover) only required a level of sampling ≤30% of the total area to be adequately captured (Figures 5 and 6). Such conspicuous organisms would act as good candidates for long-term monitoring as the sampling effort required to gain accurate and precise estimates of their coverage would be more efficient. These findings were supported when the percent cover of all organisms was plotted against sampling density ( Figure 6).
Significant differences in sampling size for both metrics were found between quadrat sizes. The effect of quadrat size was significant considering either number of quadrats (F 2567 = 1298.2, p < 0.001) or percent area (F 2267 = 63.8, p < 0.001) as was the effect of percent cover (F NQuad (1567) = 1402.8, p NQuad < 0.001; F %Area (2567) = 2044.3, p %Area < 0.001). The interaction of these factors was significant when sample size was expressed as number of quadrats (F 2567 = 691.9, p < 0.001) but not for the percent area. Post-hoc Tukey's HSD tests revealed that in both cases, all quadrat sizes were significantly different from each other (p < 0.0001). However, a visual comparison of sample size means shows that, when using percent area, the difference between quadrat sizes was subtle (mean response varies by about 10%) and uniform over percent cover. When using the number of quadrats as the measure of sampling effort however, the effects of quadrat size were more clear-cut (mean responses varied by up to 45%) and were strongly dependent on the percent cover of the benthic class identified, this was most pronounced for the less abundant classes. would be more efficient. These findings were supported when the percent cover of all organisms was plotted against sampling density (Figure 6). Figure 6. Relationship between percent cover and target sample size. Covers are displayed on a logaxis against target sample size expressed as either target-%Area (A) or target-NQuad (B).
Significant differences in sampling size for both metrics were found between quadrat sizes. The effect of quadrat size was significant considering either number of quadrats (F2567 = 1298.2, p < 0.001) or percent area (F2267 = 63.8, p < 0.001) as was the effect of percent cover (FNQuad(1567) = 1402.8, pNQuad < 0.001; F%Area(2567) = 2044.3, p%Area < 0.001). The interaction of these factors was significant when sample size was expressed as number of quadrats (F2567 = 691.9, p < 0.001) but not for the percent area. Post-hoc Tukey's HSD tests revealed that in both cases, all quadrat sizes were significantly different from each other (p < 0.0001). However, a visual comparison of sample size means shows that, when using percent area, the difference between quadrat sizes was subtle (mean response varies by about 10%) and uniform over percent cover. When using the number of quadrats as the measure of sampling effort however, the effects of quadrat size were more clear-cut (mean responses varied by up to 45%) and were strongly dependent on the percent cover of the benthic class identified, this was most pronounced for the less abundant classes.
These findings suggest that it is possible to determine how much area should be sampled with minimal information about abundance, prior to the survey campaign. For instance, to adequately (meet the target precision) sample 55% of all benthic classes present at the study site, the area sampled needs to be equivalent to 60 m 2 ,

Discussion
We used high-resolution large-area mosaics to assess the accuracy and precision of traditional photo-based benthic sampling methods in coral reefs. This study did not intent to characterise the benthos at reef level. The aim of this study was to identify the optimal level of sampling (highest target precision for the least effort) to characterise coral reef benthic communities (whatever they are) within each site. The study sites were intentionally selected to be different and together represented the

Discussion
We used high-resolution large-area mosaics to assess the accuracy and precision of traditional photo-based benthic sampling methods in coral reefs. This study did not intent to characterise the benthos at reef level. The aim of this study was to identify the optimal level of sampling (highest target precision for the least effort) to characterise coral reef benthic communities (whatever they are) within each site. The study sites were intentionally selected to be different and together represented the broad scale of heterogeneity found in shallow coral reef communities. The results revealed no significant difference in sample size whether the metric used was percent area or number of quadrats, but there was a significant difference between quadrat size. The 2 × 2 m quadrat size was the most efficient (or optimal according to this study) because it corresponded to the least effort (number of quadrats or area sampled, see Figure 7)). Overall, the most abundant benthic classes required the lowest sampling effort to deliver reliable (high precision) cover estimates. These results suggest that sampling effort is related to percent cover and thus provide an estimate of the level of sampling required to capture specific proportions. Existing data on abundance can be used in combination with the relationships presented here as a basis for determining the optimal sampling protocols for ongoing coral reef monitoring. Because this study aimed at generating relevant advice to practitioners, we chose to subsample from the maximum area covered by full quadrats. It is important to mention that while it might have been better to subsample smaller quadrats from within the larger quadrat size this would have not reflected real world monitoring. It is however important to keep in mind this while interpreting the results, in short, we wanted to address our research question from a sampling perspective and find what is optimal for each quadrat size.
The choice of sampling strategy for photoquadrat-based methods has been assessed in previous studies. These studies suggest that large quadrat sizes are more efficient [9] and when field time is limited, larger quadrat sizes should be used [33]. Importantly, we found the use of larger quadrat size appropriate when sampling effort is expressed as number of quadrats. In this case, more area can be covered by using larger quadrats. However, larger quadrats can come at the cost of data resolution. For example, fewer points per area in the case of in-situ point counts (if the number of points is conserved throughout quadrat sizes then resolution is lessened) or lower image resolution for photoquadrats (as the picture will be taken from a distance). In situations where it is desired to maintain resolution, sampling effort will scale with percent area rather than number of quadrats (e.g., need to take more photos at larger 'quadrat' sizes). In this situation, our work suggests smaller quadrat sizes are in fact more efficient (higher resolution) because they correspond to the highest precision. Ultimately, the choice of sampling strategy will depend on the spatial distribution of the target biota and how overall effort is tied to number of replicates versus total area. Therefore, the key factor will be the relative cost of field time versus annotation/scoring time. For in-situ photo-based sampling, field costs can be minimised by sampling fewer larger quadrats, but this will result in lower resolution photos to cover the larger area. To increase image resolution, multiple photos are needed to cover the larger area and thus time-cost in field is not reduced. Analysis cost could also be minimised with fewer larger quadrats but only if another cover estimation method than total area annotation is used, for example using point counts with low point densities per quadrat. However, using less quadrats or low point densities per quadrat will result in lower precision of estimates. One affordable and time-saving solution would be to use more than one camera at a time as in this study. Another solution would be to target taxa groups with dominant coverage in the system that can be monitored at lower resolution [18].
To date, the main limitations of traditional sampling approaches consist in the fact that they have been restricted to relatively small spatial scales because of the high-cost associated with getting high-resolution data over large areas [6,34]; or they used remote sensing techniques resulting in coarser spatial resolution data provision [35]. The application of close-range photogrammetry based on SfM algorithms presented here, along with techniques recently developed (e.g., autonomous and semi-autonomous underwater vehicles), can help fill the need for wider areal coverage (kilometre scales) and high spatial resolution (meter to sub-centimetre scale) data provision [5,11,12,36] to ensure current and comprehensive benthic composition mapping [37]. Moreover, to overcome the sampling limitation of assessing the true value of the parameter being quantified for a given area, high-resolution large-area imaging allows the acquisition of data (high-resolution digitisation of cover) that can be used as a baseline or 'truth' against which cover estimates would be compared [20].
In the future, repeated measurements of the same sites can be done to assess whether the sampling strategies presented here are able to detect changes in coral cover over time. More interesting investigations involving the use of this method would be to take random sample of the reef studied and assess the interaction between the number of quadrats required and the number of sites required to sample different patches of the reef. The use of large-area mosaics proved to be a promising tool for accessing high-resolution percentage cover estimates over a range of spatial and temporal scales and it is compatible with existing photoquadrat methods. This can then serve as the basis for developing new or alternate sampling designs and would allow a gradual transition from one to the other for coral reef monitoring. For instance, the approach presented here could be applicable to optimise point count as it is also quadrat-based sampling, only percent area is estimated from point intercept rather than tracing. The likely outcome of this would be increased target sample sizes (more quadrats needed to hit target) because the variance between subsamples will be greater due to the extra variance from another level of sampling within the quadrats (random placement of points). This study provides novelty in examining how differences sampling strategies influence the accuracy and precision of percent cover estimates derived from in-situ data (i.e. not simulated). Given the decline projections for coral reef trajectories, future sample points are more likely to present homogeneous communities and lower percent cover and thus, the guidelines set in this study and the importance to define adequate precision level will still be coherent to represent future communities as the work presented here specifically highlighted the effect of cover of taxa.

Conclusions
The use of underwater imagery as a tool for benthic monitoring is widespread. Here we provide guidance to scientists and managers conducting such surveys in quadrat-based sampling strategies that would deliver accurate and precise cover estimates. Results show that each of the benthic classes has different optimal sampling sizes to be adequately sampled within a given framework. For example, a sampling size equivalent to 60 m 2 of the total area sampled by all quadrats is needed to provide representative cover estimates of 55% of all benthic features at the target precision. Pilot study data on abundance can be used in combination with the relationships presented here as a basis to determine the optimal sampling protocols for ongoing coral reef monitoring. Funding: Financial support for field sampling was provided by a grant to M.B., W.F., and R.F. from the Great Barrier Reef foundation. Computing support for mosaic building was provided by grants to W.F., M.B., and R.F. from the University of Sydney, Faculty of Science Research Equipment and Infrastructure Scheme.