Addressing initialisation uncertainty for end-to-end ecosystem models: application to the Chatham Rise Atlantis model

Vidette L. McGregor; Elizabeth A. Fulton; Matthew R. Dunn

doi:10.7717/peerj.9254

Addressing initialisation uncertainty for end-to-end ecosystem models: application to the Chatham Rise Atlantis model

Vidette L. McGregor ¹, Elizabeth A. Fulton², Matthew R. Dunn¹

1Fisheries, National Institute of Water and Atmospheric Research Ltd, Wellington, New Zealand

2Oceans & Atmosphere, CSIRO, Hobart, TAS, Australia

DOI: 10.7717/peerj.9254

Published: 2020-06-03
Accepted: 2020-05-08
Received: 2019-09-18

Academic Editor: Andrew Robinson

Subject Areas: Aquaculture, Fisheries and Fish Science, Ecosystem Science, Mathematical Biology, Statistics, Population Biology
Keywords: Chatham Rise, Initial conditions, Chaos, Stability, Atlantis, Ecosystem model, Deep sea, Model validation, End-to-end, Sensitivity analysis

Copyright: © 2020 McGregor et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: McGregor VL, Fulton EA, Dunn MR. 2020. Addressing initialisation uncertainty for end-to-end ecosystem models: application to the Chatham Rise Atlantis model. PeerJ 8:e9254 https://doi.org/10.7717/peerj.9254

The authors have chosen to make the review history of this article public.

Abstract

Ecosystem models require the specification of initial conditions, and these initial conditions have some level of uncertainty. It is important to allow for uncertainty when presenting model results, because it reduces the risk of errant or non-representative results. It is crucial that model results are presented as an envelope of what is likely, rather than presenting only one instance. We perturbed the initial conditions of the Chatham Rise Atlantis model and analysed the effect of this uncertainty on the model’s dynamics by comparing the model outputs resulting from many initial condition perturbations. At the species group level, we found some species groups were more sensitive than others, with lower trophic level species groups generally more sensitive to perturbations of the initial conditions. We recommend testing for robust system dynamics by assessing the consistency of ecosystem indicators in response to fishing pressure under perturbed initial conditions. In any set of scenarios explored using complex end-to-end ecosystem models, we recommend that associated uncertainty analysis be included with perturbations of the initial conditions.

Introduction

Atlantis models are end-to-end ecosystem models that can be used to model everything in an ecosystem from sunlight to market, including feed-back loops and flow-on effects (Audzijonyte et al., 2017a). There are many published Atlantis models (Link, Fulton & Gamble, 2010; Ainsworth et al., 2011; Ainsworth, Schirripa & Morzaria-Luna, 2015; Fulton et al., 2011; Weijerman et al., 2014; Sturludottir et al., 2018; Ortega-Cisneros, Cochrane & Fulton, 2017; Porobic et al., 2019; McGregor et al., 2019b), and sensitivities and uncertainties are beginning to feature in the more recently published work (Ortega-Cisneros, Cochrane & Fulton, 2017; Sturludottir et al., 2018, Ortega-Cisneros et al., 2018; Hansen et al., 2019a, 2019b; McGregor et al., 2019b).

An important step that has not yet been addressed for Atlantis models, or any end-to-end ecosystem models, is to address initialisation uncertainty. In a review paper, Payne et al. (2015) found generally marine ecosystem models have not explicitly addressed uncertainty of initialisation, and more recently, Hansen et al. (2019a) noted it is not something that has been done for Atlantis models. Payne et al. (2015) speculated as to the likely effects of initialisation uncertainty in end-to-end models such as Atlantis, noting long-lived species might dampen effects, and short-lived species may amplify them. While accounting for uncertainties of such complex models is no small task, it is possible to focus on a sub-component of model dynamics rather than the entire model. Atlantis models are particularly well suited for this approach as they are structured using components. These components dynamically interact, and they define functionality for the physical, biological, fisheries, harvest and economic components of the system (Audzijonyte et al., 2017b). Rather than vary the initial conditions for thousands of spatially explicit parameters for the entire ecosystem, we focus this study on the biological sub-component and explore sensitivity to the initial conditions of species groups. These will have flow-on effects to the whole system, but the analyses focus on the effects as they relate to population dynamics of the modelled species.

Ecological theory has featured stability, chaos, the importance of initial conditions, and how these relate, although definitions seem to vary. The most commonly known attribute of chaotic dynamics is sensitivity to initial conditions, often referred to as the butterfly effect. In a system with chaotic dynamics, a very small change in its initial state will result in a different trajectory (Baker, Baker & Gollub, 1996). Stability often relates to variability, persistence and robustness of a system (Tilman, 1996; Ives, Gross & Klug, 1999). Multiple studies have investigated characteristics of an ecosystem that are linked with stability, both from a theoretical perspective, and from observation. May (1972) showed mathematically, that large complex systems with high levels of diversity are unstable. However, there seem to be exceptions to this rule, as later studies have shown. Roberts (1974) argued that most systems in practice appear to be more stable with more connections—contrary to the mathematical analysis of May (1972). Roberts (1974) showed if only feasible solutions are included in the analyses, such that no species may have a negative population, larger systems are actually more stable. May (1975) examined non-linear difference equations with respect to chaotic, cyclic and stable biological dynamics. Other aspects subsequently shown to increase stability of ecosystems include negative pairwise correlations (Tang & Allesina, 2014), species dispersal (Allesina & Tang, 2012), modularity (subsets of closely connected components) (Grilli, Rogers & Allesina, 2016), predator-prey relationships (Tregonning & Roberts, 1979), a high proportion of weak interactions (Olsen et al., 2016) and spatial structure (Fulton, 2001).

The Chatham Rise Atlantis model was selected for this study as it has been shown to have sound dynamics with respect to our current knowledge of the system (McGregor et al., 2019b), has been tested for sensitivity to its ocean forcing variables (McGregor et al., 2019b) and spawning stock recruitment assumptions (McGregor, Fulton & Dunn, 2019a), but is yet to be tested with respect to its initial conditions. The initial conditions for the Chatham Rise Atlantis model were specified to reflect the ecosystem in its unfished, or virgin state. We have varying levels of understanding of the components of this ecosystem in its unfished state, and as such, there are varying levels of confidence around the estimates for the initial conditions, with all components having some level of error.

This study presents an approach for addressing and exploring sensitivity to initial conditions of the biological sub-component of an end-to-end ecosystem model, with application to the Chatham Rise Atlantis model. We go beyond the question of whether the model is sensitive to its initial conditions, and analyse what features of the Chatham Rise ecosystem, and how we have modelled it, affect this answer. We discuss the likely impacts of our findings for future use of this ecosystem model. We highlight areas of potential future research with respect to model development, and to support decisions relating to the sustainable use of the Chatham Rise marine ecosystem resources.

Methods

The Chatham Rise Atlantis model is spatially defined using 24 dynamic polygons, and five water column depth layers. Species are modelled using 55 species groups, which include species of bacteria, detritus, phyto-plankton, invertebrates, fish, sharks, cetaceans and birds. Some species groups were modelled as biomass pools, and some with age-structure, using numbers-at-age and mean weight-at-age. For many of the species, we have estimates of biomass, growth rates, age of maturity, natural mortality, spatial distributions and diets, although some species have more knowledge gaps than others. McGregor et al. (2019b) characterised the species groups by keystoneness, responsiveness, and informance, and brought these together to highlight which data gaps are likely to influence model results. We can use these attributes to perturb the initial conditions in a meaningful way based on likely uncertainties, and they may add context when analysing the results. Keystoneness measures the effect changes in biomass of a species group has on the rest of the system; responsiveness measures how responsive a species group is to changes in biomass of other species groups within the system. Informance was a qualitative measure used to reflect both how well informed each species group was, and how well it performed in the model. It considered whether key dynamics such as growth, mortality rates, diets, and responses to fishing were all realistic based on current knowledge of these dynamics.

Sensitivity to initial conditions is characterised by two criteria in this study: (1) persisting variability between simulations, assessed at the species group level; (2) consistency of responses to fishing pressure, both at the species group level and at the system level. Variability of species biomass between simulations with perturbed initial conditions is important to consider as it may affect conclusions drawn from the model, and can provide error bounds around results. The consistency of responses to fishing pressure has also been considered as it is often the direction of a response that is of interest when exploring scenarios using ecosystem models. For example a conclusion that increasing fishing by 20% reduced diversity by 5–15% (with the range presented perhaps reflecting the initialisation uncertainty) is more informative than whether simulated biomasses of individual species groups varied across runs by less than 20%.

The analyses presented here were carried out in three main sections: (1) components of the modelled system were characterised with respect to attributes that may affect sensitivity to initial conditions; (2) the initial conditions were perturbed, and the resulting model simulations were compared; (3) correlations between component attributes and responses to perturbations of the initial conditions were analysed, thus linking the first two sections.

The models were run on a Linux cluster using Moab job scheduler, and used Atlantis version 6262M (trunk). We created a new initialisation.nc file for each set of perturbed initial conditions, and these were created using R (version 3.6.1) to read in the base initialisation file, scale the values, then create the new initialisation file. R was also used to create run files that interacted with Moab to schedule the model runs. Scripts used for this work are available on GitHub (McGregor, 2018) (https://github.com/mcgregorv/CRAM_chaos). The R scripts were all developed in a Windows environment.

Varying initial conditions

We varied the initial conditions for the number-at-age variables of age-structured species groups, and the biomass of biomass-pool species groups (Tables 1 and 2). For the biomass-pool species groups, biomass is the only option to perturb. Age-structured species groups could have errors in the specification of numbers-at-age and/or size-at-age, both of which affect the biomass-at-age. In a stock assessment model, size-at-age (or growth rates) are generally the same with respect to time, whereas there is more likely a difference in numbers with respect to time (especially before fishing compared to after fishing). Hence, a different virgin biomass in a stock assessment model would generally be made up of a different number of fish, rather than the same number of fish but a different size. To align with this, we perturbed numbers rather than size for the age-structured species groups. The resulting number of variables to perturb was 361, of which 341 were numbers-at-age (number of age-classes for a species group ranged from 2 to 10, with 10 being most common), 18 were the nitrogen content of biomass-pool species groups, and two were the silicate content of biomass-pool species (diatoms and microphytobenthos). The numbers-at-age of all age-classes for a given species group were scaled by the same amount for each simulation, such that the proportions-at-age were preserved. Again, this aligns most closely with the structure of a population’s virgin state in a stock assessment. The assumed M (instantaneous natural mortality) is preserved through the proportions-at-age based on an exponential decay curve, and the total numbers are what change when varying the initial conditions. Age-structured species groups were modelled with between 2 and 10 age-classes, and were perturbed by applying one scalar for all age-classes of a given species group. This significantly reduced the number of scalars required to 57, of which 37 were for scaling numbers-at-age for age-structured species groups.

Table 1:

List of age-structured species groups for CRAM (McGregor et al., 2019b).

‘Keystone’ is the keystone ranking from McGregor et al. (2019b), where 1 is the highest. ‘Informance’ is the informance rating from McGregor et al. (2019b) where 1 is the highest (‘No data gaps, performed well, abundance index available’) through to 4 which is the lowest (‘Poorly specified’). Ben: benthic; Dem: demersal; invert: in-vertivore; pisc: piscivore; Invert comm: commercial invertebrates; herb: herbivore; scav: scavenger.

Specices group	Main species	Keystone	Informance
Arrow squid	Arrow squid	19	3
Baleen whales	Southern right whales (Eubalaena australis)	25	4
Basketwork eel	Basketwork eels (Diastobranchus capensis)	18	3
Baxters dogfish	Baxter’s dogfish (Etmopterus baxteri)	28	2
Ben fish deep	Four-rayed rattail (Coryphaenoides subserrulatus)	35	2
Ben fish shal	Oblique banded rattail (Coelorinchus aspercephalus)	5	1
Black oreo	Black oreo (Allocyttus niger)	26	2
Bollons rattail	Bollons’ rattail (Caelorinchus bollonsi)	16	1
Cephalopod other	Squid & octopus	14	2
Cetacean other	Primarily sperm & pilot whales & dolphins	9	4
Dem fish pisc	Giant stargazer (Kathetostoma giganteum)	31	3
Elasmobranch invert	Primarily skates & dogfish	32	1
Elasmobranch pisc	Primarily semi-pelagic sharks	17	2
Epiben fish deep	Spiky oreo (Neocyttus rhomboidalis)	22	2
Epiben fish shal	Common roughy (Hoplostethus atlanticus)	8	2
Ghost shark	Dark ghost shark (Hydrolagus novaezealandiae)	30	2
Hake	Hake (Merlucciidae australis)	10	1
Hoki	Hoki (Macruronus novaezelandiae)	1	1
Invert comm herb	Paua (Haliotidae) & kina (Evechinus chloroticus)	21	2
Invert comm scav	Primarily scampi & crabs	33	2
Javelinfish	Javelinfish (Coelorinchus australis)	27	1
Ling	Ling (Genypterus blacodes)	11	2
Lookdown dory	Lookdown dory (Cyttus traversi)	20	1
Mackerels	Slender jack mackerel (Trachurus murphyi)	15	2
Orange roughy	Orange roughy (Hoplostethus atlanticus)	2	1
Pelagic fish lge	Southern bluefin tuna (Thunnus thynnus)	29	2
Pelagic fish med	Barracouta (Thyrsites atun)	7	3
Pelagic fish sml	Myctophids (Myctophidae)	4	3
Pinniped	NZ fur seal (Arctocephalus forsteri)	36	4
Reef fish	Blue cod (Parapercis colias)	24	2
Rock lobster	Rock lobster (Jasus edwardsii)	37	3
Seabird	Seabirds & shorebirds	6	4
Seaperch	Seaperch (Helicolenus spp.)	34	2
Shovelnosed dogfish	Shovelnosed dogfish (Deania calcea)	12	2
Smooth oreo	Smooth oreo (Pseudocyttus maculatus)	23	2
Spiny dogfish	Spiny dogfish (Squalus acanthias)	3	2
Warehou	Silver, white & blue warehou (Seriolella spp.)	13	3

DOI: 10.7717/peerj.9254/table-1

Table 2:

List of species groups modelled as biomass-pools for CRAM (McGregor et al., 2019b).

Zoo: zooplankton.

Species group	Description
Benthic Carniv	Benthic carnivores
Carniv Zoo	Planktonic animals (size 2–20 cm)
Carrion	Dead and decaying flesh
Deposit Feeder	Detritivores and benthic grazers
Diatoms	Diatoms (large phytoplankton)
DinoFlag	Dinoflagellates
Filter Other	Non-commercial benthic filter feeders
Gelat Zoo	Salps, ctenophores, jellyfish
Labile detritus	Organic matter that decomposes at a fast rate
Macroalgae	Macroalgae
Meiobenth	Benthic organisms (size 0.1–1 mm)
MesoZoo	Planktonic animals (size 0.2–20 mm)
Microphytobenthos	Unicellular benthic algae
MicroZoo	Heterotrophic plankton (size 20–200 μm)
Pelagic bacteria	Pelagic bacteria
Pico-phytoplankton	Small phytoplankton
Refractory detritus	Organic matter that decomposes at a slow rate
Sediment bacteria	Sediment bacteria

DOI: 10.7717/peerj.9254/table-2

Initially, we perturbed all species group initial conditions using the same scalar for all variables within each model run. The scalars we used were 0.5, 0.8, 0.9, 0.95, 1.05, 1.1, 1.2, 1.5. These scalars were chosen to cover a range from slight (±5%) to extreme (±50%) errors in the initial conditions.

As shifting initial conditions by the same amount may not give an indication as to how robust or sensitive the model is to mis-specification of the initial conditions where changes could vary in direction and magnitude, we next simulated multiple model runs, with the initial conditions scaled with some random variability. We scaled the initial conditions of each variable, sampling the scalar for each from a normal distribution, N (0, σ) with σ chosen based on how large we assumed a plausible change could be.

In total, we ran three sets of simulations, and repeated each set with and without fishing.

Set 1: All up or down. All species group initial conditions were scaled (numbers for age-structured, biomass for biomass-pool) with the same scalar for each run;

scalars ∈ {0.5, 0.8, 0.9, 0.95, 1.05, 1.1, 1.2, 1.5}

Set 2: High uncertainty. All initial conditions were scaled (numbers for age-structured, biomass for biomass-pool), with the scalars sampled from normal distributions with μ = 0 and σ set based on the informance ratings defined in McGregor et al. (2019b) (Fig. 1). Biomass-pool species groups were assumed poorly specified as these were not ranked in McGregor et al. (2019b).

Figure 1: Set 2 scalars used to perturb initial conditions, with scalars sampled from Gaussian distributions with μ = 0 and σ one of (0.025, 0.05, 0.1, 0.25) based on informance levels 1–4 respectively where (1) ‘No data gaps, performed well, abundance index available’ (green); (2) ‘Slight data gaps and/or poor performance’ (blue); (3) ‘Some (more substantive) data gaps and/or poor performance’ (magenta); (4) ‘Poorly specified’ (gold); (defined in McGregor et al. (2019b) and for reference in Table 1).

Download full-size image

DOI: 10.7717/peerj.9254/fig-1

Set 3: High keystone species. These runs only scaled the initial conditions of species groups likely to be most influential on the system. The species groups that ranked in the top 10 for keystoneness in McGregor et al. (2019b), and all biomass pool species groups were scaled using normally distributed scalars sampled with μ = 0 and σ = 0.25, giving 95% confidence intervals of ≈±0.5. All other species groups were unchanged (Fig. 2).

Figure 2: Set 3 scalars used to perturb initial conditions for high keystone and biomass-pool species groups.
Scalars sampled from the Gaussian distribution with μ = 0 and σ = 0.25.

Download full-size image

DOI: 10.7717/peerj.9254/fig-2

Ecosystem indicators

To help our understanding of the sensitivity of the whole model to perturbations of the initial conditions, we analysed ecosystem indicators from all simulations. We calculated a subset of the ecosystem indicators analysed for the base model in McGregor, Fulton & Dunn (2019a) (Table 3) at each timestep for all model simulations. Mean trophic level, diversity and the ratio of pelagic to total biomass were chosen as they responded to fishing scenarios for the Chatham Rise Atlantis model (McGregor, Fulton & Dunn, 2019a), but didn’t require catch for the calculation (so we could apply them to model runs here with or without fishing included). We analysed the results for consistencies or discrepancies in shifts of the ecosystem reflected through these indicators, with particular focus on the response of the system when heavy fishing became established during the mid-1970s.

Table 3:

Key ecosystem indicators evaluated for responses to perturbing the initial conditions.

Indicator	References
Mean trophic level	Pauly & Watson (2005) and Shin et al. (2018)
Diversity (modified Kempton’s Q)	Ainsworth & Pitcher (2006)
Biomass of pelagic fishes/biomass total	Link (2005)

DOI: 10.7717/peerj.9254/table-3

Characterising the species groups

If certain species groups appear to be more stable than others, we wanted to be in a position to investigate whether the more stable species groups have shared characteristics—for example are there links between sensitivities to changes in the initial conditions and how connected each species group is in the system, how abundant they are, how long they live, or some combination of these.

We characterised the species groups based on the base model presented in McGregor et al. (2019b) so we could test for links between these attributes of the species groups and sensitivities to changes in the initial conditions. We considered keystoneness, trophic level, biomass, animal size, lifespan, additional (background) mortality, number of trophic connections, and proportion of most dominant (‘top’) prey. All but the final three of these indices were available from McGregor et al. (2019b). The proportion of diet made up by most dominant prey, number of trophic connections, and the proportion of natural mortality that was made up of additional mortality were calculated for this study using R version 3.4.3.

Proportion of top prey

For each species group, we calculated the contribution to a predators diet from the single most dominant prey in their diet, as a proportion of biomass consumed, using averaged diets from the base model. The intent was to classify the extent to which each species group was eating as a specialist or generalist as they are modelled. It is possible for a predator to perform in the model as more of a specialist due to aggregation of species into groups—they could predate on several prey species that are modelled in the same species group. For each species group, we summed the prey eaten over the entire model region and all modelled years 1900–2015, then selected the largest proportion. Due to this averaging, a predator switching between two prey (for example) would appear similar to a predator consistently consuming those two prey. Hence, a predator would need to consistently predate on a given prey species group to be considered a specialist feeder for this metric.

Number of trophic connections

Trophic connections were calculated as ‘primary connections’ (predators and prey of the species group), ‘secondary connections’ (predators and prey of the predators and prey of the species group) and ‘tertiary connections’ (similarly) (illustration in Fig. 3). As sometimes a predator may eat a very small, negligible amount of a prey, we included a cut-off at 1%, such that a prey or predator was not included in the connections count if they made up less than 1% of the total prey consumed or predation pressure applied, respectively.

Figure 3: Illustration of primary and secondary tertiary trophic level connections, where the species group in the centre (shaded blue) has two primary connections, and five secondary connections.

Download full-size image

DOI: 10.7717/peerj.9254/fig-3

Additional natural mortality

There is the option in Atlantis to apply additional natural mortality either as a quadratic term, which is density dependent, or as a linear term (Audzijonyte et al., 2017a). The balance between additional natural mortality and mortality coming from dynamics within the model may affect the model’s stability. Higher levels of additional mortality reduce the strength of connections in the model, with 100% additional mortality effectively resulting in parallel single species models. Additional mortality was required for some species groups in the Chatham Rise Atlantis model that did not suffer sufficient natural mortality through predation, starvation or disease in the model to match estimates of mortality from the literature. For all age-structured species groups in this model, linear rather than quadratic mortality was applied as this is a close approximation to instantaneous natural mortality (M). When M is small, as it is when applied at small time-steps, e^−M can be approximated by 1 − x using the first two terms of its Taylor series expansion. Hence, if we take N_t to be the number of individuals at timestep t and N_t ₊ _δ to be the number at timestep t + δ where δ is small we get (1) $N_{t + δ} = N_{t} e^{- M_{δ}} \approx N_{t} (1 - M_{δ})$

As linear mortality, m_L, is applied at every timestep (12 h for this model), we can use M_δ to approximate m_L. This is, however, complicated by a temperature effect which is applied to m_L in Atlantis. Additional mortality in Atlantis are assumed related to metabolic rates, and hence are temperature dependent. The temperature effect is applied as a scalar (T_corr) calculated as a function of the current water temperature (T) (in a given cell at a given time) relative to a base temperature, set at 15 °C (Eq. (2)).

(2)

T_{c o r r} = 2^{(T - 15) / 10}

As temperature varies spatially and temporally, so does the scaled m_L. We calculated the additional applied mortality for each species group based on their spatial distribution, m_L values, and temperature corrections, using the median, upper and lower quartile, and 95% confidence intervals for the applied additional mortality to reflect the variability of temperature spatially and temporally. These were calculated for both juveniles and adults as m_L and spatial distributions were defined separately for these life stages.

Total realised mortality rates were estimated from the model by fitting an exponential decay curve to the proportions-at-age. By running the model with no fishing, the realised mortality consisted entirely of natural mortality, including sources within the model such as predation, as well as additional mortality from m_L. We then compared the total realised natural mortality with the range of additional mortality to estimate what proportion of natural mortality was coming from dynamics within the model, and what proportion was forced. We produced a weighted average for each species group that combined the proportions for adults and juveniles, weighted by the numbers of adults and juveniles respectively.

Modelling stability

We analysed variability as a result of perturbing the initial conditions for each species group by calculating the coefficient of variation (CV) of biomass between the runs. A species group with high variability between model simulations relative to the mean biomass from all simulations would have a high CV, and hence be considered highly sensitive to perturbations to the initial conditions.

We analysed the effects of perturbing the initial conditions by fitting a GLM (Generalised Linear Model) to the CV for the biomass of each species group across model runs. We used Atlantis model outputs following a 35-year burn-in period, to match the burn-in used in McGregor et al. (2019b). Variables from characterising the species groups (Table 4) using the base model were offered as possible explanatory variables, using a step-wise selection algorithm, with each iteration selecting the variable (or pair of interaction variables) that explained the largest proportion of the null deviance. This process was repeated until the additional deviance explained was less than 10%. This cut-off value was selected to limit the number of explanatory variables selected, while retaining most of the explained null deviance. We initially explored untransformed, and log (base 10) and cubed root transformations of the response variable (CV), with all modelled using the Gaussian distribution. The analyses presented here used the cubed root transformation as we found this produced greater homogeneity of residuals with respect to the fitted values.

Table 4:

Explanatory variables offered to explain effects of perturbing the initial conditions, and whether these were defined specifically for age-structured species groups or for all species groups.

Explanatory variables were analysed using the base model presented in McGregor et al. (2019b).

Variable	Description	All species groups
(1) Informance	Rating of how well informed and how well it performed in the base model
(2) TL	Trophic level	✓
(3) Keystone	Keystone ranking
(4) Response	Responsive ranking
(5) NumL1cons	Number of primary trophic connections	✓
(6) Lifespan	Approximate expected lifespan in years	✓
(7) propAdM	Proportion of adult natural mortality that is estimated to have come from additional (background) mortality
(8) propJuvM	Proportion of juvenile natural mortality that is estimated to have come from additional (background) mortality
(9) B0	Unfished biomass	✓
(10) PropByTopPrey	Proportion of diet made up by the most dominant prey group
(11) Linf	Expected maximum length (cm)
(12) ChaosAlt	Method used to perturb initial conditions (set 1, 2 or 3) and whether fishing was turned on or not	✓

DOI: 10.7717/peerj.9254/table-4

We could not model the biomass-pool species group CVs with respect to all attributes, as some attributes had not been analysed for biomass-pool groups (e.g. Keystone and Response), and some attributes relate to individuals, such as maximum size and instantaneous mortality. Hence, we fitted three versions of the GLM: (1) limited the species groups included in the analyses to species with age-structure in order to consider the full list of explanatory variables; (2) retained all species groups, but limited the explanatory variables offered to those that relate to all species groups; (3) limited the species groups to biomass-pool species groups, with the limited explanatory variables offered. Table 4 gives the full list of explanatory variables offered for biomass-pool (BP), age-structured (AS) and all-species (ALL) versions of the model. All possible paired interaction terms were also offered. PropByTopPrey was dropped from BP models as nearly half (8/17) of the biomass-pool species groups were not predators, and this variable only applies to predators.

We fitted the GLM to model outputs for each year (1900–2015) to test for temporal shifts in the effects (a separate GLM was fitted at each year). To allow for influence from the method of perturbing the initial conditions (all up or down, based on keystoneness, or based on uncertainty), we included this (‘ChaosAlt’) as a potential explanatory variable. We also explored splitting out the fished model runs from the unfished, or including this within ChaosAlt (Table 5).

Table 5:

ChaosAlt definitions for perturbing the initial conditions, and including fishing in the model or not.

ChaosAlt was offered as an ex-planatory variable to the GLMs.

ChaosAlt	Description	Included fishing
A	All up or down
B	All up or down	✓
C	Based on uncertainty
D	Based on uncertainty	✓
E	Based on keystoneness
F	Based on keystoneness	✓

DOI: 10.7717/peerj.9254/table-5

We fitted a summary GLM for each version (ALL, AS, BP species; with/without fishing included), using a subset of the years simulated by the models, where the explanatory variables selected for models fitted at each timestep were roughly consistent. We used these summary models to explore the effects of the selected explanatory variables on between-run CVs. We analysed the residuals to check for trends or biases in the model fits, and present these as well as the effects of selected explanatory variables.

Results

Variability from initial conditions

Variability between simulations with perturbed initial conditions remained high for some species groups, while others converged to almost identical outputs. Generally, the biomass-pool species groups were more likely to have persistent high CVs between model runs (Fig. 4). Fishing sometimes reduced the between-model CVs for age-structured species groups, such as for hoki, but the effects of fishing were not apparent in any biomass-pool species groups—in biomass trajectories or CVs between model runs (Fig. 5 for two examples; Appendix A for the full set of figures). Responses of age-structured species groups to fishing were generally consistent across model runs. This included direct effects of fishing on a species (such as hoki, hake, orange roughy and ling), and predation-release responses (such as cephalopods and pelagic fish). Exceptions were invert comm herb (primarily paua and kina), invert comm scav (primarily scampi), dem fish pisc (primarily giant stargazer) and seaperch, which all gave varied responses with fishing included in the model.

Figure 4: CVs from model runs with perturbed initial conditions.
(A) Age-structured species groups, (B) Biomass-pool species groups. Median (solid lines) and upper and lower quartiles (dot-dashed lines) for CVs from fished model runs (blue lines) and un-fished model runs (orange lines).

Download full-size image

DOI: 10.7717/peerj.9254/fig-4

Figure 5: Biomass trajectories for (A) deposit feeders and (B) hoki from model runs with perturbed initial conditions.
Models with fishing included (blue lines) and no fishing (orange lines), with CVs from across the model runs by time from fished models (midnight blue crosses) and un-fished models (dark orange pluses).

Download full-size image

DOI: 10.7717/peerj.9254/fig-5

Ecological indicators

Ecological indicators demonstrated variability from the perturbed initial conditions that generally persisted throughout the model simulations. However, the responses to heavy fishing from the mid-1970s were consistent across runs, with a decline in mean trophic level, a slight increase in diversity, and an increase in the ratio of pelagic biomass over total biomass (Fig. 6). There was a slight decline in mean trophic level from 1900 to 2015 in some of the unfished models, although the decline was approximately 0.02 of a trophic level over 100 years, so rather small. The ecological indicators present with more overlap between simulations than the individual species biomass outputs (Figs. 5 and 6).