Beyond visualizing catch-at-age models: Lessons learned from the r4ss package about software to support stock assessments

doi:10.1016/j.fishres.2021.105924

Fisheries Research

Volume 239, July 2021, 105924

https://doi.org/10.1016/j.fishres.2021.105924 Get rights and content

Abstract

Stock assessment analysts are exploring an increasingly diverse and complex range of models while also facing higher expectations for consistency, documentation, and transparency in reports and management advice, all within a tight timeline. Meeting these goals requires increased efficiency at all steps in the assessment process from data processing, through model development and selection, to report writing and review. Here, we describe one widely used tool that has proven successful in increasing the efficiency of the assessment process: the r4ss package, which supports the use of the Stock Synthesis modeling framework. What began 15 years ago as a tool to provide simple model diagnostics, including plots showing data and model results, has grown into a large collection of R functions to support many aspects of the assessment process. We provide an overview of the r4ss features and illustrate its utility with examples from recent applications. Finally, we discuss lessons learned from the ongoing development of r4ss that can be applied to similar efforts associated with the next generation of stock assessment packages.

Introduction

Assessment of fish stocks (hereafter referred to as “stocks”) is a necessary task, largely because of mandates by federal and regional governing bodies to provide information about stock status and apply harvest control rules to inform catch limits under harvest policies. While incorporating disparate data sources into a single population model (integrated analysis) to determine stock status is routine, understanding the fit to each data set and its associated influence on the model results can be challenging (Maunder and Punt, 2013; Maunder and Piner, 2015). A standardized set of visualization tools is key to providing understanding and transparency throughout this process for stock assessment analysts, reviewers, stakeholders, and managers. For example, standardized tools allow analysts to quickly understand model results and explore new model configurations during the model development and peer review processes; reviewers scrutinize the analyses and investigate other alternatives with the aid of visualization tools, ultimately deciding if the assessment results are appropriate for use by management; and lastly, stakeholders and managers need to understand the model results, and hence, need intuitive visualization tools to inform the range of management options and decide on which management measures to take.

Visualization tools can aid analysts throughout the assessment process. For example, Richards et al. (1997) found while developing a stock assessment for Pacific ocean perch (Sebastes alutus) that visualization tools allowed them to better understand their data sets and pinpoint data features that needed to be accommodated, develop a statistical catch-at-age model well suited to the data sets, and evaluate model output more thoroughly. Stock assessments often require hundreds of model runs. Tools for quickly visualizing model results allow analysts to more efficiently select among them. As an illustration of the power of automated workflows and visualization tools, calculating residuals by hand would take hours, while visualizing patterns in residuals already plotted can take just minutes. Visualization tools can also relieve the feeling of being time-poor when conducting stock assessments (Bentley, 2015). Aside from efficiency, a thorough and standardized set of tools for visualizing model output can help catch errors such as misspecified models and aid in the report writing process, as most stock assessment reports require numerous figures and tables.

The peer review process for stock assessments (e.g., Brown et al., 2020), to determine if assessment results can be used by management bodies for decision making, benefits from visualization tools. For example, Regular et al. (2020) found that interactive data and model dashboards improved their ability to communicate with stakeholders during the stock assessment review process for a northern cod stock in the northwest Atlantic. Producing standardized figures across assessments increases ease of understanding for readers and simplifies comparisons across assessments. Often, peer reviewers are tasked with evaluating modeling decisions and model results, ultimately deciding if the assessment results are appropriate for use by management. Requests for visualizations made during assessment review processes are often expected in subsequent reviews, especially if the same reviewers may be engaged in the future, and should be added to assessment analyst toolboxes, such that they can be better prepared for future reviews. Thus, this toolbox grows with each review and helps facilitate efficient reviews, because analysts are able to quickly produce desired output before it is asked for.

The Terms of Reference (ToR) for stock assessment reviews have also coevolved with visualization tools, increasing the value of standardization. For instance, 10 years ago the ToR for groundfish stock assessments conducted for the Pacific Fishery Management Council (PFMC, 2009) had an eight-point bulleted list of general stock assessment team deliverables, while the ToR used in 2019 (PFMC 2019) had a checklist of 74 elements within 18 sections with more specificity. These ToR changes have been driven in part by feedback from reviewers seeing the benefit of new visualizations and diagnostics for individual assessments as described above. The ToR changes, in turn, lead to wider adoption of the new approaches for analysts working to meet them, a shift which is easier when the analysts can use shared tools to meet the new standards.

Effectively translating complicated assessment models and results into an easily digestible form for fishery managers and stakeholders can be challenging, especially when presenting information across a large range of stocks (Dichmont et al., 2016). Presenting assessment results in a consistent manner across stocks can lessen the communication challenge, allowing for improved discussions between analysts, stakeholders, and managers. The development and application of an assessment toolbox for use by analysts facilitates this process without creating additional workload.

Communication methods for stock assessment results are not a frequent topic in fisheries science journals, but it is an area where new ideas are rapidly developing and which deserves greater prominence in the literature. The widespread adoption of the generalized integrated analysis platform Stock Synthesis (SS, Methot and Wetzel, 2013) provided an opportunity to develop a standardized set of visualization and automation tools, given a larger pool of potential applications, users, and contributors (Punt and Maunder, 2013). Here, we discuss how r4ss, an R package containing tools for working with SS models, has improved the stock assessment development and review processes for individual analysts, reviewers, and managers over its 15 years of active development. We also highlight lessons learned from developing and using r4ss that could be applied when developing new visualization tools for a new stock assessment modeling platform.

The r4ss package grew organically from a single code script written by a single author in 2005 for use in the R statistical programming language (R Core Team, 2020) to a large open-source R package with many contributors. Before r4ss was developed, the typical workflow for SS users was examining the output text files directly or importing them into Excel where figures were generated using Visual Basic scripts or created manually for each model. The figures were time-consuming to create, had limited reproducibility, and did not provide reviewers and managers with a consistent product with which they could become familiar as modifications for an individual model were rarely generalized for the benefit of other models. The original r4ss R script became widely used by the stock assessment team at the National Oceanic and Atmospheric Administration (NOAA) Northwest Fisheries Science Center (NWFSC) and grew in complexity as members of the assessment team provided suggestions for additions. The increase in use also increased the burden associated with maintaining the code, and in 2008 the lead developer role was shifted to a postdoctoral researcher which allowed for more directed development, facilitating the growth and use of the code to function across SS-assessed stocks. Shortly thereafter, the code was put under version control and released as open source to facilitate distribution and development, to increase transparency, and to reduce the burden of maintenance on any individual developer.

Although, in the early years of its development, most of the code was written by just two people, feedback from users was essential to improving the package. In particular, conversations with participants at the annual Inter-American Tropical Tuna Commission (IATTC) Stock Assessment Workshop series (since succeeded by the Center for the Advancement of Population Assessment Methodology workshops) led to significant steps forward in the project. The initial public release of r4ss took place during the 2008 IATTC workshop (Maunder, 2008); discussions at the 2009 workshop inspired the conversion of the script into a formal R package available on the Comprehensive R Archive Network (CRAN); and a demonstration of the Javascript viewer for Multifan-CL (SPC, 2010) associated with the 2011 workshop led to the development of an HTML viewer for r4ss plots. Formatting the r4ss script as an R package brought the benefits of structured documentation for each function; making the r4ss package available on CRAN made it easier to find and install (as CRAN is the first source most users will look to for R packages). The number of authors, all of whom have made substantial code contributions, has also grown from five in 2009 to 29 in 2020. The methods used to incorporate code into the r4ss codebase have also evolved from contributors emailing files to the lead developer, to GitHub pull requests that get automatically checked and manually reviewed before merging. Although the development workflow has grown more sophisticated, the organic evolution of r4ss leaves many legacy aspects of the code and package structure, which are typical of research software (Ram et al., 2019), but would be designed differently if starting from scratch today.

The r4ss package (github.com/r4ss/r4ss) includes functions designed to work with SS input and output files (Supplement 1). The main types of functions in the package are: 1) functions to read and plot information from SS output files to visualize model results; 2) functions to automate tasks associated with SS models that are routinely performed; and 3) functions to read, create or modify SS input files. In the examples, we will focus on functions to visualize model results and automate routine tasks.

Section snippets

Multimodel management (Pacific halibut)

The Pacific halibut (Hippoglossus stenolepis) stock assessment comprises four individual models which are used to create an ensemble for management use by the International Pacific Halibut Commission (IPHC; Stewart and Martell, 2015; Stewart and Hicks, 2018). Each of the models represent a different hypothesis regarding the best approach for modeling the stock dynamics. The four models vary in the length of the modeled period, the level of data aggregation, and data-weighting, among other

Collective experience of the authors

In addition to the examples above, r4ss has facilitated the formalization of many assessment authors’ “tips and tricks” for efficiently building, diagnosing, and reporting stock assessment models. Sharing of collective experience reduces the learning curve for new assessment authors and also provides structure to remind experienced authors of perennial pitfalls. This section reports a series of problems that we the authors have collectively encountered across a large number of individual stock

Discussion

Software packages are often described as black boxes (Dichmont et al., 2016) and fitting models to data has previously been described as an art rather than a science because of numerous non-trivial choices in the model development process (e.g., how to specify the model, how to weight the data). Fortunately, stock assessment scientists are formally trained in at least either model development or model fitting, helping to ensure that results fulfill mandates to provide the best available

CRediT authorship contribution statement

Ian G. Taylor: Conceptualization, Software, Writing - original draft, Writing - review & editing. Kathryn L. Doering: Conceptualization, Software, Writing - original draft, Writing - review & editing. Kelli F. Johnson: Conceptualization, Software, Writing - original draft, Writing - review & editing. Chantel R. Wetzel: Conceptualization, Software, Writing - original draft, Writing - review & editing. Ian J. Stewart: Conceptualization, Software, Writing - original draft, Writing - review &

Declaration of Competing Interest

The authors report no declarations of interest.

Acknowledgements

We thank Mark Maunder and Simon Hoyle for their role in organizing the IATTC and CAPAM workshops that have led to so much development of r4ss as well as our co-authors of the r4ss package: Z. Teresa A'mar, Sean C. Anderson, Andrew B. Cooper, LaTreese S. Denson, Robbie L. Emmet, Tommy M. Garrison, Andrea M. Havron, Allan C. Hicks, Watal M. Iwasaki, Neil L. Klaer, Gwladys I. Lambert, Carey R. McGilliard, Cole C. Monnahan, Iago Mosqueira, Kotaro Ono, André E. Punt, Megan M. Stachura, Christine C.

References (52)

S.K. Brown et al.
Patterns and practices in fisheries assessment peer review systems
Mar. Policy
(2020)
C.M. Dichmont et al.
A review of stock assessment packages in the United States
Fish. Res.
(2016)
M.N. Maunder et al.
A review of integrated analysis in fisheries stock assessment
Fish. Res.
(2013)
R.D. Methot et al.
Stock synthesis: a biological and statistical framework for fish stock assessment and fishery management
Fish. Res.
(2013)
K.R. Piner et al.
Evaluation of using random-at-length observations and an equilibrium approximation of the population age structure in fitting the von Bertalanffy growth function
Fish. Res.
(2016)
A.E. Punt
Some insights into data weighting in integrated stock assessments
Fish. Res.
(2017)
A.E. Punt et al.
Stock Synthesis: Advancing stock assessment application and research through the use of a general stock assessment computer program
Fish. Res.
(2013)
A.E. Punt et al.
Essential features of the next-generation integrated fisheries stock assessment package: a perspective
Fish. Res.
(2020)
I.J. Stewart et al.
A comparison of stock assessment uncertainty estimates using maximum likelihood and Bayesian methods implemented with the same model framework
Fish. Res.
(2013)
J.T. Thorson et al.
Model-based estimates of effective sample size in stock assessment models using the Dirichlet-multinomial distribution
Fish. Res.
(2017)

S.C. Anderson et al.

ss3sim: an R package for fisheries stock assessment simulation with Stock Synthesis

PLoS One

(2014)

S.C. Anderson et al.

Reproducible visualization of raw fisheries data for 113 species improves transparency, assessment efficiency, and monitoring

Fisheries

(2020)

N. Bentley

Data and time poverty in fisheries estimation: potential approaches and solutions

ICES J. Mar. Sci.

(2015)

E. Bocher et al.

Geospatial Free and Open Source Software in the 21st Century

(2012)

C. Boettiger et al.

Building software, building community: lessons from the rOpenSci project

J. Open Res. Softw.

(2015)

L.D. Brown et al.

Interval estimation for a binomial proportion

Stat. Sci.

(2001)

Carvalho, F., Winker, H., Courtney, D., Kapur, M., Kell, L., Cardinale, M., Schirripa, M., Kitakado, T., Yemane, D.,...

B. Fischhoff

The sciences of science communication

Proc. Natl. Acad. Sci. U.S.A.

(2013)

D.A. Fournier et al.

AD Model Builder: using automatic differentiation for statistical inference of highly parameterized complex nonlinear models

Optim. Methods Softw.

(2012)

R.C. Francis

Data weighting in statistical fisheries stock assessment models

Can. J. Fish. Aquat. Sci.

(2011)

C.J. Grandin et al.

Status of the Pacific Hake (whiting) Stock in U.S. And Canadian Waters in 2020

(2020)

J. Hester

Devtools 2.0.0

(2018)

S. Hoyle et al.

Status of yellowfin tuna in the eastern Pacific Ocean in 2004 and outlook for 2005

Inter-American Tropical Tuna Commission Stock Assessment Report

(2006)

W. Huber et al.

Orchestrating high-throughput genomic analysis with Bioconductor

Nat. Methods

(2015)

IPHC

Stock Assessment

(2020)

S. Killcoyne et al.

Managing chaos: lessons learned developing software in the life sciences

Comput. Sci. Eng.

(2009)

Cited by (11)

The good practices of practicable alchemy in the stock assessment continuum: Fundamentals and principles of analytical methods to support science-based fisheries management under data and resource limitations
2024, Fisheries Research
It is the exceptionally rare case one can directly and with little uncertainty measure fish absolute abundance through many stock generations in all areas of a stock’s range. Instead, we often seek “gold standard” stock assessments—models that use catch, abundance indices and biological compositions to produce precise and unbiased indicators of stock status for management use. Unfortunately, data and resource limitations affect our ability to collect all the desired information and apply methods with low uncertainty in the results. To confront this challenge of poorly informative data and low resource situations, a host of analytical approaches have been developed to engage the power of fisheries science to inform management decisions despite limitations. These methods are numerous and often challenging to understand and navigate, despite being simplified (though not simple) approaches. It is important to understand where these methods come from, how they can be used, and how to evaluate them. Often they are presented as alchemically providing golden outputs despite heavy assumptions and impure inputs. Here I aim to provide both scientific context of and guidance in organizing and applying so-called data and resource limited stock assessments. I offer a list of best practices by presenting fundamental principles of modelling and highlighting leading edge tools for organizing and conducting analyses under a variety of constraining conditions, offering a conceptualization of stock assessment expressing the interconnectedness of each method and how those can be largely unified under a common modeling framework. The concept of a stock assessment continuum is described, along with discrete examples in the form of a decision tree outlining the major modelling groups for a large variety of data availability scenarios. The basic approach to applied fisheries science and management is presented as interpreting uncertain model outputs (i.e, indicators) using reference points that can then be linked to management decisions via control rules that should express risk tolerance to meeting management objectives in light of uncertain outcomes. The role of simulation testing of management procedures is highlighted in order to evaluate robustness to uncertainty. While more and better data should be a focus of any management system, there is no excuse to wait for golden outputs. We have the tools and theory ready to help direct management of data and resource limited stocks now.
Implications of the maximum modelled age on the estimation of natural mortality when using a meta-analytic prior: The example of eastern Australian orange roughy (Hoplostethus atlanticus)
2023, Fisheries Research
Citation Excerpt :
MCMC convergence was assessed by (i) examination of trace plots to identify autocorrelation and slow mixing, (ii) assessment of the stationarity of the chain using the Geweke statistic (Gelman et al., 2013) and (iii) determining whether the Heidelberger and Welch test (Heidelberger and Welch, 1981, 1983; Gelman et al., 2013) was passed or not. The R packages coda (Plummer et al., 2006) and r4ss (Taylor et al., 2021) were used to produce the plots and statistics. Eastern Australian orange roughy is managed by the Australian Fisheries Management Authority (AFMA) as a Tier 1 stock within the Australian Federal Government’s Southern and Eastern Scalefish and Shark Fishery (SESSF; Smith et al., 2008).
The eastern Australian stock of orange roughy (Hoplostethus atlanticus) is a deep-water, long-lived species with a history of considerable exploitation during the late 1980s and early 1990s before being reduced to around 10% of unfished spawning biomass only a few years later, resulting in the closure of the fishery in 2006. Recent assessments have shown an increase in biomass, and the fishery was re-opened to targeted fishing in 2015. The stock is an example of the consequences of over-exploitation and of managed recovery. As such, conservation groups and fisheries managers are very interested in the status and future of the stock. Consequently, the assessment is both contentious and highly scrutinised. The current assessment uses the Stock Synthesis platform, with key inputs being annual catches, occasional acoustic surveys and age-composition data. Natural mortality (M) has been fixed at 0.04 yr⁻¹ in the model on which management advice was based for several assessment cycles, but the maximum likelihood estimate of M is closer to 0.03 yr⁻¹. Reducing the assumed value for M would lead to large reductions in catch, as determined by the Australian harvest control rule. A prior for M was developed based on assessments of orange roughy stocks in New Zealand and included in a Bayesian analysis in which M was treated as an estimable parameter. The median of the posterior for M is 0.0353 yr⁻¹ when the maximum age in the assessment (i.e., the ‘plus-group age’) is set to 80 years, but setting the plus-group age to 80 is not based on analysis of data. Increasing the plus-group age to 100 and 120 years leads to posterior medians for M of 0.0381 and 0.0393 yr⁻¹ respectively. This has consequences for catch limits under the Australian harvest control rule, with models that have older plus-group ages having higher estimated productivity and recommended biological catches. The results highlight the value of making use of the results from assessments of similar species to develop priors for M, especially for species that are poorly represented in the data sets on which current meta-analyses are based, and for the need to consider the choice of plus-group age when conducting assessments, particularly for long-lived species.
Investigating trends in process error as a diagnostic for integrated fisheries stock assessments
2022, Fisheries Research
Citation Excerpt :
Our analysis on the Indian Ocean yellowfin is shown throughout the main manuscript and we also provide an overview of trends in recruitment deviates from all tropical tuna stocks in a summarized way in the main text and for each stock as Supplementary material. The recruitment deviates were extracted from Stock Synthesis files using r4ss (Taylor et al., 2021) a package that contains a collection of R functions (R Core Team, 2021) for interacting with Stock Synthesis. In the stock assessment, the recruitment deviates were estimated from the first quarter of 1972 to the last quarter of 2018 (the time from when the CPUE series starts to two years before the final year of the assessment).
Integrated stock assessments consist of fitting several sources of catch, abundance, and auxiliary biological information to estimate parameters of equations that describe the population dynamics of fish stocks. Stock assessments are subject to uncertainty, and it is a common practice to characterize uncertainty using alternative hypotheses and assumptions within an ensemble of models to develop scientific advice for fisheries management. In this context, there is the need to assign levels of plausibility to each of the combinations of factors that ultimately reflect the uncertainty on different biological and fishery processes. In this study, we describe and apply a model diagnostic to identify trends in process error in recruitment deviation estimates within ensembles of integrated assessment models of tropical tunas. We demonstrate that assessment model ensembles for tropical tunas contain distinct scenarios with significant trends in process error that are overlooked, with the associated implications for fisheries management. Using the Indian Ocean yellowfin as a case study, we found that trends in recruitment deviates are linked to extreme productivity scenarios which strongly diverged in scale from deterministic models fitted without recruitment deviates. This indicates that when recruitment deviates show an increasing trend, these can compensate for the loss of biomass in periods of high catch beyond the surplus production. In these cases, variation in recruitment is not a random process, but rather takes the function of a compensatory, systematic driver in productivity. Significant trends in recruitment were positively correlated with increased standard deviations and auto-correlation coefficient, non-random residual pattern in fits to abundance indices, and particularly poor performance of the Age-Structured Production Model (ASPM) diagnostic. We suggest that trends in recruitment deviates can be caused by misspecification of the biological parameters used as fixed values in integrated assessment models. The process error diagnostic described here can provide a statistical criterion in support for hypotheses and assumptions when using ensembles of models to develop fisheries management advice.
Preface: Developing the next generation of stock assessment software
2022, Fisheries Research
A cookbook for using model diagnostics in integrated stock assessments
2021, Fisheries Research
Citation Excerpt :
Similarly, there has been a recent increase in the use of Stock Synthesis for benchmark assessments in Europe in place of the conventionally used VPA with extended survivor analysis or the state-space catch-at-age models such as SAM (ICES, 2019). The visualization of model outputs and implementation of diagnostics for Stock Synthesis is facilitated by the R package r4ss (Taylor et al., 2021; github.com/r4ss/r4ss). For each technique, we point readers to relevant citations or source code.
Integrated analysis has increasingly been the preferred approach for conducting stock assessments and providing the basis for management advice for fish and invertebrate stocks around the world. Many decisions are required when developing integrated stock assessments. For example, the analyst needs to decide whether the model fits the data, if the optimization was successful, if estimates are consistent retrospectively, and if the model is suitable to predict future stock responses to fishing. This study provides practical guidelines for implementing selected diagnostic tools that can assist analysts in identifying problems with model specifications and alternatives that can be explored to minimize or eliminate such problems. Emphasis is placed on reviewing the implementation and interpretation of contemporary model diagnostic tools. We first describe each diagnostic approach and its utility. We then proceed by providing a “cookbook recipe” on how to implement each of the diagnostics, together with an interpretation of the results, using two worked examples of integrated stock assessments with Stock Synthesis. Further, we provide a conceptual flow chart that lays out a generic process of model development and selection using the presented model diagnostics. Based on this, we propose the following four properties as objective criteria for evaluating the plausibility of a model: (1) model convergence, (2) fit to the data, (3) model consistency, and (4) prediction skill. It would greatly benefit the stock assessment community if the next generation of stock assessment models could include the diagnostic tests presented in this study as a set of open source tools.
Age, growth, and biomass projections of red porgy Pagrus pagrus (Teleostei, Sparidae) after the fishery collapse in southern Brazil
2024, Fisheries Management and Ecology

View all citing articles on Scopus

View full text

Published by Elsevier B.V.

Beyond visualizing catch-at-age models: Lessons learned from the r4ss package about software to support stock assessments

Abstract

Introduction

Section snippets

Multimodel management (Pacific halibut)

Collective experience of the authors

Discussion

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgements

Mar. Policy

Fish. Res.

Fish. Res.

Fish. Res.

Fish. Res.

Fish. Res.

Fish. Res.

Fish. Res.

Fish. Res.

Fish. Res.

ss3sim: an R package for fisheries stock assessment simulation with Stock Synthesis

PLoS One

Reproducible visualization of raw fisheries data for 113 species improves transparency, assessment efficiency, and monitoring

Fisheries

Data and time poverty in fisheries estimation: potential approaches and solutions

ICES J. Mar. Sci.

Geospatial Free and Open Source Software in the 21st Century

Building software, building community: lessons from the rOpenSci project

J. Open Res. Softw.

Interval estimation for a binomial proportion

Stat. Sci.

The sciences of science communication

Proc. Natl. Acad. Sci. U.S.A.

AD Model Builder: using automatic differentiation for statistical inference of highly parameterized complex nonlinear models

Optim. Methods Softw.

Data weighting in statistical fisheries stock assessment models

Can. J. Fish. Aquat. Sci.

Status of the Pacific Hake (whiting) Stock in U.S. And Canadian Waters in 2020

Devtools 2.0.0

Status of yellowfin tuna in the eastern Pacific Ocean in 2004 and outlook for 2005

Inter-American Tropical Tuna Commission Stock Assessment Report

Orchestrating high-throughput genomic analysis with Bioconductor

Nat. Methods

Stock Assessment

Managing chaos: lessons learned developing software in the life sciences

Comput. Sci. Eng.