Skip to main content
Log in

Combining counts and incidence data: an efficient approach for estimating the log-normal species abundance distribution and diversity indices

  • Community ecology - Original research
  • Published:
Oecologia Aims and scope Submit manuscript

Abstract

Obtaining accurate estimates of diversity indices is difficult because the number of species encountered in a sample increases with sampling intensity. We introduce a novel method that requires that the presence of species in a sample to be assessed while the counts of the number of individuals per species are only required for just a small part of the sample. To account for species included as incidence data in the species abundance distribution, we modify the likelihood function of the classical Poisson log-normal distribution. Using simulated community assemblages, we contrast diversity estimates based on a community sample, a subsample randomly extracted from the community sample, and a mixture sample where incidence data are added to a subsample. We show that the mixture sampling approach provides more accurate estimates than the subsample and at little extra cost. Diversity indices estimated from a freshwater zooplankton community sampled using the mixture approach show the same pattern of results as the simulation study. Our method efficiently increases the accuracy of diversity estimates and comprehension of the left tail of the species abundance distribution. We show how to choose the scale of sample size needed for a compromise between information gained, accuracy of the estimates and cost expended when assessing biological diversity. The sample size estimates are obtained from key community characteristics, such as the expected number of species in the community, the expected number of individuals in a sample and the evenness of the community.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Beck J, Schwanghart W (2010) Comparing measures of species diversity from incomplete inventories: an update. Meth Ecol Evol 1:38–44

    Article  Google Scholar 

  • Brose U, Martinez ND (2004) Estimating the richness of species with variable mobility. Oikos 105:292–300

    Article  Google Scholar 

  • Bulmer M (1974) On fitting the poisson lognormal distribution to species abundance data. Biometrics 30:651–660

    Article  Google Scholar 

  • Cao Y, Williams DD, Williams NE (1998) How important are rare species in aquatic community ecology and bioassessment? Limnol Oceanogr 43:1403–1409

    Article  Google Scholar 

  • Cassie R (1962) Frequency distribution models in the ecology of plankton and other organisms. J Anim Ecol 31:65–62

    Article  Google Scholar 

  • Chao A, Shen TJ (2003) Nonparametric estimation of shannon’s index of diversity when there are unseen species in sample. Environ Ecol Stat 10:429–443

    Article  Google Scholar 

  • Chao A, Chazdon RL, Colwell RK, Shen TJ (2005) A new statistical approach for assessing similarity of species composition with incidence and abundance data. Ecol Lett 8:148–159

    Article  Google Scholar 

  • Connolly SR, Hughes TP, Bellwood DR, Karlson RH (2005) Community structure of corals and reef fishes at multiple scales. Science 309:1363–1365

    Article  PubMed  CAS  Google Scholar 

  • Connolly SR, Dornelas M, Bellwood DR, Hughes TP (2009) Testing species abundance models: a new bootstrap approach applied to indo-pacific coral reefs. Ecology 90:3138–3149

    Article  PubMed  Google Scholar 

  • DeVries PJ, Murray D, Lande R (1997) Species diversity in vertical, horizontal, and temporal dimensions of a fruit-feeding butterfly community in an ecuadorian rainforest. Biol J Linn Soc 62:343–364

    Article  Google Scholar 

  • Dorazio RM, Royle JA (2005) Estimating size and composition of biological communities by modeling the occurrence of species. J Am Stat Assoc 100:389–398

    Article  CAS  Google Scholar 

  • Dornelas M, Connolly SR (2008) Multiple modes in a coral species abundance distribution. Ecol Lett 11:1008–1016

    Article  PubMed  Google Scholar 

  • Downing J, Rigler F (1984) A manual on methods for the assessment of secondary productivity in fresh waters. Blackwells, Oxford

    Google Scholar 

  • Engen S, Lande R (1996) Population dynamic models generating the lognormal species abundance distribution. Math Biosci 173:85–102

    Article  Google Scholar 

  • Engen S, Lande R, Walla T, DeVries PJ (2002) Analyzing spatial structure of communities using the two-dimensional poisson lognormal species abundance model. Am Nat 160:60–73

    Article  PubMed  Google Scholar 

  • Engen S, Sæther BE, Sverdrup-Thygeson A, Grøtan V, Odegaard F (2008) Assessment of species diversity from species abundance distributions at different localities. Oikos 117:738–748

    Article  Google Scholar 

  • Engen S, Aagaard K, Bongard T (2011a) Disentangling the effects of heterogeneity, stochastic dynamics and sampling in a community of aquatic insects. Ecol Model 122:1387–1393

    Article  Google Scholar 

  • Engen S, Grøtan V, Sæther BE (2011b) Estimating similarity of communities: a parametric approach to spatio-temporal analysis of species diversity. Ecography 34:220–231

    Article  Google Scholar 

  • Etienne RS, Olff H (2005) Confronting different models of community structure to species-abundance data: a bayesian model comparison. Ecol Lett 8:493–504

    Article  PubMed  Google Scholar 

  • Fisher R, Corbet A, Williams C (1943) The relation between the number of species and the number of individuals in a random sample from an animal population. J Anim Ecol 12:42–58

    Article  Google Scholar 

  • Gaston K (1994) Rarity. Chapman and Hall, London

    Book  Google Scholar 

  • Golicher DJ, O’Hara RB, Ruiz-Montoya L, Cayuela L (2006) Lifting a veil on diversity: A bayesian approach to fitting relative-abundance models. Ecol Appl 16:202–212

    Article  PubMed  Google Scholar 

  • Gotelli N, Colwell R (2011) Estimating species richness. In: Magurran A, McGill B (eds) Biological diversity: Frontiers in measurement and assessment. Oxford University Press, Oxford, pp 39–54

  • Grøtan V, Engen S (2008) poilog: Poisson lognormal and bivariate Poisson lognormal distribution. R package version 0.4

  • Grøtan V, Lande R, Engen S, Sæther BE, DeVries P (2012) Seasonal cycles of species diversity and similarity in a tropical butterfly community. J Anim Ecol 81:714–723

    Google Scholar 

  • Grundy R (1951) The expected frequencies in a sample of an animal production in which the abundances are lognormally distrubuted. Biometrika 38:427–434

    Google Scholar 

  • Hessen DO, Walseng B (2008) The rarity concept and the commonness of rarity in freshwater zooplankton. Freshw Biol 53:2026–2035

    Article  Google Scholar 

  • MacArthur R (1965) Patterns of species diversity. Biol Rev 40:510–533

    Google Scholar 

  • MacKenzie DI, Kendall WL (2002) How should detection probability be incorporated into estimates of relative abundance?. Ecology 83:2387–2393

    Article  Google Scholar 

  • Magurran A (2011) Measurement biological diversity in time (and space). In: Magurran A, Mc Gill B (eds) Biological diversity: frontiers in measurement and assessment. Oxford University Press, Oxford, pp 85–93

  • Magurran AE, Henderson PA (2003) Explaining the excess of rare species in natural species abundance distributions. Nature 422:714–716

    Article  PubMed  CAS  Google Scholar 

  • Mao CX, Colwell RK (2005) Estimation of species richness: mixture models, the role of rare species, and inferential challenges. Ecology 86:1143–1153

    Article  Google Scholar 

  • May R (1975) Patterns of species abundance and distribution. In: Cody M, Diamond J (eds) Ecology and evolution of the community. Belknap Press, Cambridge

  • May R (1988) How many species on earth? Science 241:1441–1449

    Article  PubMed  CAS  Google Scholar 

  • McGill B (2011) Species abundance distributions. In: Magurran A, McGill B (eds) Biological diversity: frontiers in measurement and assessment. Oxford University Press, Oxford, pp 105–122

  • McGill BJ (2003) A test of the unified neutral theory of biodiversity. Nature 422:881–885

    Article  PubMed  CAS  Google Scholar 

  • McGill BJ, Etienne RS, Gray JS, Alonso D, Anderson MJ, Benecha HK, Dornelas M, Enquist BJ, Green JL, He FL, Hurlbert AH, Magurran AE, Marquet PA, Maurer BA, Ostling A, Soykan CU, Ugland KI, White EP (2007) Species abundance distributions: moving beyond single prediction theories to integration within an ecological framework. Ecol Lett 10:995–1015

    Article  PubMed  Google Scholar 

  • Novotny V, Basset Y (2000) Rare species in communities of tropical insect herbivores: pondering the mystery of singletons. Oikos 89:564–572

    Article  Google Scholar 

  • O’Hara RB (2005) Species richness estimators: how many species can dance on the head of a pin? J Anim Ecol 74:375–386

    Article  Google Scholar 

  • Pennington M (1983) Efficient estimator of abundance, for fish and plankton surveys. Biometrics 39:281–286

    Article  Google Scholar 

  • Prendergast JR, Quinn RM, Lawton JH, Eversham BC, Gibbons DW (1993) Rare species, the coincidence of diversity hot spots and conservation strategy. Nature 365:335–337

    Article  Google Scholar 

  • Preston F (1948) The commonness and rarity of species. Ecology 29:254–283

    Article  Google Scholar 

  • Purvis A, Hector A (2000) Getting the measure of biodiversity. Nature 405:212–219

    Article  PubMed  CAS  Google Scholar 

  • R Development Core Team (2008) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. http://www.R-project.org, ISBN 3-900051-07-0

  • Shannon C, Weaver W (1962) The mathematical theory of communication. University of Illinois Press, Illinois

  • Silva FR, Ferreira EJG, de Deus CP (2010) Structure and dynamics of stream fish communities in the flood zone of the lower Purus River, Amazonas state, Brazil. Hydrobiologia 651:279–289

    Article  Google Scholar 

  • Sugihara G (1980) Minimal community structure: an explanation of species abundance pattern. Am Nat 116:770–787

    Article  Google Scholar 

  • Tanner MA, Wing HW (1987) The calculation of posterior densities by data augmentation. J Am Stat Assoc 82:528–540

    Article  Google Scholar 

  • Taylor L (1984) Assessing and interpreting the spatial distributions of insect populations. Annu Rev Entomol 29:321–357

    Article  Google Scholar 

  • Thompson SK, Seber G (1996) Adaptive sampling. Wiley, New York

  • Vinson MR, Hawkins CP (1996) Effects of sampling area and subsampling procedure on comparisons of taxa richness among streams. J North Am Benthol Soc 15:392–399

    Google Scholar 

  • Walseng B, Hessen DO, Halvorsen G, Schartau AK (2006) Major contribution from littoral crustaceans to zooplankton species richness in lakes. Limnol Oceanogr 51:2600–2606

    Article  Google Scholar 

  • Walther BA, Moore JL (2005) The concepts of bias, precision and accuracy, and their use in testing the performance of species richness estimators, with a literature review of estimator performance. Ecography 28:815–829

    Article  Google Scholar 

  • Wittebolle L, Marzorati M, Clement L, Balloi A, Daffonchio D, Heylen K, De Vos P, Verstraete W, Boon N (2009) Initial community evenness favours functionality under selective stress. Nature 458:623–626

    Article  PubMed  CAS  Google Scholar 

  • Yoccoz NG, Nichols JD, Boulinier T (2001) Monitoring of biological diversity in space and time. Trends Ecol Evol 16:446–453

    Article  Google Scholar 

Download references

Acknowledgments

The study was supported by the Norwegian Research Council (WFR 185109).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Edwige Bellier.

Additional information

Communicated by Marc Mangel.

Electronic supplementary material

Below is the link to the electronic supplementary material.

PDF (689 KB)

Appendix

Appendix

For the simulation, three parameters are fixed: the expected total number of individuals in a sample E(N), the variance σ2, and the number of species s in the community. The parameter μ is deduced from these three parameters as follows: if the observed number of individuals for species i is \(N_i \sim \text{Poisson}(e^{X_{i}})\) and XN(μ, σ2), then:

$$ \begin{aligned} E(N_i)&=E\left(E\left(N_i\mid X\right)\right)=E\left(e^X\right)=e^{\mu+\frac{1}{2}\sigma^2},\\ E(N)&=E\left(\sum_{i} N_i\right)=s e^{\mu+\frac{1}{2}\sigma^2},\\ \ln(E(N))&=\ln s+\mu+\frac{1}{2} \sigma^2, \end{aligned} $$

giving:

$$ \mu=\ln \left(\frac{E(N)}{s}\right)-\frac{1}{2} \sigma^2. $$

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bellier, E., Grøtan, V., Engen, S. et al. Combining counts and incidence data: an efficient approach for estimating the log-normal species abundance distribution and diversity indices. Oecologia 170, 477–488 (2012). https://doi.org/10.1007/s00442-012-2311-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00442-012-2311-2

Keywords

Navigation