Skip to main content
Advertisement
  • Loading metrics

Global terrestrial invasions: Where naturalised birds, mammals, and plants might spread next and what affects this process

Abstract

More species live outside their native range than at any point in human history. Yet, there is little understanding of the geographic regions that will be threatened if these species continue to spread, nor of whether they will spread. We predict the world’s terrestrial regions to which 833 naturalised plants, birds, and mammals are most imminently likely to spread, and investigate what factors have hastened or slowed their spread to date. There is huge potential for further spread of naturalised birds in North America, mammals in Eastern Europe, and plants in North America, Eastern Europe, and Australia. Introduction history, dispersal, and the spatial distribution of suitable areas are more important predictors of species spread than traits corresponding to habitat usage or biotic interactions. Natural dispersal has driven spread in birds more than in plants. Whether these taxa continue to spread more widely depends partially on connectivity of suitable environments. Plants show the clearest invasion lag, and the putative importance of human transportation indicates opportunities to slow their spread. Despite strong predictive effects, questions remain, particularly why so many birds in North America do not occupy climatically suitable areas close to their existing ranges.

Introduction

Understanding and predicting the spread of introduced species is one of the key conservation and ecological challenges of the 21st century [1]. However, we know little about what causes the introduced range of some species to increase rapidly, while other species remain in small, isolated populations years after establishing self-sustaining populations [2,3]. This major gap in our understanding prevents us from understanding how much of invasive spread is due to characteristics of the invader or the invaded environment. The most imminent threat is posed by the many thousands of species that are naturalised outside their native range and may continue to spread much more widely. However, there is a surprising lack of attention paid to the potential spread of already-naturalised species. Without understanding what has affected the spread of these species historically, we can assess neither which species are likely to spread further nor the geographic regions that will be most affected. This hampers pro-active management, of both already-introduced species and those yet to be introduced.

Species’ potential naturalised ranges are often calculated by “climate matching” to the conditions occupied in the native range [4]. This approach is widely used for invasion risk assessment [4]; however, the portion of this potential range that naturalised species fill is extremely variable [5, 6]. Some species spread rapidly [7, 8], and may even expand beyond native climate conditions [9], while others remain restricted to a portion of their potential range long after any expected invasion lag [6, 10]. The spatial distribution of suitable environmental conditions, introduction history, and species characteristics, all influence naturalised ranges [11], but their relative importance is unknown.

Concerning the spatial distribution of environmental conditions, climate matching rarely, if ever, accounts for the connectivity of suitable climate. In a landscape where suitable climate is continuous, it should be much easier for a species to spread than in a landscape with small and isolated fragments of suitable climate [12]. Concerning introduction history, time since introduction is a key factor predicting naturalised species range size [1315]. The number of individuals introduced (“propagule pressure”) also has an important role, especially soon after establishment [16,17]. Species introduced into habitat that is similar to their native range are more likely to establish and spread [18]. Species dispersed by humans are frequently more successful at spreading widely [14,19,20]. Concerning species characteristics, species that are able to increase population size quickly, compete effectively, adapt to novel environments, and spread widely are more likely to spread widely [21,22].

Evaluating the relative contribution of the above factors to range filling is crucial for invasion management and the approach best suited to predicting spread. For example, if low range filling is predominantly driven by invasion time lags and dispersal limitation, then already-introduced species could spread much more widely than they have to date [6]. In this case, we should focus research on spatial population dynamics. If species’ naturalised ranges are limited by habitat fragmentation [23], then we can categorise risk between landscapes and manage landscapes appropriately.

We undertake the first global assessment of the degree to which introduced birds, mammals, and plants, ranging from small annual herbs to long-lived large trees and from tiny herbivores to apex predators, have occupied climatically suitable areas in their introduced ranges. Our study is the first, to our knowledge, to incorporate climatic suitability when investigating invasion lags, and accounting for this major determinant of species’ naturalised ranges frees us to examine other drivers of spread robustly. We ask whether introduction history, species traits associated with establishment and spread, and the spatial distribution of suitable areas have hastened or slowed the spread of species. We identify geographic regions with striking invasion deficits, where many already-naturalised species could spread much more widely. We examine whether our evidence suggests that these species are indeed likely to continue to spread.

Results

Summary of introduced species and range filling

Every continent on Earth (except Antarctica) hosts at least 1 native and naturalised species from our dataset (S1 Fig). In their native ranges, birds occupied a median of 397 grid-cells, mammals 240, and plants 471 cells. Using range polygons, birds had a median native range size of 1.8 million km2, mammals 1.7 million km2, and plants 2.0 million km2. In all 3 taxonomic groups, naturalised range sizes were smaller: median values are 112,000 km2 for birds, 283,000 km2 for mammals, and 441,000 km2 for plants (Table 1).

thumbnail
Table 1. Correlates of range filling for each taxonomic group and model verification.

https://doi.org/10.1371/journal.pbio.3002361.t001

The proportion of climatically suitable areas successfully occupied by introduced species was in general low, but highly variable across species (S1 Table). Mammals occupied a median of 4% of available range, birds 1%, and plants 5%.

The areas to which the greatest number of already-naturalised birds could spread are in Mexico, the south-eastern and the south-western USA. Eastern Europe could face the spread of the greatest number of naturalised mammals. The greatest number of naturalised plant species could spread to the southern and eastern USA (and small pockets in north-western USA), Mexico, Eastern Europe, and southern, central, and western Australia (Fig 1). The geographic results come with the caveat that risk estimates can be affected by recording bias, the role of which we explore further below.

thumbnail
Fig 1. Areas where regionally naturalised species could expand further.

The numbers of (a) plants, (b) birds, and (c) mammals that could spread to each 10-min grid-cell based on the cell’s climatic suitability for each species. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905. Country and continent outlines were produced by the International Working Group on Taxonomic Databases for Plant Sciences (TDWG), specifically the WGSRPD Level 4 boundaries; data and usage notes can be found at (https://github.com/tdwg/wgsrpd).

https://doi.org/10.1371/journal.pbio.3002361.g001

Introduction history, species characteristics, and climate connectivity affecting range filling

Models successfully converged after 20,000 iterations. The proportion of range filling was highly variable across realms (Table 1 and S1 Table), and range filling was particularly high in Australia for plants and mammals, though not for birds (Table 1). For plants, time since introduction significantly increased range filling globally, age at first flowering significantly decreased range filling in Australia and the Nearctic, recording effort significantly decreased range filling in Australia and the Nearctic (Table 1 and Figs 2 and S2). For birds, contagion of suitable climate and natal dispersal distance significantly increased range filling globally, and the number of years since introduction significantly increased range filling in the Nearctic (Table 1 and Figs 2 and S3). For mammals, contagion of suitable climate significantly increased range filling globally. Natal dispersal distance was retained in the final model, even after several rounds of model simplification (i.e., removing it reduced model fit), though it did not have a statistically “significant” effect globally or in any realm (Table 1 and Figs 2 and S4).

thumbnail
Fig 2. Global parameter estimates of variables retained in the final Bayesian hierarchical model to correlate with species range filling for (a) plants, (b) birds, and (c) mammals.

A solid line signifies the estimate was consistently above or below 0 in >95% of simulations (and therefore judged as significant), a dashed line means it was not. The lighter shaded area shows the 95% probability density interval for the parameter estimate, and the darker shows the 50% interval. For regional effects, see Supporting information (S2S4 Figs). The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905.

https://doi.org/10.1371/journal.pbio.3002361.g002

Validation and sensitivity analyses

We identified areas where low recording effort could affect the number of species present or with potential to spread (S5 and S6 Figs).

We estimated the accuracy of climate matching predictions using the true skill statistic (TSS) scores generated using cross-validation. The median TSS score across all species was 0.62 (standard deviation = 0.21), which is considered to be “substantial” performance [24]. Species’ native presences in validation data were predicted very accurately; the median proportion of presences predicted to be in suitable climate was 0.99 (standard deviation = 0.06) and only 2 species had a prediction rate of under 0.6. Thus, it is unlikely that we underpredicted species’ potential naturalised ranges, based on the native range data available.

Analyses of range filling seem unlikely to have been affected by inclusion of species that have been recorded as naturalised in few grid-cells. Species range filling did increase when species have naturalised in more grid-cells, but the association was very weak for species that have naturalised in few grid-cells (S7 Fig). In a separate analysis, we did find that species niche filling increased with the number of grid-cells in which a species had naturalised, but with a very high degree of variability (S8 Fig). Therefore, to be sure that the number of grid-cells occupied did not affect estimates of what affects range filling, we re-ran the final model of factors affecting range filling for plants that had naturalised in >20 grid-cells. This analysis produced similar parameter estimates to the final model for species that had naturalised in >5 grid-cells (S2 Table), so the threshold of 5 was kept for all analyses.

When restricting potential ranges to a smaller portion of the native climate niche, geographic patterns of threat remain broadly similar (S9 Fig). However, the potential for birds to spread in North Africa and the Arabian Peninsula, and the potential for mammals to spread in Eastern Europe, does decline relative to other regions. The absolute number of plants that could spread within grid-cells in the Nearctic, Eastern Europe and Australasia, birds in the Nearctic, and mammals in Eastern Europe, declines by 40% to 50%.

Estimates for parameters retained in the final model are given as the mean estimate of all posterior draws, with the 2.5% and 97.5% estimates as confidence intervals in parentheses. Parameter estimates are given as the linear slope of the logit link equation. When parameter estimates vary across realms, this is indicated by providing the names of the realms in which it varies (Aus = Australian, Nea = Nearctic, Neo = Neotropical). Model verification data are given for the final models, including sample size, deviance information criterion (DIC) of the model, the effective number of parameters (pD), and correlation of the linear predictor against the link transformed response given as a pseudo R-squared. Note that a negative effect of recording effort means that more recorder effort in the potential naturalised range corresponded to lower range filling. See also S13 Table.

Discussion

The world is in no way saturated with naturalised species, even if no new species become naturalised in the future. Nearly all species we studied have yet to expand throughout most of the areas that are climatically suitable for them, within the biogeographic regions where they have naturalised. This is despite substantial time to invade: 25% have been established for over 150 years. The potential spread of naturalised species appears greatest in regions that are already heavily invaded, i.e., North America, Australia, and Europe (Fig 1). However, already-naturalised species also threaten areas of the world thought of as less invaded (Fig 1). In South America and Southern Africa around 200 regionally naturalised plants have the potential to spread widely—a number comparable to that in Australia and China, which have historically borne the brunt of biological invasions [2527]. Nonetheless, the potential for regionally naturalised species to spread in sub-Saharan Africa and the north of South America (Fig 1) is lower than one would expect from the globally high numbers of species naturalised there [1]. This is potentially encouraging given the recent increases in regional trade and transportation infrastructure in these regions [28] that could increasingly facilitate the spread of naturalised species. The potential for regionally naturalised species to spread in the eastern USA (particularly birds) and Mexico (particularly birds and plants) is higher than one would expect from the relative numbers of species naturalised there (Fig 1) [1]. Differences might be influenced by the broader taxonomic range measured in Dawson and colleagues [1] than that for which we could conduct climate matching. Nonetheless, the contrast illustrates how invasion threat cannot be characterised solely by the number of species naturalised.

The global potential for spread is high because range filling was consistently low (S1 Table). Why are such large areas as-yet unreached by species that have naturalised elsewhere in the biogeographic realm, and what does this suggest for the risk of spread?

Of the 3 taxonomic groups, time since introduction seemed to limit plant range filling most strongly. We suggest this is because plants generally have low dispersal [29] and are therefore most prone to time lags. However, dispersal ability itself did not significantly limit plants, potentially because long-distance dispersal events are so rare that they have little effect on range filling over the decades or centuries since introduction. Plant spread is often strongly dependent on human-assisted dispersal [19, 20]. Therefore, time since introduction may increase range filling by allowing for greater human transportation rather than natural dispersal. We note that horticulture does not seem to be the predominant human transportation mechanism here, as species associated with horticulture did not have significantly higher range filling than non-horticultural species. The potential importance of human transportation suggests that the threat in Fig 1a could be averted. For example, hundreds of plant species are naturalised in California (S1 Fig) and separated from climatically suitable areas in the eastern USA by mountains and desert. Only a few naturalised species have been transported between western and eastern USA [30], so controls on transport could be effective. Plants were the only taxon in which a non-dispersal trait affected range filling: flowering at an earlier age increased range filling, suggesting that a quick reproductive cycle assists range spread more so than natural dispersal over the decades or centuries since introduction.

Like plants, bird range filling was affected by time since introduction, but dispersal ability increased and contagion of climatically suitable areas decreased range filling. For mammals, while time since introduction limited range filling slightly when analysed alone, it was not retained in the final model. As with birds, contagion decreased mammal range filling, though there was no effect in the Nearctic, the realm where the potential for mammal spread was highest (Figs 1, 2, and S4).

Dispersal affected mammal range filling, but in the opposite direction to that for birds: strong mammalian dispersers tended to fill less of their potential naturalised ranges than weak dispersers. Mammal dispersal ability is highly positively correlated with body size [31], which typically correlates with having a larger home range size, requiring larger amounts of habitat [32]. In a preliminary analysis, both body mass and dispersal ability were trialled, but the high co-variance meant both could not be retained and of the 2, dispersal ability was retained as it was a better predictor. Therefore, difficulty establishing robust populations may hinder the spread of large-bodied species, masking any positive effect of dispersal ability. This interpretation is supported by the negative effect of contagion on mammal range filling, as fragmented habitat could slow both population establishment and natural dispersal. The contrasting results for dispersal and contagion between plants and the better dispersing mammals and birds suggest that fragmentation of suitable areas is more important when natural dispersal contributes heavily to spread.

We found no effect of population growth, generalism, or behavioural flexibility traits for mammals and birds, despite all having been shown to correspond to invasion success elsewhere [16, 17, 33]. It seems that for the relatively long-dispersing birds and mammals, once the availability of suitable climate is accounted for, interactions with native species and population growth rate are less important than simply being able to arrive in a new location.

We note that despite having the longest dispersal abilities in our dataset, birds displayed the lowest range filling. This may be because bird species had large native niches that cover a wide range of climates, a trend also found in multi-taxonomic comparisons of thermal niche width [34], and therefore have larger potential ranges than other taxa (particularly in the Afrotropical and Neotropical realms, S1 Table).

Both mammals and plants had strong differences in range filling between realms, with the highest level of range filling in Australia (S1 Table). This may be because of strong human influences on naturalised species ranges in Australia [35], where species have been introduced at or near multiple coastal cities throughout the areas climatically suitable for many of the species. Human-assisted dispersal from several population centres could help species spread, particularly plants [36]. Additionally, the dissimilar ecological assemblage between Australia and most other realms in the world could cause low biotic resistance [37]. In this case, a lack of ecologically similar species may reduce competition and predation and promote population growth and spread in the introduced species [3840].

Although pseudo-R2 values are convincing (Table 1), there remains unexplained variation in range filling. Of particular interest is the 26 birds introduced to Florida, of which 10 have not spread into any nearby climatically suitable areas in south-eastern USA, despite extensive transportation infrastructure and lack of apparent barriers (Fig 1). This is particularly surprising given high dispersal ability of birds. Some of these bird species may be restricted to their introduction locations because they are not truly self-sustaining and may be instead be supplemented by ongoing accidental releases [41]. Nevertheless, the large number of bird species that have not spread beyond Florida is still surprising. Potentially important factors we did not investigate directly are propagule pressure [14, 16, 17], repeated introductions that boost population size and genetic heterogeneity [16, 17], and the climatic suitability of initial sites of introduction [18]. In addition, introduced individuals often originate in a single part of the species’ native range. These individuals may have narrower climate tolerances than those of the species as a whole [42], and may only ever be able to fill a small part of the potential range calculated using the entire native range. If such local adaptation is behind the widespread lack of range unfilling, climate matching could be highly prone to overpredict potential ranges. While birds introduced into Florida show the most striking example of unexplained range filling, the above arguments could apply to other taxa and locations, and thus climate-matching should be treated with caution.

Variation in recorder effort [43] could affect our results in 3 ways. First, in understudied regions we may under-record species’ naturalised ranges (S5B, S5D, and S5E Fig) and thus overestimate the area to which they have yet to spread. However, the only effect of local recording effort we saw was the opposite: low effort corresponded with a slight increase range filling for plants (Table 1), an effect that was only significant in the Australian realm.

Second, the numbers of naturalised species may be underrepresented in under-recorded regions. It is likely our dataset overrepresents the number of naturalised species in the Nearctic, Palearctic, and Australasia (S10 Fig). Analyses exploring the possible effect of recording effort (S6 Fig) suggests that with less biased estimation of species numbers, eastern USA might no longer stand out as a global hotspot for the spread of naturalised birds (though the threat would remain moderate) and the threat in Mexico might become relatively low. The threat of bird spread in North Africa, the Arabian Peninsula, western Russia, south and south-eastern Asia, which is moderate based on raw species numbers (Fig 1), could have been substantially underestimated, and potentially be greater than the threat in eastern USA. Underestimation of naturalised species numbers could have obscured a large area of threat in China and pockets in Brazil. Underestimation would not alter the high threat from mammals and plants in parts of Eastern Europe and western Russia, but could have over-emphasised the threat in Western Europe relative to less well-recorded regions. One reason for geographic bias in species numbers is that we discarded species for which very little data were available. Many of these may be species that have narrow native ranges and climate niches, and have not spread widely once introduced, so pose relatively low threat. Therefore, adjusting for recorder effort does not necessarily represent threat more accurately than using uncorrected data. It is entirely plausible that the species in our dataset have greater potential naturalised ranges and ability to spread than those we excluded.

Third, if the native geographic ranges of naturalised species are under-recorded (S5A, S5C, and S5E Fig), we might underestimate species’ potential naturalised range size. This is unlikely to be the case for mammals and plants, since the native ranges of most species studied include the well-studied Western Europe and North America (S1A, S1C, and S1E Fig). However, many of the naturalised birds we studied are native to south Asia, South America, and sub-Saharan Africa, which have relatively low recording effort, so estimates of bird potential spread may be more conservative than those for mammals or plants. Moreover, there is now considerable evidence that many naturalised species undergo niche expansion, spreading to areas outside their native climate niche [9, 4446]. Therefore, our predictions of all naturalised species’ potential ranges could be conservative. We note there is no evidence of geographic trends in niche expansion, so this should not affect relative regional threat [9].

Given the potentially important effects of recorder effort, we suggest all large-scale threat assessments should explicitly explore the consequences.

A notable artefact in our results is the sharp division in invasion potential along the realm division between the east and west Palearctic, along the line of the Urals. This is a result of our realm definitions, as we did not consider species naturalised in one realm able to spread to the adjacent realm. Without this division, species native to East Asia would be considered native to Western Europe, and vice versa, when in reality many species have been introduced between the 2 regions by humans over recent centuries. Consequently, the number of species that could potentially spread from east to west Palearctic is likely underestimated, and the invasion deficit just east of the Urals may be higher than depicted in Fig 1.

Although our lists of naturalised species are not identical to the most up to date datasets of naturalised species [4749] for birds and mammals this is unlikely to affect our results (S1 Text). Our initial list of plants, however, contained <20% of the naturalised plants recorded by [48], though the geographic pattern of naturalisations within realms, is broadly similar to that in [48] (S1 Text and S10 Fig). However, comparisons of the invasion deficit between plants and other taxa should not be made.

The importance of time since introduction, dispersal ability, and landscape connectivity suggest that with time, plants, birds, and mammals may overcome barriers and spread more widely within the hotspots we identify. For plants in particular, the time lags observed in potential spread hotspots in N. America and Australia (Figs 1 and S2) suggest that more recent introductions are highly likely to spread, though the same is not necessarily true in the European hotspot where the time lag was not significant. For many mammals, their high potential for spread in Eastern Europe and Australia (and potentially North America, though this effect was not significant) seems to have been slowed by fragmentation of suitable climate, which may be overcome with time. For birds, it is strange that more species have not spread widely in their primary hotspot of potential spread in North America, and the factors we investigated failed to explain why. In the Nearctic realm, bird introduction dates are somewhat older, the landscapes they encounter somewhat less fragmented, and their range of dispersal ability similar to other realms (S4 Fig). Further predictors of range filling will be required to help know whether the threat in this hotspot will be realised.

Given that the majority of introduced species have low impact [50], how should we interpret the threat in hotspots of potential spread? First, even if the effect of individual species is small, “invasional meltdown” is common when multiple introduced species co-occur and amplify each other’s establishment and impact [51], and can be devastating [52, 53]. Although hard to predict, the more co-occurring introduced species in an area, the higher the likelihood of invasional meltdown. Second, species may not become problematic until after an initial period of lag, spread, and population growth [6]. Invasion lags are clearly present in our study species, so even species that are currently unobtrusive could become problematic, and this is clearly more likely in hotspots of potential spread. Finally, the large invasion deficits suggest that climate-matching does not, by itself, predict invasive spread in the short term, and more attention must be paid to other factors that determine species’ ranges.

Materials and methods

Collating naturalised species data and distributions

We identified birds, plants, and mammals that have established outside their native range following introduction by people. We included only species confirmed to have established and reproduced since 1770 on a mainland landmass outside their native continent, whose naturalised ranges do not rely exclusively on human activities such as irrigation or continued re-introduction, and for which we could clearly delineate their native range. For a full species list, see the Supporting information (S3S5 Tables).

Plant species were drawn from those not listed as “Casual Alien,” “unconfirmed naturalisation,” “Contaminant,” or “Native Weed” in Randall [54] and from the Global Invasive Species Information Network [55].

Bird species were compiled from known successful introduction events [41,56,57]. All migratory birds were removed from the list, due to difficulty defining a species’ range and climatic niche. Migratory status was confirmed using Handbook of the Birds of the World [58].

Mammal species were compiled from Capellini and colleagues [59]. A search was made for additional mammal species from various sources, but in final analyses none of these species were included due to either a lack of data on the known naturalised range or because long-term establishment could not be confirmed.

For all species, we obtained occurrence data from GBIF (downloaded 31 August 2017) using R’s dismo package [60]. For a record of all data sources, see the derived dataset at https://doi.org/doi.org/10.15468/dd.2zen56 [61]. Point data was then cleaned in several stages, firstly by removing points off-shore, then by removing points with low lat/lon precision (less than 10 arc-minute resolution), and then by removing points the exact centre of countries or states. Points were classified as either “native” or “naturalised” based upon sources listed in the Supporting information (S6 Table). Points that could not be validated using an independent state or national checklist as “native” or “naturalised” were discarded. Species that occupied fewer than 5 grid-cells after cleaning (at 10 arc-minute resolution) in their native or naturalised range were discarded. We mapped the potential and under-filled ranges for 65 mammal species, 114 bird species, and 616 plant species.

Although the process of filtering species to meet our criteria substantially reduced our species list, our initial sources of naturalised species are large and comprehensive. Therefore, we believe that our species lists are taxonomically broad, ecologically varied, and thus likely representative of the taxa introduced to each region. The 616 plant species in our list contain 391 genera and 116 families, the 114 bird species contain 28 families and 68 genera, and the 65 mammal species contain 50 genera and 24 families. In addition, while there is data bias towards some areas, notably Europe, North America, and Australia, there are examples of species in our database that are native to and naturalised in every major landmass on Earth except for Antarctica (S1 Fig).

We restricted predictions of each species’ potential naturalised ranges to the biogeographic realm/s into which that species were introduced. We used a published set of biogeographic realms [62], but with an additional distinction between western and eastern Palearctic along an approximate line of the Ural Mountains (S11 Fig). This was done because species that inhabited both western and eastern Palearctic were almost always native to one and naturalised in the other. The realms were created using multiple taxonomic groups, including birds, mammals, and plants, though the boundaries were similar to those created for each individual taxonomic group. For inter-taxonomic consistency, we used the same realms for every taxonomic group. Species that were found to be native and naturalised in the same biogeographic realm were removed from analysis due to the difficulty in exactly defining the native and naturalised ranges. If a species naturalised in multiple realms, each of the naturalised realms was examined separately.

Modelling species potential naturalised ranges

We modelled potential ranges using 3 climate variables: mean temperature of coldest month, mean temperature of warmest month, and total annual precipitation. These represent the most universal parsimonious variables that influence species ranges [44]. Including a larger number of variables results in forecasts of smaller potential ranges, and less transferability than the parsimonious set of variables [44]. Gridded climate data were downloaded from WorldClim at 10 arc-minute resolution. Each grid-cell contained average climatic variables from 1970 to 2000 [63].

For each species, we extracted the climate conditions in the entire biogeographic realms to which they were native or naturalised. Climate variables were scaled so all variables had a mean of 0 and an SD of 1, total annual precipitation was logged prior to scaling. Any occurrences in climate conditions that were not found in both the native and naturalised biogeographic ranges (no-analogue climate) were removed from analysis. We used principal components analysis (PCA) to produce a gridded climate space of 100 × 100 cells on 2 axes [64]. Within this climate space, we applied a kernel smoothed density function to GBIF records in order to calculate species’ occurrence densities in the climate conditions contained in each PCA grid-cell, which was then corrected by climate availability. A bivariate normal kernel was used, where the smoothing parameter was estimated using the ad hoc method, using the kernelUD function from the adehabitatHR package [65].

In order to measure each species’ potential naturalised range, we identified the PCA grid-cells that contain climate the species occupies in its native realm, i.e., the grid-cells in which native density was above 0. Note that the kernel density function creates some infinitesimally small densities, so PCA grid-cells with densities less than one thousandth of the value of the grid-cell with the highest occurrence density were considered to have a value of 0. We then identified the geographic grid-cells in the naturalised realm to which these PCA grid-cells corresponded. We restricted predictions of species’ potential naturalised ranges to the climate conditions present in both a species’ native and naturalised realms (i.e., “analogue climate”).

To measure the area of each species’ potential naturalised range to which it has yet to spread, we constructed a naturalised range polygon for each species using its GBIF occurrence data and level 4 geographic administrative units from the TDWG scheme [66]. Within each administrative unit, we calculated the occupied area using a convex hull polygon around each species’ occurrence data. We then aggregated all polygons for each species in each biogeographic realm. Any part of the species’ potential naturalised range that was not contained within these range polygons was classed as unoccupied. Range filling for each species in each naturalised realm was calculated as the proportion of the climatically suitable area that was filled by the naturalised range polygons.

Defining the climate niche using all PCA grid-cells the species occupies in its native realm could include outlying distribution points in climate conditions a species can poorly tolerate, inflating the species’ climate niche. As a sensitivity test, we recalculated potential ranges using only climate that fell with the 70% most densely occupied climate in the native realm.

The approach we used has 2 principal advantages over more standard species distribution model (SDM) approaches. First, applying kernel smoothers to a PCA of native and naturalised regions accounts for climate availability, which has been shown to have major effects on estimates of species’ potential naturalised ranges [64]. Second, the approach is akin to a “presence-only” SDM, which is more appropriate than a presence-absence SDM when modelling hundreds of species in multiple regions with highly varying recorder effort. If we were to have selected pseudo-absences to make presence-absence SDMs, in under-recorded areas many of these would have been false-absences, which would cause the SDM to underestimate species’ potential naturalised ranges [67]. Given the potential effects of recorder effort on our results (S5 and S6 Figs), the degree of underestimation would vary strongly between species and regions, and would have been much more severe than with a presence-only approach.

Validation of climate matching

We used cross-validation to evaluate how robust our climate matching method was, in particular whether we could have underestimated species’ native climate niches. To this end, we randomly split the native occurrence data into a training set (80% of the data) and a validation set (the remaining 20%). This was repeated 5 times, each with a different 20% of the native data. For each validation dataset, we generated 20 sets of pseudo-absences, each with the same number of pseudo-absences as native occurrences. Pseudo-absences were randomly selected from any grid-cell in the native realm outside of administrative units known to have native occurrences. We then ran the PCA model on the occurrences in each training dataset as described above and calculated the true presence accuracy and TSS [68] against the corresponding validation presence and pseudo-absence datasets. True presence accuracy describes what proportion of the true presences in the validation dataset were correctly predicted as a presence, a value of 1 indicates prediction was 100% accurate and therefore that the estimated native climate niche is extremely robust. TSS is a measure of sensitivity and specificity and requires both presences and absences. TSS scores based on presence-only models are quite sensitive to the method of selecting pseudo-absences [69], but repeated sampling from the background data mitigates this effect. However, for native species with small ranges that are mostly likely not at climatic equilibrium, true presence accuracy is a better sensitivity measure.

Sensitivity of invasion deficit estimates to recorder effort

We obtained published measures of recorder effort for plants [43, 70] and for birds and mammals, which compare the species that have GBIF records in a grid-cell with the species that are known to be in the region from surveys and expert knowledge. Meyer and colleagues [43] used these data to estimate the probability of detecting all known species in a given area. A value of zero indicates no recording effort and no species known to live there are detected, and a value of one indicates recording effort is sufficiently high that all species present will be recorded. Each 10-min grid-cell was assigned a detection probability by resampling from Meyer and colleagues [43] using a nearest neighbour method, which was necessary due to the different spatial resolutions of the 2 datasets. To calculate how numbers of native or naturalised species and potential for spread in each grid-cell might be altered once recording effort was compensated for, we multiplied the grid-cell value by the reciprocal of the detection probability. Detection probability was given a floor of 0.01% as otherwise the relative number of species once recording effort was accounted for could be hyper-inflated to unrealistic levels. It should be noted that Meyer and colleagues’ estimates were based on the recording effort of native species, not naturalised, but in the absence of quantitative data on the recording effort of naturalised species we thought it a sound assumption that the 2 would be closely correlated in most areas of the world. The corrected results do not necessarily represent a true prediction of the number of species, but highlight where uncertainty is highest and where large numbers of naturalised species are recorded despite poor recording effort. To our knowledge, this is the first time that a formal assessment of recorder effort has been applied to a global assessment of any aspect of biological invasions.

Introduction history, species characteristics, and climate connectivity affecting range filling

Introduction history.

The year of introduction is difficult to ascertain for most plants so the first confirmed record of occurrence in a realm was taken as the date of introduction, obtained from [71], the Atlas of Living Australia (ALA), Seebens and colleagues [72] and additional regional sources (see S7 Table). The year of introduction for birds was estimated using the first confirmed record from GAVIA [41] and Seebens and colleagues [71]. The year of introduction for mammals was estimated using the first confirmed record from Long [73] and Seebens and colleagues [71].

Previous studies have linked introduced species success to horticultural status and thus propagule pressure [74]. Therefore, whether a species was used in horticulture or not was extracted from Dave’s Garden PlantFiles (http://davesgarden.com/guides/pf/, accessed 25 May 2018) and from the Plant Information Online database (https://plantinfo.umn.edu/, accessed 25 May 2018).

We investigated whether range filling depended on biogeographic realm of naturalisation, which could indicate anthropogenic factors influencing spread or biotic resistance [23,75,76].

Species traits.

Rapid population growth likely increases range filling by allowing new populations to establish quickly and produce large numbers of propagules [21]. Traits associated with rapid population growth include time till sexual maturity [77] and seed/clutch size [21,59]. The age of a plant at first flowering and the seed number per flowering event were extracted from TRY [78]. For plants, we also investigated growth form obtained from TRY [78] and USDA Plants database. Plants were defined as either herbs, climbers, trees, shrubs, or ferns. For mammals, litter size, time till sexual maturity, and interbirth interval were taken from PanTHERIA [32]. For birds, clutch size and number of clutches per year were obtained from Myhrvold and colleagues [79].

Good competitive ability and adaptive capacity to novel environments could speed range expansion by allowing naturalised species to preempt resources from native species and invade novel niches. Corresponding traits include habitat generalism [16,17,21] and relative brain mass as an indicator of behavioural flexibility [16,80]. For both mammals and birds, habitat generalism was obtained for all species using the IUCN Habitat Classification Scheme (IUCN, 2023 [81]; accessed Nov 2018) and quantified as the number of general habitats as an integer and the number of sub-habitats as a decimal [21]. Information on brain residual size for mammals were taken from Sol and colleagues [80]. Data for brain size for birds was explored, but was not included as the data available would have resulted in a very limited sample size.

Dispersal ability has been linked to introduced species success, likely because it permits species to spread widely [22,36,77,82]. For many plant species, mean and maximum dispersal distance is frequently unknown, but dispersal distance can be estimated using a number of proxy life history traits [83]. Dispersal was estimated as a ranked category from 1 to 7, which correspond to increasing maximum dispersal distance on an approximately logarithmic scale [83]. Estimated dispersal distance varies depending on the species’ dispersal mode, plant height, habitat type, and taxonomic group [83]. We obtained the life history traits for calculating dispersal from the TRY database [78]. For birds, natal dispersal distance was estimated using diet, body mass, and wingspan [84]. Body mass and diet data were gathered from the EltonTraits database [85] and bird wingspan from del Hoyo and colleagues [58]. If only bird wing-length was available, bird wing-length was extrapolated to wingspan using the method in Garrard and colleagues [84]. For mammals, natal dispersal distance was estimated using body mass, home range size, and trophic level (for full method, see [86] from PanTHERIA [32]).

For all species, invasive status was determined by their description on GISIN (2015). Body mass was not used in the analysis as it covaried strongly with the estimated natal dispersal distance (Birds: Pearson’s correlation R-squared = 0.54; Mammals: R-squared = 0.93).

Spatial distribution of suitable areas.

We measured the fragmentation of climatically suitable areas in 2 different ways, the “contagion” of a landscape and the “clumpiness.” Both were calculated in the FRAGSTATS program [87]. These 2 metrics measure different aspects of fragmentation and have different consequences for interpretation [88].

Contagion describes how dominant and aggregated climatically suitable grid-cells are over a landscape and how interspersed it is with non-suitable grid-cells [87]. Values for contagion range from 0 (climatically suitable grid-cells are completely fragmented and rare) to 100 (climatically suitable grid-cells are completely dominant across the landscape). Contagion is a good measure of fragmentation when comparing within similar areas, such as in the same geographic realm. However, contagion typically correlates with total available area, and as a result it is not a good measure of fragmentation when comparing across geographic realms of different sizes.

Clumpiness describes how spatially aggregated suitable grid-cells are over a landscape, after accounting for the overall abundance of suitable grid-cells [87]. Values for clumpiness range between −1 (maximally disaggregated), 0 (spatially randomly distributed), and 1 (maximally aggregated). Clumpiness considers only the number of cell-adjacencies as a proportion of the total number of cells, so does not typically correlate with total available area, and therefore can be used to compare fragmentation between realms [89].

Local recording effort.

Ranges, and thus range filling, may be underestimated in areas of low recording effort. To investigate this, we used taxonomic coverage from the global sampling bias map of Meyer and colleagues [43]. For each species and realm, we calculated “local recording effort” as median taxonomic sampling coverage across all unoccupied grid-cells in the potential naturalised range.

For a summary list of all variables examined for each taxonomic group, see S8 Table.

Statistical analysis of characteristics corresponding to range filling

We conducted analyses for species for which we could obtain the necessary data: 242 plant species (484 establishment events), 35 bird species (50 establishment events), and 22 mammal species (46 establishment events).

For each taxonomic group, the relationships between species’ naturalised range filling and potential determinants were investigated using a hierarchical Bayesian model based on a beta distribution with a logit link. Realm, invasive status, growth form, horticultural status were treated as categorical, all other parameters were continuous. Body mass, home range size, and height were logged to improve linearity. All predictive parameters were centred on their mean and scaled by their standard deviation. We chose to use weakly informative priors throughout which favoured parameter estimates at or near zero (i.e., the null hypothesis that there is no relationship between predictor and response), but did not constrain the models from selecting nonzero estimates [90]. For categorical and hierarchical effects, we used a weakly informative half-Cauchy distribution for the standard deviation among categorical levels. This reflects the null hypothesis that there is no difference between levels and biases the model towards conservative parameter estimates at or near 0, and therefore avoids overestimating the size of categorical and hierarchical effects. An example model with 1 continuous variable and 1 hierarchical effect is included in S9 Table.

Models were run using a Markov chain Monte Carlo (MCMC) method in JAGS through the R package “R2jags” with a burn-in of 10,000 samples and checked for convergence after a further 20,000 samples, which was extended if estimates did not converge. Models were evaluated using the DIC, widely applicable information criterion (WAIC), leave-one-out (LOO) evaluation [91], and Pearson’s residual fit [92]. A pseudo-R2 for each model was also calculated as a squared sample correlation between the mean linear predictors and the link-transformed response.

Due to the large number of parameters, we carried out an initial screening process to identify variables of interest. We first investigated the relationship between range filling in each taxonomic group and each predictive variable individually and ran univariate models with each variable in turn, with naturalised biogeographic realm included as a hierarchical effect. Posteriors were checked for a single unimodal peak, and predictive variables whose posterior estimate centred near 0 (meaning the value 0 lay between the 5th or 95th percentile parameter estimates) were not analysed further. We also tested for realm-specific effects, by identifying regions where parameter estimates did not overlap with other regions’ (at 90% CI), and if variables showed strong trends in some realms but not others, they were retained in the final model.

In some cases, not all parameters identified by the univariate model process could be included in a multivariate simultaneously, due to strong co-variation in certain predictors. In these cases, we ran alternative multivariate models with either one of the predictors and compared results. In addition, models for birds and mammals with a full set of parameters (even when co-varying predictors were removed) frequently had issues with consistent convergence. If a model with a full set of parameters had issues with convergence or LOO, we trialled dropping parameters. In these cases, we would drop the least important variable in the full model to assist convergence, where the least important variable was defined as that with a mean parameter estimate nearest 0. If the resulting model still would not converge consistently, then another variable would be dropped. All of the variables dropped had mean parameter estimates at or near 0 (25% CI estimates all crossed 0), and no variable with obvious influence on the model was dropped in order to allow convergence. This process resulted in models that consistently produced stable, converged parameter estimates. Finally, we trialled several plausible, more parsimonious models by dropping less important variables to test if model fit could be further improved. All variables whose 95% CI estimates crossed 0 were trialled as candidates for dropping. The model was considered improved if there was a significant improvement in model fit (ΔDIC >5).

All variable combinations tested are included in S10S12 Tables. We considered the model with the lowest DIC value, and models with ΔDIC ≤5 above this to be equally plausible, excluding any models with convergence issues. For mammals and birds this identified a single, best model. In the case of plants, 2 models were equally plausible, and their parameter estimates were extremely similar (see S13 Table). In the main paper, we report the most parsimonious of the 2 models, which excluded variables which had very little influence on the model and whose parameter estimates all strongly centred on 0.

In the multivariate models, a parameter was classed as having a significant effect when mean, 2.5th and 97.5th percentile estimates fell above or below 0. We confirmed models were not mis-specified by checking that posterior parameter estimates were normal, as well as residual and LOO evaluation.

Sensitivity analysis for low number of occurrences and outlying climate

Analysis of species range filling is potentially sensitive to bias when including species with extremely low record numbers. We set a threshold of a minimum of 5 naturalised grid-cells, which allows rarer species and species with small total potential ranges to be included in the analysis. However, species with very few records may reflect a lack of detection rather than a failure to fill their range, and therefore, species niches and ranges may not be accurately characterised. We accounted for this by including 2 forms of sensitivity analysis.

First, we tested for a correlation between range filling and the number of occupied naturalised grid-cells using a Bayesian mixed model (S9 Table). The number of grid-cells was logged for normality and treated as a continuous parameter, and region was also included as a hierarchical effect. If species with very low numbers of grid-cells show drastically different patterns in range filling to other species, then it suggests that the overall model is very sensitive to species with very small ranges, and a higher threshold should be considered.

Second, we re-performed the final model of range filling in plants, but with a higher cutoff threshold of 20 grid-cells. If the model is robust to species with very low numbers of grid-cells, the model output should return similar parameter estimates. Unfortunately, a similar analysis could not be completed for birds or mammals as the smaller sample size associated with a higher threshold prevented model convergence.

Code used to perform all analyses, derived data, and data necessary to replicate analyses are available at https://doi.org/10.5281/zenodo.8205905. Shapefiles of each species estimated native and naturalised ranges, rasters of niche filling and expansion, diagnostic and validation statistics are too large to place in a repository and so are available upon request from the authors.

Supporting information

S1 Table. Summary statistics of introduced species and range filling for all taxonomic groups across all biogeographic realms.

“Median range size” is the median range size (calculated with minimum convex hull polygon) of all established naturalised species (measured in 1,000 km2). “Median potential range size” is the median area each species has available but has not colonised in each realm (measured in 1,000 km2). For each species, the total occupied area is divided by the total available area to return a range filling proportion. “Median proportion of filling” is the median range filling across all species in the given realm.

https://doi.org/10.1371/journal.pbio.3002361.s002

(DOCX)

S2 Table. Sensitivity analysis of correlates of range filling for plants.

This table presents results for the same model as in Table 1, but when only plants that have naturalised in >20 grid-cells were included. Estimates for parameters retained in the final model are given as the mean estimate of all posterior draws, with the 5% and 95% estimates as confidence intervals in parentheses. Parameter estimates are given as the linear slope of the logit link equation. When parameter estimates vary across realms, this is indicated by providing the names of the realms in which it varies (Aus = Australian, Nea = Nearctic, Neo = Neotropical). Model verification data are given for the final models, including sample size, DIC of the model, the effective number of parameters (pD), and correlation of the linear predictor against the link transformed response given as a pseudo R-squared.

https://doi.org/10.1371/journal.pbio.3002361.s003

(DOCX)

S3 Table. List of all plant species used in this study.

https://doi.org/10.1371/journal.pbio.3002361.s004

(DOCX)

S4 Table. List of all bird species used in this study.

https://doi.org/10.1371/journal.pbio.3002361.s005

(DOCX)

S5 Table. List of all mammal species used in this study.

https://doi.org/10.1371/journal.pbio.3002361.s006

(DOCX)

S6 Table. List of sources used to classify populations as native or naturalised.

Unless otherwise stated, plant sources were accessed June 2017, bird and mammal sources were accessed September 2017.

https://doi.org/10.1371/journal.pbio.3002361.s007

(DOCX)

S7 Table. List of sources that provided either first introduction date or first observed record of introduced species.

Taxa indicate which taxonomic group the source covered. All sources accessed March 2018.

https://doi.org/10.1371/journal.pbio.3002361.s008

(DOCX)

S8 Table. A summary of parameters trialled for plants, birds, and mammals.

All parameters in the “all variables trialled” list were included individually in a univariate Bayesian hierarchical model with region as the hierarchical effect. Parameters in bold in this column are variables that were significant (based on 90% CIs) either globally or in some at least some realms in the univariate model, and which were trialled in the multivariate model. Parameters in the “final variable list” are the parameters retained in the model for which results are described in the main manuscript.

https://doi.org/10.1371/journal.pbio.3002361.s009

(DOCX)

S9 Table. An example Bayesian model based on a beta distribution with a logit link written in JAGs.

This model contains 1 continuous parameter (beta1) and 1 hierarchical effect (cat1). N is the total sample size, N_cat1 is the total number of levels of cat1, y is the response variable, x1 is the continuous predictor variable (scaled to have a mean of 0 and a standard deviation of 2).

https://doi.org/10.1371/journal.pbio.3002361.s010

(DOCX)

S10 Table. A summary of all Bayesian hierarchical models trialled to correlate range filling with various traits and spatial features for plants.

Models 1 and 2 incorporate all variables that were important either at a global level or in some realms in univariate models (based on 90% CIs). Models were subsequently created as viable alternatives either to assist model convergence or to trial dropping unimportant variables (i.e., posterior estimates centred at or near 0). The final model used in the main manuscript is presented in bold. Alternative models that are equally viable are in italics.

https://doi.org/10.1371/journal.pbio.3002361.s011

(DOCX)

S11 Table. A summary of all Bayesian hierarchical models trialled to correlate range filling with various traits and spatial features for birds.

Models 1 and 2 incorporate all variables that were important either at a global level or in some realms in univariate models (based on 90% CIs). Models were subsequently created as viable alternatives either to assist model convergence or to trial dropping unimportant variables. The final model used in the main manuscript is presented in bold.

https://doi.org/10.1371/journal.pbio.3002361.s012

(DOCX)

S12 Table. A summary of all Bayesian hierarchical models trialled to correlate range filling with various traits and spatial features for mammals.

Model 1 incorporates all variables that were important either at a global level or in some realms in univariate models (based on 90% CIs). Models were subsequently created as viable alternatives either to assist model convergence or to trial dropping unimportant variables. The final model used in the main manuscript is presented in bold.

https://doi.org/10.1371/journal.pbio.3002361.s013

(DOCX)

S13 Table. Correlates of range filling for the 2 best, equally plausible models for plants.

Model numbers correspond to those in S10 Table. Estimates for parameters retained in the final model are given as the mean estimate of all posterior draws, with the 5% and 95% estimates as confidence intervals in parentheses. Parameter estimates are given as the linear slope of the logit link equation. When parameter estimates vary across realms, this is indicated by providing the names of the realms in which it varies (Aus = Australian, Nea = Nearctic, Neo = Neotropical). Model verification data are given for the final models, including sample size, DIC of the model, the effective number of parameters (pD), and correlation of the linear predictor against the link transformed response given as a pseudo R-squared. Note that a negative effect of recording effort means that more recorder effort in the potential naturalised range corresponded to lower range filling. Model 2 is presented in the main manuscript.

https://doi.org/10.1371/journal.pbio.3002361.s014

(DOCX)

S1 Fig. Raw number of study species’ native ranges (a, c, e) and naturalised ranges (b, d, f) that fall in each 10-min grid-cell.

Colours represent number of species. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905. Country and continent outlines were produced by the International Working Group on Taxonomic Databases for Plant Sciences (TDWG), specifically the WGSRPD Level 4 boundaries; data and usage notes can be found at (https://github.com/tdwg/wgsrpd).

https://doi.org/10.1371/journal.pbio.3002361.s015

(PNG)

S2 Fig. Plotted parameter estimates of all variables kept in the final model to explain plant species range filling.

On the left are global trends for each parameter, on the right are the realm hierarchical effects. (a) Years since introduction, (b) age of first flowering event; (c) the estimated local sampling effort. A solid line signifies the estimate was consistently above or below 0 in >95% of simulations (and therefore judged as significant), a dashed line means it was not. The lighter shaded area shows the 95% probability density interval for the parameter estimate, and the darker shows the 50% interval. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905.

https://doi.org/10.1371/journal.pbio.3002361.s016

(PNG)

S3 Fig. Plotted parameter estimates of all variables kept in the final mode to explain bird species range filling.

On the left are global trends for each parameter, on the right are the realm hierarchical effects. (a) Natal dispersal distance (km), (b) fragmentation of suitable climate (contagion), (c) years since introduction. A solid line signifies the estimate was consistently above or below 0 in >95% of simulations (and therefore judged as significant), a dashed line means it was not. The lighter shaded area shows the 95% probability density interval for the parameter estimate, and the darker shows the 50% interval. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905.

https://doi.org/10.1371/journal.pbio.3002361.s017

(PNG)

S4 Fig. Plotted parameter estimates of all variables kept in the final model to explain mammal species range filling.

On the left are global trends for each parameter, on the right are the realm hierarchical effects. (a) Fragmentation of suitable climate (contagion), (b) dispersal distance (logged km). A solid line signifies the estimate was consistently above or below 0 in >95% of simulations (and therefore judged as significant), a dashed line means it was not. The lighter shaded area shows the 95% probability density interval for the parameter estimate, and the darker shows the 50% interval. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905.

https://doi.org/10.1371/journal.pbio.3002361.s018

(PNG)

S5 Fig. Number of study species’ native ranges (a, c, e) and naturalised ranges (b, d, f) that fall in each 10-min grid-cell, after adjustment for recording effort (see Methods and Meyer and colleagues for a full description).

Colours represent the adjusted relative number of species. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905. Country and continent outlines were produced by the International Working Group on Taxonomic Databases for Plant Sciences (TDWG), specifically the WGSRPD Level 4 boundaries; data and usage notes can be found at (https://github.com/tdwg/wgsrpd).

https://doi.org/10.1371/journal.pbio.3002361.s019

(PNG)

S6 Fig. Relative threat from the spread of regionally naturalised species globally after adjustment for recording effort for (a) birds, (b) mammals, and (c) plants.

The number of species that could spread to each 10-min grid-cell is calculated in the same way as in Fig 1). This number was then multiplied by a measure of recording effort (proportion of known species per grid-cell that are actually reported in GBIF data, see Methods and Meyer and colleagues for a full description) to compensate for potential over- or under-recording of species. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905. Country and continent outlines were produced by the International Working Group on Taxonomic Databases for Plant Sciences (TDWG), specifically the WGSRPD Level 4 boundaries; data and usage notes can be found at (https://github.com/tdwg/wgsrpd).

https://doi.org/10.1371/journal.pbio.3002361.s020

(PNG)

S7 Fig. Correlation of number of naturalised grid-cells with proportion of range filling for plants (a), birds (b), and mammals (c).

A solid line signifies the estimate was consistently above or below 0 in >95% of simulations (and therefore judged as significant), a dashed line means it was not. The lighter shaded area shows the 95% probability density interval for the parameter estimate, and the darker shows the 50% interval. Point colour represents region, but as parameter estimates did not vary between region, only the global regression line is shown. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905.

https://doi.org/10.1371/journal.pbio.3002361.s021

(PNG)

S8 Fig. Correlation of number of naturalised grid-cells with proportion of niche filling for plants (a), birds (b), and mammals (c).

A solid line signifies the estimate was consistently above or below 0 in >95% of simulations (and therefore judged as significant), a dashed line means it was not. The lighter shaded area shows the 95% probability density interval for the parameter estimate, and the darker shows the 50% interval. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905.

https://doi.org/10.1371/journal.pbio.3002361.s022

(PNG)

S9 Fig. Relative threat from the spread of regionally naturalised species globally using only climate that fell with the 70% most densely occupied climate in the native realm for a+b) plants, c+d) birds, and e+f) plants.

Figures on the left depict the number of species that could spread to that area and are already naturalised within that realm. Figures on the right depict the discrepancy between threat metrics for 100% (Fig 1) and 70% niche overlap maps. Positive values mean that the number of species that are calculated to spread using 100% of occupied native climate is greater than when using 70% of occupied native climate. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905. Country and continent outlines were produced by the International Working Group on Taxonomic Databases for Plant Sciences (TDWG), specifically the WGSRPD Level 4 boundaries; data and usage notes can be found at (https://github.com/tdwg/wgsrpd).

https://doi.org/10.1371/journal.pbio.3002361.s023

(PNG)

S10 Fig. The proportion of species that fall in different geographic regions compared between our study and published databases for plants (GLONAF), birds (GAVIA), and mammals (DAMA).

Note that the geographic regions are specific to each published database, not to our study. Also note that proportions for published databases and for our study sum to >1, as some species have naturalised in multiple regions. The data underlying this figure can be found in https://doi.org/10.5281/zenodo.8205905.

https://doi.org/10.1371/journal.pbio.3002361.s024

(PNG)

S11 Fig. Biogeographic realms used in this paper.

Realms were defined by Holt and colleagues, with the addition of a line between the western and eastern Palearctic along the Ural Mountains. The data for this figure is not available from the authors of this study, but can be attained from Holt and colleagues. Credit: Journal Science/AAAS.

https://doi.org/10.1371/journal.pbio.3002361.s025

(PNG)

Acknowledgments

We would like to thank C. Meyer for providing data on global recording effort, Wayne Dawson and Chris Kaiser-Bunbury for comments on an earlier draft. Some of the text in this manuscript already appears in a thesis by Dr. Hakkinen [93].

References

  1. 1. Dawson W, Moser D, van Kleunen M, Kreft H, Pergl J, Pyšek P, et al. Global hotspots and correlates of alien species richness across taxonomic groups. Nat Ecol Evol. 2017;1:0186.
  2. 2. Blackburn TM, Pyšek P, Bacher S, Carlton JT, Duncan RP, Jarošík V, et al. A proposed unified framework for biological invasions. Trends Ecol Evol. 2011;26:333–339. pmid:21601306
  3. 3. Simberloff D, Gibbons L. Now you see them, now you don’t!—Population crashes of established introduced species. Biological Invasions. Kluwer Academic Publishers; 2004. p. 161–172. https://doi.org/10.1023/B:BINV.0000022133.49752.46
  4. 4. Pheloung PC, Williams PA, Halloy SR. A weed risk assessment model for use as a biosecurity tool evaluating plant introductions. J Environ Manage. 1999;57:239–251.
  5. 5. Aikio S, Duncan RP, Hulme PE. Lag-phases in alien plant invasions: separating the facts from the artefacts. Oikos. 2010;119:370–378.
  6. 6. Crooks JA. Lag times and exotic species: The ecology and management of biological invasions in slow-motion1. Écoscience. 2005;12:316–329.
  7. 7. Aagaard K, Lockwood J. Exotic birds show lags in population growth. Duncan R, editor. Divers Distrib. 2014;20:547–554.
  8. 8. Landschoff J, Lackschewitz D, Kesy K, Reise K. Globalization pressure and habitat change: Pacific rocky shore crabs invade armored shorelines in the Atlantic Wadden Sea. Aquat Invasions. 2013;8:77–87.
  9. 9. Häkkinen H, Hodgson D, Early R, Lenoir J. Plant naturalizations are constrained by temperature but released by precipitation. Glob Ecol Biogeogr. 2021;00:1–14.
  10. 10. Aikio S, Duncan RP, Hulme PE. Herbarium records identify the role of long-distance spread in the spatial distribution of alien plants in New Zealand. J Biogeogr. 2010;37:1740–1751.
  11. 11. Gallien L, Douzet R, Pratte S, Zimmermann NE, Thuiller W. Invasive species distribution models—how violating the equilibrium assumption can create new insights. Glob Ecol Biogeogr. 2012;21:1126–1136.
  12. 12. Årevall J, Early R, Estrada A, Wennergren U, Eklöf AC. Conditions for successful range shifts under climate change: The role of species dispersal and landscape configuration. Divers Distrib. 2018;24:1598–1611.
  13. 13. Byers JE, Smith RS, Pringle JM, Clark GF, Gribben PE, Hewitt CL, et al. Invasion Expansion: Time since introduction best predicts global ranges of marine invaders. Sci Rep. 2015;5:12436. pmid:26227803
  14. 14. Pyšek P, Manceur AM, Alba C, McGregor KF, Pergl J, Štajerová K, et al. Naturalization of central European plants in North America: species traits, habitats, propagule pressure, residence time. Ecology. 2015;96:762–774. pmid:26236872
  15. 15. Wilson JRU, Richardson DM, Rouget M, Procheş Ş, Amis MA, Henderson L, et al. Residence time and potential range: Crucial considerations in modelling plant invasions. Divers Distrib. 2007;13:11–22.
  16. 16. Vall-llosera M, Sol D. A global risk assessment for the success of bird introductions. J Appl Ecol. 2009;46:787–795.
  17. 17. Blackburn TM, Cassey P, Lockwood JL. The role of species traits in the establishment success of exotic birds. Glob Chang Biol. 2009;15:2852–2860.
  18. 18. Pyšek P, Hulme PE. Spatio-temporal dynamics of plant invasions: Linking pattern to process. Écoscience. 2005;12:302–315.
  19. 19. Wichmann MC, Alexander MJ, Soons MB, Galsworthy S, Dunne L, Gould R, et al. Human-mediated dispersal of seeds over long distances. Proc Biol Sci. 2009;276:523–532. pmid:18826932
  20. 20. Pyšek P, Jarošík V, Pergl J. Alien plants introduced by different pathways differ in invasion success: Unintentional introductions as a threat to natural areas. Traveset A, editor. PLoS ONE. 2011;6:e24890. pmid:21949778
  21. 21. Estrada A, Morales-Castilla I, Meireles C, Caplat P, Early R. Equipped to cope with climate change: traits associated with range filling across European taxa. Ecography. 2017 [cited 28 Feb 2018].
  22. 22. Moravcová L, Pyšek P, Jarošík V, Pergl J. Getting the Right Traits: Reproductive and Dispersal Characteristics Predict the Invasiveness of Herbaceous Plant Species. Li B, editor. PLoS ONE. 2015;10:e0123634. pmid:25906399
  23. 23. Lázaro-Lobo A, Ervin GN. Native and exotic plant species respond differently to ecosystem characteristics at both local and landscape scales. Biological Invasions 2020 23:1. 2020;23:143–156.
  24. 24. Landis JR, Koch GG. The Measurement of Observer Agreement for Categorical Data. Biometrics. 1977;33:159. pmid:843571
  25. 25. Cook GD, Dias L. It was no accident: deliberate plant introductions by Australian government agencies during the 20th century. Aust J Bot. 2006;54:601.
  26. 26. Hoffmann BD, Broadhurst LM. The economic cost of managing invasive species in Australia. NeoBiota. 2016;31:1–18.
  27. 27. Liu J, Liang SC, Liu FH, Wang RQ, Dong M. Invasive alien plant species in China: Regional distribution patterns. Divers Distrib. 2005;11:341–347.
  28. 28. Pojani D, Stead D. Sustainable Urban Transport in the Developing World: Beyond Megacities. Sustainability. 2015;7:7784–7805.
  29. 29. Tamme R, Götzenberger L, Zobel M, Bullock JM, Hooftman DAP, Kaasik A, et al. Predicting species’ maximum dispersal distances from simple plant traits. Ecology. 2014;95:505–513. pmid:24669743
  30. 30. Bradley BA, Early R, Sorte CJB. Space to invade? Comparative range infilling and potential range of invasive and native plants. Glob Ecol Biogeogr. 2015;24:348–359.
  31. 31. Whitmee S, Orme CDL. Predicting dispersal distance in mammals: a trait-based approach. Humphries M, editor. J Anim Ecol. 2013;82:211–221. pmid:22924343
  32. 32. Jones KE, Bielby J, Cardillo M, Fritz SA, O’Dell J, Orme CDL, et al. PanTHERIA: a species-level database of life history, ecology, and geography of extant and recently extinct mammals. Michener WK, editor. Ecology. 2009;90:2648–2648.
  33. 33. Szabo B, Damas-Moreira I, Whiting MJ. Can cognitive ability give invasive species the means to succeed? A review of the evidence. Front Ecol Evol. 2020;8:530130.
  34. 34. Rolland J, Salamin N. Niche width impacts vertebrate diversification. Glob Ecol Biogeogr. 2016;25:1252–1263.
  35. 35. Mack RN, Lonsdale WM. Humans as Global Plant Dispersers: Getting More Than We Bargained For: Current introductions of species for aesthetic purposes present the largest single challenge for predicting which plant immigrants will become future pests. Bioscience. 2001;51:95–102.
  36. 36. Wilson JRU, Dormontt EE, Prentis PJ, Lowe AJ, Richardson DM. Something in the way you move: dispersal pathways affect invasion success. Trends Ecol Evol. 2009;24:136–144. pmid:19178981
  37. 37. Harvey KJ, Nipperess DA, Britton DR, Hughes L. Australian family ties: does a lack of relatives help invasive plants escape natural enemies? Biol Invasions. 2012;14:2423–2434.
  38. 38. Jeschke J, Gómez Aparicio L, Haider S, Heger T, Lortie C, Pyšek P, et al. Support for major hypotheses in invasion biology is uneven and declining. NeoBiota. 2012;14:1–20.
  39. 39. Rejmánek M. A theory of seed plant invasiveness: The first sketch. Biol Conserv. 1996;78:171–181.
  40. 40. Levine JM, Adler PB, Yelenik SG. A meta-analysis of biotic resistance to exotic plant invasions. Ecol Lett. 2004;7:975–989.
  41. 41. Dyer EE, Cassey P, Redding DW, Collen B, Franks V, Gaston KJ, et al. The Global Distribution and Drivers of Alien Bird Species Richness. Ricklefs R, editor. PLoS Biol. 2017;15:e2000942. pmid:28081142
  42. 42. Chardon NI, Pironon S, Peterson ML, Doak DF. Incorporating intraspecific variation into species distribution models improves distribution predictions, but cannot predict species traits for a wide-spread plant species. Ecography. 2020;43:60–74.
  43. 43. Meyer C, Weigelt P, Kreft H. Multidimensional biases, gaps and uncertainties in global plant occurrence information. Lambers JHR, editor. Ecol Lett. 2016;19:992–1006. pmid:27250865
  44. 44. Early R, Sax DF. Climatic niche shifts between species’ native and naturalized ranges raise concern for ecological forecasts during invasions and climate change. Glob Ecol Biogeogr. 2014;23:1356–1365.
  45. 45. Hill MP, Gallardo B, Terblanche JS. A global assessment of climatic niche shifts and human influence in insect invasions. Glob Ecol Biogeogr. 2017;26:679–689.
  46. 46. Li Y, Liu X, Li X, Petitpierre B, Guisan A. Residence time, expansion toward the equator in the invaded range and native range size matter to climatic niche shifts in non-native species. Glob Ecol Biogeogr. 2014;23:1094–1104.
  47. 47. Biancolini D, Vascellari V, Melone B, Blackburn TM, Cassey P, Scrivens SL, et al. DAMA: the global Distribution of Alien Mammals database. Ecology. 2021;102:e03474. pmid:34273183
  48. 48. van Kleunen M, Pyšek P, Dawson W, Essl F, Kreft H, Pergl J, et al. The Global Naturalized Alien Flora (GloNAF) database. Ecology. 2019;100:e02542. pmid:30341991
  49. 49. Dyer EE, Redding DW, Blackburn TM. The global avian invasions atlas, a database of alien bird distributions worldwide. Sci Data. 2017;4:1–12. pmid:28350387
  50. 50. Vilà M, Basnou C, Pyšek P, Josefsson M, Genovesi P, Gollasch S, et al. How well do we understand the impacts of alien species on ecosystem services? A pan-European, cross-taxa assessment. Front Ecol Environ. 2010;8:135–144.
  51. 51. Braga RR, Gómez-Aparicio L, Heger T, Vitule JRS, Jeschke JM. Structuring evidence for invasional meltdown: broad support but with biases and gaps. Biol Invasions. 2018;20:923–936.
  52. 52. Green PT, O’Dowd DJ, Abbott KL, Jeffery M, Retallick K, Mac NR. Invasional meltdown: Invader–invader mutualism facilitates a secondary invasion. Ecology. 2011;92:1758–1768. pmid:21939072
  53. 53. Stanley MC, Nathan HW, Phillips LK, Knight SJ, Galbraith JA, Winks CJ, et al. Invasive interactions: can Argentine ants indirectly increase the reproductive output of a weed? Arthropod Plant Interact. 2013;7:59–67.
  54. 54. Randall RP. A global compendium of weeds. 3rd ed. Randall RP, editor. Perth, Australia: Department of Agriculture and Food Western Australia; 2017.
  55. 55. GISIN. Global Invasive Species Information Network, providing free and open access to invasive species data. In: USA [Internet]. 2015. http://www.gisin.org.
  56. 56. Lever C. Naturalised birds of the world. 1st ed. Bloomsbury Publishing; 2005. https://www.bloomsbury.com/uk/naturalised-birds-of-the-world-9780713670066/
  57. 57. Sol D, Maspons J, Vall-Llosera M, Bartomeus I, García-Peña GE, Piñol J, et al. Unraveling the life history of successful invaders. Science. 2012;337:580–583. pmid:22859488
  58. 58. del Hoyo J, Elliott A, Sargatal J, Christie DA, de Juana E. HBW Alive: Handbook of the Birds of the World Alive. In: Lynx Edicions, Barcelona [Internet]. 2018 [cited 20 Mar 2018]. https://www.hbw.com/.
  59. 59. Capellini I, Baker J, Allen WL, Street SE, Venditti C. The role of life history traits in mammalian invasion success. Wiens J, editor. Ecol Lett. 2015;18:1099–1107. pmid:26293900
  60. 60. Hijmans RJ, Phillips S, Leathwick J, Elith J. dismo: Species Distribution Modeling. R package version 13–9. 2022 [cited 2019 Jun 17]. https://CRAN.R-project.org/package=dismo.
  61. 61. Derived dataset GBIF.org. Filtered export of GBIF occurrence data. 2022.
  62. 62. Holt BG, Lessard J-P, Borregaard MK, Fritz SA, Araujo MB, Dimitrov D, et al. An Update of Wallace’s Zoogeographic Regions of the World. Science. 1979;2013(339):74–78. pmid:23258408
  63. 63. Fick SE, Hijmans RJ. Worldclim 2: New 1-km spatial resolution climate surfaces for global land areas. Int J Climatol. 2017 [cited 21 Mar 2018]. Available from: http://worldclim.org/version2.
  64. 64. Broennimann O, Fitzpatrick MC, Pearman PB, Petitpierre B, Pellissier L, Yoccoz NG, et al. Measuring ecological niche overlap from occurence and spatial environmental data. Glob Ecol Biogeogr. 2012;21:481–497.
  65. 65. Calenge C. The package “adehabitat” for the R software: A tool for the analysis of space and habitat use by animals. Ecol Modell. 2006;197:516–519.
  66. 66. Brummitt RK, Pando Francisco, International Working Group on Taxonomic Databases for Plant Science, Hunt Institute for Botanical Documentation. World geographical scheme for recording plant distributions. Published for the International Working Group on Taxonomic Databases for Plant Sciences (TDWG) by the Hunt Institute for Botanical Documentation, Carnegie Mellon University; 2001. https://www.kew.org/tdwg-world-geographical-scheme-recording-plant-distributions.
  67. 67. Early R, Sax DF. Analysis of climate paths reveals potential limitations on species range shifts. Ecol Lett. 2011;14:1125–1133. pmid:21955643
  68. 68. Lawson CR, Hodgson JA, Wilson RJ, Richards SA. Prevalence, thresholds and the performance of presence-absence models. Freckleton R, editor. Methods Ecol Evol. 2014;5:54–64.
  69. 69. Leroy B, Delsol R, Hugueny B, Meynard CN, Barhoumi C, Barbet-Massin M, et al. Without quality presence-absence data, discrimination metrics such as TSS can be misleading measures of model performance. J Biogeogr. 2018;45:1994–2002.
  70. 70. Meyer C, Kreft H, Guralnick R, Jetz W. Global priorities for an effective information basis of biodiversity distributions. Nat Commun. 2015;6:8221. pmid:26348291
  71. 71. Seebens H, Blackburn TM, Dyer EE, Genovesi P, Hulme PE, Jeschke JM, et al. No saturation in the accumulation of alien species worldwide. Nat Commun. 2017;8:14435. pmid:28198420
  72. 72. Atlas of Living Australia. 2018 [cited 2018 Aug 20]. http://www.ala.org.au.
  73. 73. Long JL. Introduced Mammals of the world. Their History, Distribution and Influence. Collingwood, Victoria, Australia, and CABI Publishing, Wallingford, United Kingdom. 2003.
  74. 74. van Kleunen M, Essl F, Pergl J, Brundu G, Carboni M, Dullinger S, et al. The changing role of ornamental horticulture in alien plant invasions. Biol Rev. 2018;93:1421–1437. pmid:29504240
  75. 75. Theoharides KA, Dukes JS. Plant invasion across space and time: factors affecting nonindigenous species success during four stages of invasion. New Phytologist. 2007;176:256–273. pmid:17822399
  76. 76. Roura-Pascual N, Hui C, Ikeda T, Leday G, Richardson DM, Carpintero S, et al. Relative roles of climatic suitability and anthropogenic influence in determining the pattern of spread in a global invader. Proc Natl Acad Sci U S A. 2011;108:220–225. pmid:21173219
  77. 77. Angert AL, Crozier LG, Rissler LJ, Gilman SE, Tewksbury JJ, Chunco AJ. Do species’ traits predict recent shifts at expanding range edges? Ecol Lett. 2011:677–689. pmid:21535340
  78. 78. Kattge J, Díaz S, Lavorel S, Prentice IC, Leadley P, Bönisch G, et al. TRY—a global database of plant traits. Glob Chang Biol. 2011;17:2905–2935.
  79. 79. Myhrvold NP, Baldridge E, Chan B, Sivam D, Freeman DL, Ernest SKM. An amniote life-history database to perform comparative analyses with birds, mammals, and reptiles. Ecology. 2015;96:3109–3000.
  80. 80. Sol D, Bacher S, Reader SM, Lefebvre L. Brain size predicts the success of mammal species introduced into novel environments. Am Nat. 2008;172(Suppl):S63–S71. pmid:18554145
  81. 81. IUCN. The IUCN Red List of Threatened Species. 2023. https://www.iucnredlist.org.
  82. 82. Václavík T, Meentemeyer RK. Equilibrium or not? Modelling potential distribution of invasive species in different stages of invasion. Divers Distrib. 2012;18:73–83.
  83. 83. Vittoz P, Engler R. Seed dispersal distances: a typology based on dispersal modes and plant traits. Bot Helv. 2007;117:109–124.
  84. 84. Garrard GE, McCarthy MA, Vesk PA, Radford JQ, Bennett AF. A predictive model of avian natal dispersal distance provides prior information for investigating response to landscape change. J Anim Ecol. 2012;81:14–23. pmid:21819396
  85. 85. Wilman H, Belmaker J, Simpson J, de la Rosa C, Rivadeneira MM, Jetz W. EltonTraits 1.0: Species-level foraging attributes of the world’s birds and mammals. Ecology. 2014;95:2027–2027.
  86. 86. Santini L, Di Marcc M, Visconti P, Baiserc D, Boitanl L, Rondinini C. Ecological correlates of dispersal distance in terrestrial mammals. Hystrix, the Italian Journal of Mammalogy. 2013;24:181–186.
  87. 87. McGarigal K, Cushman SA, Neel MC, Ene E. FRAGSTATS: Spatial Pattern Analysis Program for Categorical Maps. University of Massachusetts, Amherst; 2012. http://www.umass.edu/landeco/research/fragstats/fragstats.html.
  88. 88. Wang X, Blanchet FG, Koper N. Measuring habitat fragmentation: An evaluation of landscape pattern metrics. Tatem A, editor. Methods Ecol Evol. 2014;5:634–646.
  89. 89. Bradshaw CJA, Brook BW, Delean S, Fordham DA, Herrando-Pérez S, Cassey P, et al. Predictors of contraction and expansion of area of occupancy for British birds. Proc Biol Sci. 2014;281:20140744. pmid:24827448
  90. 90. Gelman A, Jakulin A, Pittau MG, Su YS. A weakly informative default prior distribution for logistic and other regression models. Ann Appl Stat. 2008;2:1360–1383.
  91. 91. Vehtari A, Gelman A, Gabry J. Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC. Stat Comput. 2017;27:1413–1432.
  92. 92. Cribari-Neto F, Zeileis A. Beta Regression in R. 2009 [cited 2019 Apr 11]. http://epub.wu.ac.at/726/.
  93. 93. Häkkinen H. Unexplained limits on species distributions: how important is climate in defining species’ range edges?: University of Exeter; 2020.