Abstract

Puccinia psidii is a major threat to members of the Myrtaceae worldwide. New Zealand is still free of P. psidii but possesses significant Eucalyptus plantations as well as valuable native tree species such as Metrosideros excelsa and Leptospermum scoparium that could be at risk in the event of an incursion. As part of preparing for a potential incursion, we generated a barcode reference library for >100 diverse species of Myrtaceae occurring in New Zealand by sequencing internal transcribed spacer (ITS), external transcribed spacer (ETS) and maturase K (matK), or by obtaining relevant sequences from GenBank. The Myrtaceae DNA barcoding database will enable rapid identification of large numbers of host species in the event of a myrtle rust incursion. We undertook a comparative analysis of the ability for the three mentioned loci to discriminate species. Interspecific divergence was assessed by mean interspecific distance, θ prime and minimum interspecific distance; intraspecific variation was evaluated by mean intraspecific difference, θ and coalescent depth. Overall identification efficiency of the three loci sequenced was determined using BLAST1 and Near Distance methods. Barcoding gaps between inter- and intraspecific divergences were also analysed. ITS and ETS share similar mean interspecific distance, θ prime and minimum interspecific distance values – both higher than obtained for matK. In contrast, in terms of intraspecific variation, matK had higher values than ITS and ETS in all three metrics analysed. ITS had the highest identification success rate for species followed by matK and ETS, respectively, as measured by BLAST1 and Near Distance methods. Overall identification success rate increases when a combination of ITS and matK in particular is used. The sequence data are not only a valuable reference collection for a myrtle rust response but also a national botanical resource.

Introduction

Puccinia psidii Winter, causal agent for what is known variously as myrtle rust, eucalyptus rust or guava rust, received particular attention when it was found to affect Eucalyptus L'Hér. in Brazil in the early 1900s (Joffily, 1944) and has since become an important biosecurity threat to many countries. The pathogen has extended its geographic range to other South and Central American countries. It has reached continental US (Marlatt and Kimbrough, 1979) and subsequently most of the Hawaiian Islands (Uchida et al., 2006). Japan (Kawanishi et al., 2009), Australia (Carnegie et al., 2010) and China (Zhuang and Wei, 2011) have not been spared, and most recently, records from South Africa (Roux et al., 2013), New Caledonia (F. Giblin, unpublished data) and Indonesia (A. McTaggart, personal communication) have been reported, despite warnings that P. psidii is a significant threat to Myrtaceae globally (Coutinho et al., 1998; Tommerup et al., 2003). In Australia, the presence of myrtle rust has taken on a new significance because the country represents one of the centres of diversity for Myrtaceae (Wilson et al., 2001), especially for species with capsular fruits such as Eucalyptus, which are almost entirely endemic to the region (Ladiges, 1997). The host range of P. psidii has expanded significantly since its introduction into Australia, infecting >250 species in 56 genera to date (Carnegie, 2015; Machado et al., 2015b) and with the potential to infect more as evidenced by artificial inoculation (Morin et al., 2012).

It is beyond the scope of this article to detail the taxonomy, biology and pathology of P. psidii, and readers are referred to Pegg et al. (2014) and Machado et al. (2015b) for an overview. Initial uncertainty regarding the identification of the pathogen in Australia (Elith et al., 2013) has been recognized as one of the reasons why the window of opportunity to eradicate has passed (Howard et al., 2015). Although a number of strains with differing infecting characteristics have been identified in South America, only a single strain is present in Australia (Loope and La Rosa, 2008; Graça et al., 2013; Machado et al., 2015a, b). Puccinia psidii in general harms young, developing tissue and depending on the host, infects young leaves, shoots, flower buds and/or fruit (Glen et al., 2007; Zauza et al., 2010) causing defoliation, branch dieback and abortion of flowers and fruits (Pegg et al., 2014). Unsurprisingly, myrtle rust has had a negative economic impact worldwide (e.g. Junghans et al., 2003; Ribeiro and Pommer, 2004; Loope and La Rosa, 2008), and in the context of Australia in particular, estimates by Cannon (2011) suggest an annual loss of 1.5 million m3 in plantation volume due to the disease. The full ecological and commercial impact of P. psidii in Australia will become evident within the foreseeable future (Carnegie and Lidbetter, 2012; Carnegie et al., 2015).

New Zealand is ‘myrtle rust’ free, for now. Climate niche modelling, based on recent climatic data, has highlighted regions in New Zealand's North Island that provide optimal conditions for growth and survival year round (Kriticos et al., 2013). However, the spectre of global warming will affect the range and impact of the pathogen (Bebber, 2015). There is concern that P. psidii may soon arrive in New Zealand (Goldson et al., 2015; Teulon et al., 2015) either by anthropogenic introduction (including myrtaceous nursery stock) or by long-distance dispersal (McKenzie, 1998; Clark, 2011). While other rust fungi have blown across the Tasman Sea and infected plants in New Zealand (McKenzie, 1998), there are indications that much of the long-distance spread of myrtle rust within Australia was due to human movement of infected material (Makinson, 2014). A negative correlation between spore release and increasing wind speed and turbulence intensity of the air was found (Zauza et al., 2015), but once released, the spores are capable of moving across smaller distances.

If the current strain of P. psidii in Australia was to arrive in New Zealand, it would threaten the country's mānuka honey industry utilizing Leptospermum scoparium J.R.Forst. & G.Forst and high-value environmental assets such as the iconic native trees Metrosideros excelsa Sol. ex Gaertn. (pohutukawa) and M. robusta A.Cunn. (northern rata), both of which are already under threat from browsing by possum (Trichosurus vulpecula Kerr). New Zealand only has 24 native species of Myrtaceae (De Lange and Rolfe, 2010), compared with more or less 2250 endemic species in Australia (CHAH Australian Plant Census, unpublished data). From cultivated New Zealand natives in Australia, it is known that P. psidii can infect M. excelsa, M. kermadecensis W.R.B.Oliv. and Lophomyrtus bullata Burret (Giblin and Carnegie, 2014). There are, however, numerous cultivated species of Eucalyptus recorded in New Zealand (386 species documented in the National Forestry Herbarium database) as well as a wide range of other genera with horticultural value. The strain of P. psidii present in Australia is currently not causing serious disease in mature eucalypt plantations, affecting only young trees in relatively small numbers (Carnegie, 2015) as opposed to one of the strains found in Brazil that severely affects eucalypt plantations (Graça et al., 2013).

New Zealand has been afforded the opportunity to prepare for a myrtle rust invasion. The Ministry for Primary Industries' (MPI) High Risk Site Surveillance programme is tasked with providing early detections of unwanted plant pests of tree species present in New Zealand (Stevens, 2008), and the programme has been put on a myrtle rust alert. The identification of species of Myrtaceae by morphological means can be time consuming and often requires a combination of characters, which may not all be present at a particular time. The more rapidly species of Myrtaceae can be identified, the more rapidly surveying for myrtle rust and a response to an incursion can take place. Accurate surveillance of infected hosts will enable MPI to consider appropriate biosecurity measures in order to protect threatened plant species and vegetation communities and to reduce the risk of spread to new areas by workers and visitors (Quinn and Buys, 2014). Moreover, transporting infected plants to herbaria across New Zealand for identification poses a risk of spore escape. In contrast, extracted DNA can be more safely transported, and with DNA barcoding, a large number of specimens can be identified in a relatively short space of time.

Collaboration between the National Forestry Herbarium and MPI Plant Health and Environment Laboratory scientists resulted in the development of a plant DNA barcoding database of >100 species of Myrtaceae occurring in New Zealand. DNA barcoding for identifying specimens relies on a reference library of sequences from known species. Sequence data generated from query specimens can then be identified by matching it to sequences in the reference library. The breadth and depth of the reference library determines the degree of identification success (Bergsten et al., 2012). Although there have been calls to improve the DNA barcoding process, e.g. Zhang et al. (2012), the power of current DNA barcoding procedures relies in its standardization and the subsequent scalability associated with that (Collins and Cruickshank, 2014). This work forms a key part of MPI's preparedness and ability to respond to damaging exotic pests and diseases in the event of an incursion (Quinn and Buys, 2014). We report on the efficacy of maturase K (matK), internal transcribed spacer (ITS) and the external transcribed spacer (ETS) to discern the selected species in our database.

Materials and methods

Taxon sampling

We compiled a species list after consulting the New Zealand National Forestry Herbarium (NZFRI) database, the New Zealand Virtual Herbarium and MPI data (species listed in the Plant Biosecurity Index, Plant Pest Information Network and High Risk Site Surveillance database). The following factors were considered when compiling this list: (1) maximization of sampling coverage across the family; (2) inclusion of species present in New Zealand that are known to be susceptible to myrtle rust (based on Australian myrtle rust host lists, e.g. www.dpi.nsw.gov.au/biosecurity/plant/myrtle-rust/hosts); (3) inclusion of species that have high economic significance (including about a dozen of the main species of Eucalyptus used in the forestry industry, Anonymous, 1995); (4) inclusion of species that have cultural significance (i.e. all native species) and (5) inclusion of species where sufficient material was available to provide three or more samples per species in order to identify interspecific variation. During the pilot phase of the project, loci for a small number of additional species other than those ultimately decided upon were found on GenBank or were in some instances generated by us. These additional data were included in the barcode reference library as well as the analysis reported on below. The full list of species in the barcode reference library, as well as corresponding GenBank numbers, is provided in Supplementary Tables S1–S3.

Barcoding loci

The Plant Working Group within the Consortium for the Barcode of Life (CBOL) recommends targeting two loci in the chloroplast genome for barcoding purposes, namely the ribulose-bisphosphate carboxylase (rbcL) gene and the matK gene. The combination of the two aforementioned loci has not proven to be equally informative across the entire plant world and overall has a relatively low discrimination success rate of 72 per cent at the rank of species (CBOL Plant Working Group, 2009). In particular, rbcL has fallen in disfavour when compared with matK (Hollingsworth et al., 2011). In contrast, the ITS from the nuclear genome is widely used in taxonomy and molecular phylogeny because it is easy to amplify and has a high degree of variation even between closely related species (Hollingsworth et al., 2011; Li et al., 2011). In particular, ITS2 has gained patronage for barcoding purposes and Chen et al. (2010) have shown that it has a discrimination success rate of 92.7 per cent in plants.

For the purposes of our work, we used matK and ITS (ITS1 and ITS2) as the primary loci and choose the 5′ ETS as a third region for a subset of taxa (Angophora Cav., Corymbia K.D.Hill & L.A.S.Johnson, Eucalyptus and Metrosideros Banks ex Gaertn.). ETS has not been promoted as a barcoding region (Hollingsworth et al., 2011), but has been used with success in combination with ITS in phylogenetic studies (e.g. Logacheva et al., 2010).

Obtaining samples

The existing collection in the National Forestry Herbarium provided 128 samples for DNA extraction. Together with specimens housed in the Auckland War Memorial Museum Herbarium (AK), herbarium specimens made up 43 per cent of samples obtained for this project. The majority of samples (54 per cent) were obtained from field-based collections and subsequently placed in silica gel. The remaining 3 per cent of the DNA extractions were obtained directly from fresh material. Voucher specimens for newly collected samples have been lodged at the National Forestry Herbarium (NZFRI).

DNA extraction

DNA was extracted from leaf samples using the NucleoSpin Plant II DNA extraction Kit (Cat. #740770.50, Macherey–Nagel), with minor modifications to the protocol. The modifications were as follows: (1) samples were chopped into small pieces and placed in FastPrep™ tubes from MP Biomedicals (containing ceramic bead and garnet chips); (2) 400 µl of Buffer PL2 and 13 µl RNase A were added and the tubes homogenized in an OmniRuptor from Omni International for two cycles of 5 m/s for 20 sec; (3) after incubation for at least 60 min at 65°C, 100 µl of Buffer PL3 was added and tubes incubated on ice for 5 min and (4) tubes were centrifuged for 5 min at maximum speed and the cleared supernatant was loaded onto the NucleoSpin® Filters. DNA was stored at −20°C.

Polymerase chain reaction and sequencing

Two different polymerase chain reaction (PCR) protocols were used in this study. Generally, ITS PCRs were more difficult to amplify and JumpStart™ REDTaq® ReadyMix (Sigma) was the polymerase that worked best (Protocol 1). matK and ETS PCRs generally worked well using our standard PCR protocol with HOT FIREPol® Blend Master Mix, Ready to Load (Solis BioDyne) (Protocol 2).

Protocol 1: PCR reactions contained 1× JumpStart™ REDTaq® ReadyMix, 10 µg bovine serum albumin, primers (Supplementary Table S4) to a final concentration of 0.3 µM. PCR was carried out on a Perkin Elmer PE9700 thermal cycler using the following programme: initial denaturation at 94°C for 3 min, followed by 35 cycles of 94°C for 30 sec, annealing at 56°C for 30 sec, 72°C for 1 min and a final 10 min extension step at 72°C.

Protocol 2: PCR reactions contained HOT FIREPol® Blend Master Mix, Ready to Load, primers to a final concentration of 0.3 µM. PCR was carried out on a PE9700 thermal cycler using the following programme: initial denaturation at 94°C for 15 min, followed by 35 cycles of 94°C for 30 sec, annealing at 58°C for 40 sec, 72°C for 1 min and a final 10 min extension step at 72°C.

Successful PCR products were treated with Exonuclease I (Fermentas) and FastAP Thermosensitive Alkaline Phosphatase (Fermentas) prior to sequencing. Sequencing was done either by Macrogen (Korea) or in-house on the 3500 Series Genetic Analyzer (Applied Biosystems®) using BigDye® Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems®).

Sequence length, inter- and intraspecific divergence, barcoding gap and specimen identification

The inter- and intraspecific divergences were calculated by first using ClustalW to align sequences and then calculating Kimura 2-parameter distances (Kimura, 1980) using PAUP (Swofford, 2002). Sequences with ambiguous bases or incomplete sequence lengths were excluded from the analyses, e.g. matK data for Blepharocalyx O.Berg., Darwinia capitellata Rye and D. meeboldii C.A.Gardner obtained from GenBank.

Interspecific divergence was assessed by (1) mean interspecific distance (Meier et al., 2008); (2) θ prime (the mean pairwise distances calculated for each species that has more than one representative, thereby eliminating biases associated with uneven sampling among taxa; Meyer and Paulay, 2005) and (3) minimum interspecific distance (Meier et al., 2008).

Intraspecific variation was evaluated by (1) mean intraspecific difference, (2) θ and (3) average coalescent depth (the maximum distance from tips of a node linking all sampled members of a species, i.e. ‘book-ending’ intraspecific variability; Lahaye et al., 2008).

DNA barcoding gaps (difference between inter- and intraspecific genetic distances within a group of organisms) were analysed following Ross et al. (2008 and references therein) and species identification success was calculated using the BLAST1 method (the assigned identification is that of the species associated with the best BLAST hit) as well as Nearest Distance method in which the assigned identification is that of the species of the sequence having the smallest genetic distance from the query (Ross et al., 2008).

Results

Lists of newly obtained barcode sequences (with GenBank accession and herbarium voucher numbers) as well as existing sequences sourced from GenBank (with GenBank numbers only) that make up the barcode reference library are provided for matK, ITS and ETS (Supplementary Tables S1–S3).

ITS and ETS share similar mean interspecific distance, θ prime and minimum interspecific distance values – both higher than obtained for matK (Table 1). In contrast, in terms of intraspecific variation, matK had higher values than ITS and ETS in all three metrics analysed. The barcoding gap assessments demonstrate that the intra- and interspecific variations of matK and ETS exhibit the most amount of overlap, and ITS the least (Figure 1).

Table 1

Analysis of inter- and intraspecific divergences in species analysed for barcoding Myrtaceae in New Zealand when including alleles (three loci) and excluding alleles (two loci)

DivergencesIncluding alleles
Excluding alleles
matKITSETSITSETS
Interspecific
 Mean interspecific distance0.0080 ± 0.00480.0291 ± 0.01960.0349 ± 0.02320.0294 ± 0.01920.0351 ± 0.0241
 θ prime0.0096 ± 0.00530.0353 ± 0.02420.0347 ± 0.01310.0359 ± 0.02430.0351 ± 0.0135
 Minimum interspecific distance0.0042 ± 0.00550.0179 ± 0.02600.0098 ± 0.01570.0181 ± 0.02630.0097 ± 0.0150
Intraspecific
 Mean intraspecific distance0.0040 ± 0.01580.0037 ± 0.00560.0062 ± 0.01190.0036 ± 0.00540.0066 ± 0.0123
 θ0.0043 ± 0.01500.0030 ± 0.00460.0026 ± 0.00430.0032 ± 0.00480.0029 ± 0.0051
 Coalescent depth0.0079 ± 0.02790.0053 ± 0.00750.0051 ± 0.00870.0054 ± 0.00740.0061 ± 0.0123
DivergencesIncluding alleles
Excluding alleles
matKITSETSITSETS
Interspecific
 Mean interspecific distance0.0080 ± 0.00480.0291 ± 0.01960.0349 ± 0.02320.0294 ± 0.01920.0351 ± 0.0241
 θ prime0.0096 ± 0.00530.0353 ± 0.02420.0347 ± 0.01310.0359 ± 0.02430.0351 ± 0.0135
 Minimum interspecific distance0.0042 ± 0.00550.0179 ± 0.02600.0098 ± 0.01570.0181 ± 0.02630.0097 ± 0.0150
Intraspecific
 Mean intraspecific distance0.0040 ± 0.01580.0037 ± 0.00560.0062 ± 0.01190.0036 ± 0.00540.0066 ± 0.0123
 θ0.0043 ± 0.01500.0030 ± 0.00460.0026 ± 0.00430.0032 ± 0.00480.0029 ± 0.0051
 Coalescent depth0.0079 ± 0.02790.0053 ± 0.00750.0051 ± 0.00870.0054 ± 0.00740.0061 ± 0.0123
Table 1

Analysis of inter- and intraspecific divergences in species analysed for barcoding Myrtaceae in New Zealand when including alleles (three loci) and excluding alleles (two loci)

DivergencesIncluding alleles
Excluding alleles
matKITSETSITSETS
Interspecific
 Mean interspecific distance0.0080 ± 0.00480.0291 ± 0.01960.0349 ± 0.02320.0294 ± 0.01920.0351 ± 0.0241
 θ prime0.0096 ± 0.00530.0353 ± 0.02420.0347 ± 0.01310.0359 ± 0.02430.0351 ± 0.0135
 Minimum interspecific distance0.0042 ± 0.00550.0179 ± 0.02600.0098 ± 0.01570.0181 ± 0.02630.0097 ± 0.0150
Intraspecific
 Mean intraspecific distance0.0040 ± 0.01580.0037 ± 0.00560.0062 ± 0.01190.0036 ± 0.00540.0066 ± 0.0123
 θ0.0043 ± 0.01500.0030 ± 0.00460.0026 ± 0.00430.0032 ± 0.00480.0029 ± 0.0051
 Coalescent depth0.0079 ± 0.02790.0053 ± 0.00750.0051 ± 0.00870.0054 ± 0.00740.0061 ± 0.0123
DivergencesIncluding alleles
Excluding alleles
matKITSETSITSETS
Interspecific
 Mean interspecific distance0.0080 ± 0.00480.0291 ± 0.01960.0349 ± 0.02320.0294 ± 0.01920.0351 ± 0.0241
 θ prime0.0096 ± 0.00530.0353 ± 0.02420.0347 ± 0.01310.0359 ± 0.02430.0351 ± 0.0135
 Minimum interspecific distance0.0042 ± 0.00550.0179 ± 0.02600.0098 ± 0.01570.0181 ± 0.02630.0097 ± 0.0150
Intraspecific
 Mean intraspecific distance0.0040 ± 0.01580.0037 ± 0.00560.0062 ± 0.01190.0036 ± 0.00540.0066 ± 0.0123
 θ0.0043 ± 0.01500.0030 ± 0.00460.0026 ± 0.00430.0032 ± 0.00480.0029 ± 0.0051
 Coalescent depth0.0079 ± 0.02790.0053 ± 0.00750.0051 ± 0.00870.0054 ± 0.00740.0061 ± 0.0123
Figure 1

The barcoding gap between interspecific (black) and intraspecific (grey) divergences for the three analysed loci for barcoding Myrtaceae in New Zealand. (A) matK, (B) ITS and (C) ETS.

At the species level, ITS had the highest identification success rate followed by matK and ETS, respectively, as measured by both methods (Table 2). The 90.4 per cent success rate of ITS (ITS1 and ITS2) determined by the BLAST1 method is considerably higher than the 76.1 per cent obtained by Yao et al. (2010) based on an analysis of ITS2 alone and a sample size of 34 676 dicotyledons. Because we sampled multiple samples per species, we were able to identify different alleles of ITS and ETS. Excluding alleles from the calculations did not have a large effect on the divergences and identification efficiency rates (Tables 1 and 2).

Table 2

Overall identification efficiency (%) of the loci sequenced in species analysed for barcoding Myrtaceae in New Zealand using the BLAST1 and Near Distance methods when including alleles (three loci) or excluding alleles (two loci)

MethodIncluding alleles
Excluding alleles
matKITSETSITSETS
BLAST181.890.476.090.375.4
Near distance63.174.747.576.349.1
MethodIncluding alleles
Excluding alleles
matKITSETSITSETS
BLAST181.890.476.090.375.4
Near distance63.174.747.576.349.1
Table 2

Overall identification efficiency (%) of the loci sequenced in species analysed for barcoding Myrtaceae in New Zealand using the BLAST1 and Near Distance methods when including alleles (three loci) or excluding alleles (two loci)

MethodIncluding alleles
Excluding alleles
matKITSETSITSETS
BLAST181.890.476.090.375.4
Near distance63.174.747.576.349.1
MethodIncluding alleles
Excluding alleles
matKITSETSITSETS
BLAST181.890.476.090.375.4
Near distance63.174.747.576.349.1

Of the 43 genera analysed, identification success of specimens to species was 100 per cent for those genera represented by a relatively small number of species in all three analysed loci (Table 3). Kunzea and Eucalyptus, represented by 5 and 27 species, respectively, in matK, had the two lowest overall success rates (20and 22 per cent, respectively) because sequences were shared by more than one species. Success rates in ITS were on average higher in species where matK success rate values were <100 per cent. Metrosideros and Syzygium were the only genera where the success rate of identifying specimens was lower with ITS compared with matK. While the taxon sampling for ETS was small, lower success rates were also present in the more species-rich genera. The 100 per cent success rate of ETS in identifying species of Metrosideros is noteworthy.

Table 3

Identification efficiency (%) of the three loci per genus analysed for barcoding Myrtaceae in New Zealand using the BLAST1 method

GenusmatK
ITS
ETS
No. of speciesNo. of samplesSuccess rate at the species level (%)No. of speciesNo. of samplesSuccess rate at the species level (%)No. of speciesNo. of samplesSuccess rate at the species level (%)
Acca1410014100
Agonis1410014100
Amomyrtus1210024100
Angophora38673810026100
Astartea1310013100
Backhousia131001310012100
Baeckea1210011100
Beaufortia1410014100
Blepharocalyx13100
Callistemon415754187512100
Calothamnus1310013100
Calytrix1310012100
Carpolepis1110011100
Chamelaucium3710013100
Corymbia41510041310041675
Darwinia2410024100
Eucalyptus2796222610550288732
Eugenia1510015100
Heteropyxis1510012100
Hypocalymma1410014100
Kunzea5152041950
Leptospermum5188052110011100
Lophomyrtus271002610011100
Lophostemon1310013100
Luma1310013100
Melaleuca719100619100
Metrosideros1452861462791461100
Micromyrtus1410011100
Myrciaria1310013100
Myrteola1110011100
Myrtus14100112100
Neomyrtus1310015100
Pimenta1310013100
Psidium141001410012100
Sannantha1310012100
Syncarpia1310012100
Syzygium6221006198311100
Taxandria3933412100
Thaleropia1310012100
Thryptomene1310014100
Tristaniopsis1410013100
Ugni1310013100
Xanthostemon1310014100
GenusmatK
ITS
ETS
No. of speciesNo. of samplesSuccess rate at the species level (%)No. of speciesNo. of samplesSuccess rate at the species level (%)No. of speciesNo. of samplesSuccess rate at the species level (%)
Acca1410014100
Agonis1410014100
Amomyrtus1210024100
Angophora38673810026100
Astartea1310013100
Backhousia131001310012100
Baeckea1210011100
Beaufortia1410014100
Blepharocalyx13100
Callistemon415754187512100
Calothamnus1310013100
Calytrix1310012100
Carpolepis1110011100
Chamelaucium3710013100
Corymbia41510041310041675
Darwinia2410024100
Eucalyptus2796222610550288732
Eugenia1510015100
Heteropyxis1510012100
Hypocalymma1410014100
Kunzea5152041950
Leptospermum5188052110011100
Lophomyrtus271002610011100
Lophostemon1310013100
Luma1310013100
Melaleuca719100619100
Metrosideros1452861462791461100
Micromyrtus1410011100
Myrciaria1310013100
Myrteola1110011100
Myrtus14100112100
Neomyrtus1310015100
Pimenta1310013100
Psidium141001410012100
Sannantha1310012100
Syncarpia1310012100
Syzygium6221006198311100
Taxandria3933412100
Thaleropia1310012100
Thryptomene1310014100
Tristaniopsis1410013100
Ugni1310013100
Xanthostemon1310014100

Calculations based on a subset of sequences provided in Supplementary Tables S1–S4 due to partial sequences and sequences with ambiguous bases being excluded.

Table 3

Identification efficiency (%) of the three loci per genus analysed for barcoding Myrtaceae in New Zealand using the BLAST1 method

GenusmatK
ITS
ETS
No. of speciesNo. of samplesSuccess rate at the species level (%)No. of speciesNo. of samplesSuccess rate at the species level (%)No. of speciesNo. of samplesSuccess rate at the species level (%)
Acca1410014100
Agonis1410014100
Amomyrtus1210024100
Angophora38673810026100
Astartea1310013100
Backhousia131001310012100
Baeckea1210011100
Beaufortia1410014100
Blepharocalyx13100
Callistemon415754187512100
Calothamnus1310013100
Calytrix1310012100
Carpolepis1110011100
Chamelaucium3710013100
Corymbia41510041310041675
Darwinia2410024100
Eucalyptus2796222610550288732
Eugenia1510015100
Heteropyxis1510012100
Hypocalymma1410014100
Kunzea5152041950
Leptospermum5188052110011100
Lophomyrtus271002610011100
Lophostemon1310013100
Luma1310013100
Melaleuca719100619100
Metrosideros1452861462791461100
Micromyrtus1410011100
Myrciaria1310013100
Myrteola1110011100
Myrtus14100112100
Neomyrtus1310015100
Pimenta1310013100
Psidium141001410012100
Sannantha1310012100
Syncarpia1310012100
Syzygium6221006198311100
Taxandria3933412100
Thaleropia1310012100
Thryptomene1310014100
Tristaniopsis1410013100
Ugni1310013100
Xanthostemon1310014100
GenusmatK
ITS
ETS
No. of speciesNo. of samplesSuccess rate at the species level (%)No. of speciesNo. of samplesSuccess rate at the species level (%)No. of speciesNo. of samplesSuccess rate at the species level (%)
Acca1410014100
Agonis1410014100
Amomyrtus1210024100
Angophora38673810026100
Astartea1310013100
Backhousia131001310012100
Baeckea1210011100
Beaufortia1410014100
Blepharocalyx13100
Callistemon415754187512100
Calothamnus1310013100
Calytrix1310012100
Carpolepis1110011100
Chamelaucium3710013100
Corymbia41510041310041675
Darwinia2410024100
Eucalyptus2796222610550288732
Eugenia1510015100
Heteropyxis1510012100
Hypocalymma1410014100
Kunzea5152041950
Leptospermum5188052110011100
Lophomyrtus271002610011100
Lophostemon1310013100
Luma1310013100
Melaleuca719100619100
Metrosideros1452861462791461100
Micromyrtus1410011100
Myrciaria1310013100
Myrteola1110011100
Myrtus14100112100
Neomyrtus1310015100
Pimenta1310013100
Psidium141001410012100
Sannantha1310012100
Syncarpia1310012100
Syzygium6221006198311100
Taxandria3933412100
Thaleropia1310012100
Thryptomene1310014100
Tristaniopsis1410013100
Ugni1310013100
Xanthostemon1310014100

Calculations based on a subset of sequences provided in Supplementary Tables S1–S4 due to partial sequences and sequences with ambiguous bases being excluded.

Discussion

The main aim of DNA barcoding is to establish a shared community resource of DNA sequences that can be used for organismal identification and taxonomic clarification (Hollingsworth et al., 2011). Unlike taxon- or clade-based approaches where all the species of a clade are barcoded, e.g. Seberg and Petersen (2009), others follow a floristic approach by targeting a variety of species in a particular area, e.g. de Vere et al. (2012). Our DNA barcode reference library encompasses both approaches. We focussed on the Myrtaceae occurring in New Zealand and therefore did not barcode all the taxa in the Myrtaceae clade. However, we targeted all native species and ensured at least all known exotic genera of Myrtaceae in New Zealand were represented in the reference library with particular emphasis on species commonly utilized in the forestry industry. The choice of loci ensures that our reference library is scalable and it can accommodate improved taxon sampling in the future.

Despite the near-universal usage of ITS sequence data in plant phylogenetic studies, its complex and unpredictable evolutionary behaviour can reduce its utility for phylogenetic or barcoding analyses (Álvarez and Wendel, 2003). While the extent of paralogy, polymorphic sites and sequence quality can be quantified (Bailey et al., 2003), it has been shown that where the presence of rDNA pseudogenes has gone undetected in previous studies of eucalypt phylogeny, the impact on reconstructions of taxon relationships was minimal where taxa were sampled sparsely (Bayly et al., 2008). There are Myrtaceae datasets that are free of pseudogenes (Steane et al., 2002; Murillo-Aldana et al., 2011), and there are those that are not (Bayly et al., 2008). One method to initially determine the presence of pseudogenes is to visually search for unusual sequences in the 5.8S rRNA gene (Bayly et al., 2008). An inspection of our entire ITS dataset revealed no unusual 5.8S rRNA sequences comparable with those reported by Bayly et al. (2008).

Like all other barcoding projects, we present relative rather than absolute discriminatory power of different barcoding regions because our taxon sampling is not complete for the Myrtaceae. Furthermore, different taxonomic opinions exist and can influence the taxon sampling and hence efficacy values. The efficacy (accuracy) of identifying species via barcoding data is reliant on the extent of, and separation between, intra- and interspecific divergences in the selected loci for a given set of taxa. The more overlap there is between genetic variation within species and genetic variation between related species, the less effective barcoding becomes. In our analysis, although ITS and ETS share similar average inter- and intraspecific divergences compared with matK, ITS possesses the largest gap in the distribution of intra- and interspecific divergences. ITS also outperforms matK and ETS in terms of overall identification efficiency. Overall, identification success increases when a combination of ITS and matK, in particular, is used in a reasoned way.

Besides the generated DNA barcode data, a benefit of our work has been the deposition of voucher specimens in Scion's National Forestry Herbarium; this improved resource will assist future identification of Myrtaceae based on morphology. Additionally, 22 existing herbarium specimens that were sampled for barcoding purposes and 8 existing GenBank sequences have been redetermined as a result of our work (M.H. Buys, unpublished data). No new definitive phylogenetic insights were gleaned from the DNA barcode data due to only a subsample of species per genus having being sampled and because only matK and ITS (and to a limited extent ETS) were analysed.

The Myrtaceae DNA barcode database will enable rapid identification of large numbers of myrtaceous specimens in the event of a myrtle rust incursion. This will help MPI to accurately survey the spread of the disease and thereby enable timely decision making and consideration of biosecurity measures in order to protect threatened plant species and reduce the risk of spread to new areas. As such, the Myrtaceae DNA barcode database is a national botanical resource.

Conflict of interest statement

None declared.

Funding

We acknowledge funding from New Zealand's Ministry of Business, Innovation and Employment core funding to Scion and the Ministry for Primary Industries’ Operational Research Programme through the Barcoding Myrtaceae-16209 contract.

Acknowledgements

The following nurseries and gardens are thanked for supplying material of lesser-known species in New Zealand: Te Horo Nurseries, Masons Garden Centre and Nurseries, Oratia Native Plant Nursery, Global Plants Garden Centre, Bason Botanic Gardens, Eastwoodhill Arboretum, Hamilton Gardens, Auckland Domain Nursery and Auckland Botanic Gardens, University of Auckland Grounds Department, Pukekura Park, Wellington Botanic Garden; in Australia: Kings Park, Perth and Royal Botanic Gardens, Sydney; in South Africa: Kirstenbosch Botanical Gardens. The following people assisted in obtaining plant material and their contribution is acknowledged: John Adam, Jessie Balding, Ian and Jocelyn Bell, Colin Bradshaw, Benny Bytebier, Bill Clarkson, Russell Fransham, Keith Hammett, Chris Inglis, Colin Ogle, Barbara Parris, John Rogers, Ernst van Jaarsveld, Mike Wilcox and Peter Wilson.

References

Álvarez
I.
,
Wendel
J.F.
2003
Ribosomal ITS sequences and plant phylogenetic inference
.
Mol. Phylogenet. Evol
.
29
,
417
434
.

Anonymous
.
1995
Special Purpose Timber Species
.
Ministry of Forestry
.

Bailey
C.D.
,
Carr
T.G.
,
Harris
S.A.
,
Hughes
C.E.
2003
Characterization of angiosperm nrDNA polymorphism, paralogy, and pseudogenes
.
Mol. Phylogenet. Evol
.
29
,
435
455
.

Bayly
M.
,
Udovicic
F.
,
Gibbs
A.
,
Parra-O
C.
,
Ladiges
P.
2008
Ribosomal DNA pseudogenes are widespread in the eucalypt group (Myrtaceae): implications for phylogenetic analysis
.
Cladistics
24
,
131
146
.

Bebber
D.P.
2015
Range-expanding pests and pathogens in a warming world
.
Annu. Rev. Phytopathol
.
53
,
335
356
.

Bergsten
J.
,
Bilton
D.T.
,
Fujisawa
T.
,
Elliott
M.
,
Monaghan
M.T.
,
Balke
M.
et al. .
2012
The effect of geographical scale of sampling on DNA barcoding
.
Syst. Biol
.
61
,
851
869
.

Cannon
A.
2011
Myrtle Rust – Forest Industry Issues Paper (Project No: PRC 218-1011)
.
Forest & Wood Products Australia
.
A.P. Ltd
.

Carnegie
A.
2015
First report of Puccinia psidii (Myrtle Rust) in Eucalyptus plantations in Australia
.
Plant Dis
.
99
,
161
.

Carnegie
A.
,
Lidbetter
J.
2012
Rapidly expanding host range for Puccinia psidii sensu lato in Australia
.
Australas. Plant Pathol
.
41
,
13
29
.

Carnegie
A.
,
Lidbetter
J.
,
Walker
J.
,
Horwood
M.
,
Tesoriero
L.
,
Glen
M.
et al. .
2010
Uredo rangelii, a taxon in the guava rust complex, newly recorded on Myrtaceae in Australia
.
Australas. Plant Pathol
.
39
,
463
466
.

Carnegie
A.J.
,
Kathuria
A.
,
Pegg
G.S.
,
Entwistle
P.
,
Nagel
M.
,
Giblin
F.R.
2015
Impact of the invasive rust Puccinia psidii (myrtle rust) on native Myrtaceae in natural ecosystems in Australia
.
Biol. Invasions
18
,
127
144
.

CBOL Plant Working Group
.
2009
A DNA barcode for land plants
.
Proc. Natl Acad. Sci. USA
106
,
12794
12797
.

Chen
S.
,
Yao
H.
,
Han
J.
,
Liu
C.
,
Song
J.
,
Shi
L.
et al. .
2010
Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species
.
PLoS One
5
,
e8613
.

Clark
S.
2011
Risk Analysis of the Puccinia psidii/Guava Rust Fungal Complex (Including Uredo rangelii/Myrtle Rust) on Nursery Stock
.
Ministry of Agriculture and Forestry
.

Collins
R.A.
,
Cruickshank
R.H.
2014
Known knowns, known unknowns, unknown unknowns and unknown knowns in DNA barcoding: a comment on Dowton et al
.
Syst. Biol
.
64
,
1005
1009
.

Coutinho
T.
,
Wingfield
M.
,
Alfenas
A.
,
Crous
P.
1998
Eucalyptus rust: a disease with the potential for serious international implications
.
Plant Dis
.
82
,
819
825
.

De Lange
P.
,
Rolfe
J.R.
2010
New Zealand Indigenous Vascular Plant Checklist
.
New Zealand Plant Conservation Network
.

de Vere
N.
,
Rich
T.C.
,
Ford
C.R.
,
Trinder
S.A.
,
Long
C.
,
Moore
C.W.
et al. .
2012
DNA barcoding the native flowering plants and conifers of Wales
.
PLoS One
7
,
e37945
.

Elith
J.
,
Simpson
J.
,
Hirsch
M.
,
Burgman
M.
2013
Taxonomic uncertainty and decision making for biosecurity: spatial models for myrtle/guava rust
.
Australas. Plant Pathol
.
42
,
43
51
.

Giblin
F.
,
Carnegie
A.
2014
Puccinia psidii (Myrtle Rust) – Australian Host List
.
http://www.anpc.asn.au/resources/Myrtle_Rust.html (accessed on 24 September, 2014)
.

Glen
M.
,
Alfenas
A.
,
Zauza
E.
,
Wingfield
M.
,
Mohammed
C.
2007
Puccinia psidii: a threat to the Australian environment and economy – a review
.
Australas. Plant Pathol
.
36
,
1
16
.

Goldson
S.
,
Bourdot
G.
,
Brockerhoff
E.
,
Byrom
A.
,
Clout
M.
,
McGlone
M.
et al. .
2015
New Zealand pest management: current and future challenges
.
J. R. Soc. N. Z
.
45
,
31
58
.

Graça
R.N.
,
Ross-Davis
A.L.
,
Klopfenstein
N.B.
,
Kim
M.S.
,
Peever
T.L.
,
Cannon
P.G.
et al. .
2013
Rust disease of eucalypts, caused by Puccinia psidii, did not originate via host jump from guava in Brazil
.
Mol. Ecol
.
22
,
6033
6047
.

Hollingsworth
P.M.
,
Graham
S.W.
,
Little
D.P.
2011
Choosing and using a plant DNA barcode
.
PLoS One
6
,
e19254
.

Howard
C.
,
Findlay
V.
,
Grant
C.
2015
Australia's transition to management of myrtle rust
.
J. For. Sci
.
61
,
138
139
.

Joffily
J.
1944
Ferrugem do eucalipto
.
Bragantia
4
,
475
487
.

Junghans
D.
,
Alfenas
A.
,
Maffia
L.
2003
Escala de notas para quantificação da ferrugem em Eucalyptus
.
Fitopatol. Bras
.
28
,
184
188
.

Kawanishi
T.
,
Uematsu
S.
,
Kakishima
M.
,
Kagiwada
S.
,
Hamamoto
H.
,
Horie
H.
et al. .
2009
First report of rust disease on ohia and the causal fungus, Puccinia psidii, in Japan
.
J. Gen. Plant Pathol
.
75
,
428
431
.

Kimura
M.
1980
A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences
.
J. Mol. Evol
.
16
,
111
120
.

Kriticos
D.J.
,
Morin
L.
,
Leriche
A.
,
Anderson
R.C.
,
Caley
P.
2013
Combining a climatic niche model of an invasive fungus with its host species distributions to identify risks to natural assets: Puccinia psidii sensu lato in Australia
.
PLoS One
8
,
e64479
.

Ladiges
P.Y.
1997
Phylogenetic history and classification of eucalypts
. In
Eucalypt Ecology: Individuals to Ecosystems
.
Williams
J.
,
Woinarski
J.
(eds).
Cambridge University Press
, pp.
16
29
.

Lahaye
R.
,
Van der Bank
M.
,
Bogarin
D.
,
Warner
J.
,
Pupulin
F.
,
Gigot
G.
et al. .
2008
DNA barcoding the floras of biodiversity hotspots
.
Proc. Natl Acad. Sci. USA
105
,
2923
2928
.

Li
D.-Z.
,
Gao
L.-M.
,
Li
H.-T.
,
Wang
H.
,
Ge
X.-J.
,
Liu
J.-Q.
et al. .
2011
Comparative analysis of a large dataset indicates that internal transcribed spacer (ITS) should be incorporated into the core barcode for seed plants
.
Proc. Natl Acad. Sci. USA
108
,
19641
19646
.

Logacheva
M.D.
,
Valiejo-Roman
C.M.
,
Degtjareva
G.V.
,
Stratton
J.M.
,
Downie
S.R.
,
Samigullin
T.H.
et al. .
2010
A comparison of nrDNA ITS and ETS loci for phylogenetic inference in the Umbelliferae: an example from tribe Tordylieae
.
Mol. Phylogenet. Evol
.
57
,
471
476
.

Loope
L.
,
La Rosa
A.-M.
2008
Analysis of the Risk of Introduction of Additional Strains of the Rust Puccinia psidii Winter (Ohia Rust) to Hawaii
.
US Geological Survey
.

Machado
P.d.S.
,
Alfenas
A.C.
,
Alfenas
R.F.
,
Mohammed
C.L.
,
Glen
M.
2015a
Microsatellite analysis indicates that Puccinia psidii in Australia is mutating but not recombining
.
Australas. Plant Pathol
.
44
,
455
462
.

Machado
P.d.S.
,
Glen
M.
,
Pereira
O.
,
Silva
A.
,
Alfenas
A.
2015b
Epitypification of Puccinia psidii, causal agent of Guava Rust
.
Trop. Plant Pathol
.
40
,
5
12
.

Makinson
B.
2014
Myrtle Rust-what's happening?
Australas. Plant Conserv
.
23
,
13
15
.

Marlatt
R.
,
Kimbrough
J.
1979
Puccinia psdii on Pimento dioica in south Florida
.
Plant Dis. Rep
.
63
,
510
512
.

McKenzie
E.
1998
Rust fungi of New Zealand – an introduction, and list of recorded species
.
N. Z. J. Bot
.
36
,
233
271
.

Meier
R.
,
Zhang
G.
,
Ali
F.
2008
The use of mean instead of smallest interspecific distances exaggerates the size of the “barcoding gap” and leads to misidentification
.
Syst. Biol
.
57
,
809
813
.

Meyer
C.
,
Paulay
G.
2005
DNA barcoding: error rates based on comprehensive sampling
.
PLoS Biol
.
3
,
e422
.

Morin
L.
,
Aveyard
R.
,
Lidbetter
J.
,
Wilson
P.
2012
Investigating the host-range of the rust fungus Puccinia psidii sensu lato across tribes of the family Myrtaceae present in Australia
.
PLoS One
7
,
e35434
.

Murillo-Aldana
J.
,
Ruiz
E.
,
Landrum
L.R.
,
Stuessy
T.F.
,
Barfuss
M.H.J.
2011
Phylogenetic relationships in Myrceugenia (Myrtaceae) based on plastid and nuclear DNA sequences
.
Mol. Phylogenet. Evol
.
62
,
764
776
.

Pegg
G.S.
,
Giblin
F.
,
McTaggart
A.
,
Guymer
G.
,
Taylor
H.
,
Ireland
K.
et al. .
2014
Puccinia psidii in Queensland, Australia: disease symptoms, distribution and impact
.
Plant Pathol
.
63
,
1005
1021
.

Quinn
B.
,
Buys
M.
2014
Getting ready for Myrtle Rust
.
Surveillance
41
,
27
28
.

Ribeiro
I.J.
,
Pommer
C.
2004
Breeding guava (Psidium guajava) for resistance to rust caused by Puccinia psidii
.
Acta Hortic
.
632
,
75
78
.

Ross
H.A.
,
Murugan
S.
,
Li
W.L.S.
2008
Testing the reliability of genetic methods of species identification via simulation
.
Syst. Biol
.
57
,
216
230
.

Roux
J.
,
Greyling
I.
,
Coutinho
T.
,
Verleur
M.
,
Wingfield
M.
2013
The myrtle rust pathogen, Puccinia psidii, discovered in Africa
.
IMA Fungus
4
,
155
159
.

Seberg
O.
,
Petersen
G.
2009
How many loci does it take to DNA barcode a Crocus?
PLoS One
4
,
e4598
.

Steane
D.A.
,
Nicolle
D.
,
McKinnon
G.E.
,
Vaillancourt
R.E.
,
Potts
B.M.
2002
Higher-level relationships among the eucalypts are resolved by ITS-sequence data
.
Aust. Syst. Bot
.
15
,
49
62
.

Stevens
P.
2008
High risk site surveillance (HRSS) – an example of best practice plant pest surveillance
. In
Surveillance for Biosecurity: Pre-Border to Pest Management
.
Froud
K.
,
Popay
I.
,
Zydenbos
S.
(eds).
The New Zealand Plant Protection Society
.

Swofford
D.L.
2002
PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4
.
Sinauer
.

Teulon
D.
,
Alipia
T.
,
Ropata
H.
,
Green
J.
,
Viljanen-Rollinson
S.
,
Cromey
M.
et al. .
2015
The threat of myrtle rust to Māori taonga plant species in New Zealand
.
N. Z. Plant Protect.
68
,
66
75
.

Tommerup
I.
,
Alfenas
A.
,
Old
K.
2003
Guava rust in Brazil – a threat to Eucalyptus and other Myrtaceae
.
N. Z. J. For. Sci
.
33
,
420
428
.

Uchida
J.
,
Zhong
S.
,
Killgore
E.
2006
First report of a rust disease on Ohia caused by Puccinia psidii in Hawaii
.
Plant Dis
.
90
,
524
.

Wilson
P.G.
,
O'Brien
M.M.
,
Gadek
P.A.
,
Quinn
C.J.
2001
Myrtaceae revisited: a reassessment of infrafamilial groups
.
Am. J. Bot
.
88
,
2013
2025
.

Yao
H.
,
Song
J.
,
Liu
C.
,
Luo
K.
,
Han
J.
,
Li
Y.
et al. .
2010
Use of ITS2 region as the universal DNA barcode for plants and animals
.
PLoS One
5
,
e13102
.

Zauza
E.
,
Couto
M.
,
Lana
V.
,
Maffia
L.
,
Alfenas
A.
2010
Vertical spread of Puccinia psidii urediniospores and development of eucalyptus rust at different heights
.
Australas. Plant Pathol
.
39
,
141
145
.

Zauza
E.
,
Lana
V.
,
Maffia
L.
,
Araujo
M.
,
Alfenas
R.
,
Silva
F.
et al. .
2015
Wind dispersal of Puccinia psidii urediniospores and progress of eucalypt rust
.
For. Pathol
.
45
,
102
110
.

Zhang
A.B.
,
Muster
C.
,
Liang
H.B.
,
Zhu
C.D.
,
Crozier
R.
,
Wan
P.
et al. .
2012
A fuzzy-set-theory-based approach to analyse species membership in DNA barcoding
.
Mol. Ecol
.
21
,
1848
1863
.

Zhuang
J.-Y.
,
Wei
S.-X.
2011
Additional materials for the rust flora of Hainan Province, China
.
Mycosystema
30
,
853
860
.

Supplementary data