Keywords
microbiome, microbiota, 16S, rrn operon, nanopore, canine, low-biomass, skin, dog
This article is included in the Nanopore Analysis gateway.
microbiome, microbiota, 16S, rrn operon, nanopore, canine, low-biomass, skin, dog
The microbiota profile of low-biomass samples such as skin is challenging for metagenomics. These samples are prone to containing DNA contamination from the host or exogenous sources, which can overcome the DNA of interest1,2. Thus, the usual approach is amplifying and sequencing certain genetic markers that are ubiquitously found within the studied kingdom rather than performing metagenomics. Ribosomal marker genes are a common choice: 16S rRNA and 23S rRNA genes to taxonomically classify bacteria3,4; and ITS1 and ITS2 regions for fungi5,6.
Until now, most studies of microbiota rely on second-generation sequencing (massive parallel sequencing), and target a short fragment of the 16S rRNA gene, which presents nine hypervariable regions (V1-V9) that are used to infer taxonomy7,8. The most common choices for host-associated microbiota are V4 or V1-V2 regions, which present different taxonomic coverage and resolution depending on the taxa9. V4 region represents better the whole bacterial diversity, although it fails to amplify Cutibacterium acnes (formerly known as Propionibacterium acnes), a ubiquitous skin commensal in humans. So, when performing a skin microbiota study, the preferred choice is V1-V2 regions, although they lack sensitivity for the Bifidobacterium genus and poorly amplify the Verrucomicrobia phylum10.
Apart from the biases derived from the primer choice, short fragment strategies usually fail to assign taxonomy reliably at the genus and species level. This taxonomic resolution is particularly useful when associating microbiota to clinics such as in characterizing disease status or when developing microbiota-based products, such as pre- or pro-biotics11. For example, in human atopic dermatitis (AD) the signature for AD-prone skin when compared to healthy skin was enriched for Streptococcus and Gemella, but depleted in Dermacoccus. Moreover, nine different bacterial species were identified to have significant AD-associated microbiome differences12. In canine atopic dermatitis, Staphylococcus pseudintermedius has been classically associated with the disease. Microbiota studies of canine atopic dermatitis presented an overrepresentation of Staphylococcus genus13,14, but the species was confirmed when complementing the studies using directed qPCRs for the species of interest13 or using a Staphylococcus-specific database and V1-V3 region amplification14.
With the launching of third-generation single-molecule technology sequencers, these short-length associated issues can be overcome by sequencing the full-length of the 16S rRNA gene (1,500 bp) or even the whole rrn operon (4,500 bp), which includes the 16S rRNA gene, ITS region, and 23S rRNA gene. The Oxford Nanopore Technologies MinIONTM sequencer is a single-molecule sequencer that is portable, affordable with a small budget and offers long-read output. Its main limitation is the high error rate.
Several studies assessing the full-length 16S rRNA gene have already been performed using Nanopore sequencing to: i) characterize artificial and already characterized bacterial communities (mock community)15–17; ii) characterize complex microbiota samples, from the mouse gut18, wastewater19, microalgae20 and dog skin21; and iii) characterize the pathogenic agent in a clinical sample22–24. On the other hand, only two studies have been performed using the whole rrn operon to characterize mock communities25 and complex natural communities26.
Here we aim to assess the potential of Nanopore sequencing using both the full-length 16S rRNA (1,500bp) and the whole rrn operon (4,500bp) in: i) a clinical isolate of Staphylococcus pseudintermedius, ii) two bacterial mock communities; and iii) two complex skin microbiota samples.
We used two DNA mock communities as simple, well-defined microbiota samples:
- HM-783D, kindly donated by BEI resources, containing genomic DNA from 20 bacterial strains with staggered ribosomal RNA operon counts (between 1,000 and 1,000,000 copies per organism per μl).
- ZymoBIOMICS™ Microbial Community DNA standard that contained a mixture of genomic DNA extracted from pure cultures of eight bacterial strains.
We also sequenced a pure bacterial isolate of Staphylococcus pseudintermedius obtained from the ear of a dog affected with otitis.
As a complex microbial community, we used two DNA sample pools from the skin microbiota of healthy dogs targeting two different skin sites: i) dorsal back (DNA from two dorsal samples from Beagle dogs); and ii) chin (DNA from five chin samples from Golden Retriever/Labrador crossed dogs). Skin microbiota samples were collected using Sterile Catch-All™ Sample Collection Swabs (Epicentre Biotechnologies) soaked in sterile SCF-1 solution (50 mM Tris buffer (pH 8), 1 mM EDTA, and 0.5% Tween-20). DNA was extracted from the swabs using the PowerSoil™ DNA isolation kit (MO BIO) and blank samples were processed simultaneously (for further details on sample collection and DNA extraction see 27).
There were two ribosomal markers evaluated in this study: full-length 16S rRNA gene (~1,500 bp) and the whole rrn operon (~4,500 bp). Before sequencing, bacterial DNA was amplified using nested PCR, with a first PCR to add the specific primer sets (Table 1) tagged with the Oxford Nanopore universal tag and a second PCR to add the barcodes from the barcoding kit (EXP-PBC001). Each PCR reaction included a no-template control sample to assess possible reagent contamination.
Oligo name | ONT Universal Tag | Sequence 5' to 3' | Amplicon | Ref. |
---|---|---|---|---|
16S-27F | TTTCTGTTGGTGCTGATATTGC | AGRGTTTGATYHTGGCTCAG | 16S, rrn | 28 |
16S-1492R | ACTTGCCTGTCGCTCTATCTTC | TACCTTGTTAYGACTT | 16S | 29 |
23S-2241R | ACTTGCCTGTCGCTCTATCTTC | ACCRCCCCAGTHAAACT | rrn | 28 |
For the first PCR, we targeted the full 16S rRNA gene using 16S-27F and 16S-1492R primer set and the whole rrn operon (16S rRNA gene–ITS–23S rRNA gene) using 16S-27F and 23S-2241R primer set (Table 1).
PCR mixture for full-length 16S rRNA gene (25 μl total volume) contained 5 ng of DNA template, 5 μl of 5X Phusion® High Fidelity Buffer, 2.5 μl of dNTPs (2 mM), 1 μl of 16S-27F (0.4 μM), 2 μl of 16S-1492R (0.8 μM) and 0.25 μl of Phusion® Hot Start II Taq Polymerase (0.5 U) (Thermo Scientific, Vilnius, Lithuania). The PCR thermal profile consisted of an initial denaturation of 30 s at 98°C, followed by 25 cycles of 15 s at 98°C, 15 s at 51°C, 45 s at 72°C, and a final step of 7 min at 72°C.
PCR mixture for the rrn whole operon (50 μl total volume) contained 5 ng of DNA template, 10 μl 5X Phusion® High Fidelity Buffer, 5 μl dNTPs (2 mM), 5 μl each primer (1 μM) and 0.5 μl Phusion® Hot Start II Taq Polymerase (1 U). The PCR thermal profile consisted of an initial denaturation of 30 s at 98°C, followed by 25 cycles of 7 s at 98°C, 30 s at 59°C, 150 s at 72°C, and a final step of 10 min at 72°C.
The amplicons were cleaned-up with the AMPure XP beads (Beckman Coulter) using a 0.5X and 0.45X ratio for the 16S rRNA gene and the whole rrn operon, respectively. Then they were quantified using Qubit™ fluorometer (Life Technologies, Carlsbad, CA) and volume was adjusted to begin the second round of PCR with 0.5 nM of the first PCR product or the whole volume when not reaching the required concentration (mostly for samples that amplified the rrn operon).
PCR mixture for the barcoding PCR (100 μl total volume) contained 0.5 nM of first PCR product, 20 μl 5X Phusion® High Fidelity Buffer, 10 μl dNTPs (2 mM), and 1 μl Phusion® Hot Start II Taq Polymerase (2 U). Each PCR tube contained the DNA, the PCR mixture and 2 μl of the specific barcode. The PCR thermal profile consisted of an initial denaturation of 30 s at 98°C, followed by 15 cycles of 7 s at 98°C, 15 s at 62°C, 45 s (for the 16S rRNA gene) or 150 s (for rrn operon) at 72°C, and a final step of 10 min at 72°C.
Again, the amplicons were cleaned-up with the AMPure XP beads (Beckman Coulter) using a 0.5X and 0.45X ratio for the 16S rRNA gene and the whole rrn operon, respectively. For each sample, quality and quantity were assessed using Nanodrop and Qubit™ fluorometer (Life Technologies, Carlsbad, CA), respectively.
In most cases, the different barcoded samples were pooled in equimolar ratio to obtain a final pool (1000–1500 ng in 45 μl) to do the sequencing library.
The Ligation Sequencing Kit 1D (SQK-LSK108; Oxford Nanopore Technologies) was used to prepare the amplicon library to load into the MinIONTM (Oxford Nanopore Technologies), following the manufacturer’s protocol. Input DNA samples were composed of 1–1.5 μg of the barcoded DNA pool in a volume of 45 μl and 5 μl of DNA CS (DNA from lambda phage, used as a positive control in the sequencing). The DNA was processed for end repair and dA-tailing using the NEBNext End Repair/dA-tailing Module (New England Biolabs). A purification step using 1X Agencourt AMPure XP beads (Beckman Coulter) was performed.
For the adapter ligation step, a total of 0.2 pmol of the end-prepped DNA were added in a mix containing 50 μl of Blunt/TA ligase master mix (New England Biolabs) and 20 μl of adapter mix and then incubated at room temperature for 10 min. We performed a purification step using Adapter Bead Binding buffer (provided in the SQK-LSK108 kit) and 0.5X Agencourt AMPure XP beads (Beckman Coulter) to finally obtain the DNA library.
We prepared the pre-sequencing mix (14 μl of DNA library) to be loaded by mixing it with Library Loading beads (25.5 μl) and Running Buffer with fuel mix (35.5 μl). We used two SpotON Flow Cells Mk I (R9.4.1) (FLO-MIN106). After the quality control, we primed the flowcell with a mixture of Running Buffer with fuel mix (RBF from SQK-LSK108) and Nuclease-free water (575 μl + 625 μl). Immediately after priming, the nanopore sequencing library was loaded in a dropwise fashion using the SpotON port.
Once the library was loaded, we initiated a standard 48 h sequencing protocol using the MinKNOW™ software v1.15.
The samples were run using the MinKNOW software. After the run, fast5 files were base-called and de-multiplexed using Albacore v2.3.1. A second de-multiplexing round was performed with Porechop v0.2.330, where only the barcodes that agreed with Albacore were kept. Porechop was also used to trim the barcodes and the adapters from the sequences (Figure 1).
Moreover, we removed 45 extra base pairs from each end that correspond to the length of the universal tags and custom primers. After the trimming, reads were selected by size: 1,200 bp to 1,800 bp for 16S rRNA gene; and 3,500 to 5,000 bp for the rrn operon. We mapped the sequences obtained to the rrn database using Minimap2 v2.931. Afterwards chimeras were detected and removed using yacrd v0.332.
To assign taxonomy to the trimmed and filtered reads we used to strategies: 1) a mapping-based strategy using Minimap231; or 2) a taxonomic classifier using What’s in my Pot (WIMP)33, a workflow from EPI2ME in the Oxford Nanopore Technologies cloud (based on Centrifuge software34).
For the mapping-based strategy, we performed Minimap2 again with the non-chimeric sequences. We applied extra filtering steps to retain the final results: we kept only those reads that aligned to the reference with a block larger than 1,000 bp (for 16S rRNA gene) and 3,000 bp (for the whole rrn operon). For reads that hit two or more references, only the alignments with the highest Smith-Waterman alignment score were kept. After filtering, the multimapping was mostly present in cases with entries that belonged to the same taxonomy.
The reference databases used in this study were:
- Mock database: a collection of the complete genomes that were included in each mock community, as described by the manufacturer. The HM-783D database was retrieved from NCBI using the reference accession numbers, while Zymobiomics mock community has already its database online on the Amazon AWS server.
- rrn database: sequences from the whole operon retrieved from Genbank25.
For the taxonomic classification using the WIMP workflow, which uses the NCBI database, only those hits with a classification score >300 were kept34.
An earlier version of this article can be found on bioRxiv (doi: https://doi.org/10.1101/450734)
After Albacore basecalling and Porechop processing, we lost around 5% of the initial reads (3-13%). After length trimming step, we lost more sequences (Table 2). In general, the samples amplified using 16S rRNA marker gene recovered a higher percentage of reads after the quality control when compared to the rrn operon: 74–95% vs. 32–80%. Particularly for rrn operon, the largest percentage of reads was lost during the length trimming step: some of the reads included in that barcode presented the length of the 16S rRNA gene.
After this first quality control, we performed an alignment with the mock and the rrn databases and checked for chimeras. Chimeras detected were dependent on the database used for the alignment. As a positive control, we used mock samples with their mock database. Chimera ratio was higher for 16S rRNA gene amplicons (around ~40%) than for rrn operon (~10%), suggesting that PCR conditions for the 16S rRNA gene need to be adjusted or the PCR cycles reduced.
To conclude, the final useful sequences when amplifying for either amplicon were ~40%. In 16S rRNA gene, sequences were lost in the chimera checking step. In the rrn operon, sequences were lost in the length trimming step, probably due to the underrepresentation of the amplicon in the flowcell, since we ran them together with full-length 16S rRNA amplicons in the same flow-cell.
Microbial Mock Community HM-783D contained genomic DNA from 20 bacterial strains with staggered ribosomal RNA operon counts (from 1,000 to 1,000,000 copies per organism per µl). The bacterial composition detected should be proportional to the operon counts. This mock community would allow us determining if our approach reliably represents the actual bacterial composition of the community, especially considering low-abundant species.
We analyzed HM-783D mock community against its own database, which contains only the 20 representative species. On the one hand, using 16S rRNA gene we were able to detect all the bacterial species present in the mock community, even the low-abundant ones. On the other hand, using the rrn operon we were able to detect only the most abundant species (at least 104 operon copies) (Figure 2). This could be due to the lower sequencing depth obtained with rrn when compared with 16S rRNA, and probably due to the underrepresentation of the rrn amplicon in the flowcell when running together with the full-length 16S rRNA amplicons in the same flow-cell, as detailed above. Moreover, the relative abundances of rrn operon sequences were more biased than those obtained from 16S rRNA gene sequencing, when compared to those expected, which confirmed that the primers for rrn need to be improved for universality.
Zymobiomics mock community presents the same amount of genomic DNA from 8 different bacterial species; the expected 16S rRNA gene content for each representative is also known, so we are able to determine if our approach represents the actual bacterial composition of the community reliably.
Both 16S rRNA gene and rrn operon sequencing were able to detect 8 out of 8 bacterial species for the Zymobiomics mock community, using Minimap2 and WIMP. The “Other taxa” group in Figure 3A can indicate: i) not expected taxa (wrongly-assigned species, or previous contamination); or ii) higher taxonomic rank taxa (sequences not assigned to species level).
Using the mock community database (that contains only the 8 members of that community), we aimed to assess the biases regarding the actual abundance profile. 16S rRNA gene better represented the bacterial composition of the mock community, when considering the abundances. The rrn operon amplification over-represented Escherichia coli and Staphylococcus aureus and under-represented Enterococcus faecalis.
The rrn database25 contains 22,351 different bacterial species, including representatives of the species in the mock community. When using the rrn database, we found that the rrn operon was a better marker than 16S rRNA: more than 98% of the sequences mapped to the corresponding species, and only <2% of the total sequences mapped to a wrong species with the rrn operon, whereas ~15% of the sequences were given the wrong taxonomy with 16S rRNA. We performed alpha diversity analyses using the same rrn database. The rrn operon hit 26 different species, whereas 16S rRNA over-estimated the actual diversity, with hits to 202 different species (Figure 3B). However, when considering abundances, the diversity values are more similar, with Shannon indices of 1.95 and 2.51, when using rrn operon and 16S rRNA, respectively (at 30,000 sequences/sample).
Using WIMP, we confirmed again the higher resolution power of rrn operon: ~70% of the sequences were assigned to the correct species compared to ~45% for the 16S rRNA gene. Among all the bacterial species included in the mock community, Bacillus subtilis presented the most trouble for the correct taxonomic classification. The theoretically expected abundance for B. subtilis is 17% using the 16S rRNA gene. When using WIMP, only 5% of the total sequences were correctly classified at the species level, another 5% was classified correctly at the genus level, and another 10% was incorrectly classified as other Bacillus species (Figure 3C).
Apart from the mock communities, we also sequenced an isolate of Staphylococcus pseudintermedius obtained from canine otitis. When using WIMP approach with rrn operon, 97.5% of the sequences were correctly assigned to the S. pseudintermedius. However, with the 16S rRNA gene, 68% of the sequences were correctly assigned at the species level and 13% at the genus (Table 3). The wrong assigned species for rrn operon was ~2.5%, compared to ~20% for the 16S rRNA gene. On the other hand, through mapping the sequences to rrn database using Minimap2, we obtained no hit to S. pseudintermedius, since there is no representative in the rrn database. Instead, they were hitting mostly to Staphylococcus schleiferi, which is a closely related species; there were also few hits to Staphylococcus hyicus and Staphylococcus agnetis. These results highlight the need of comprehensive databases that include representatives of all the microorganisms relevant to a microbiome to correctly assign taxonomy.
After the first analyses with the mock communities, we were able to detect that the taxonomic resolution was higher when using rrn operon; however, the abundance profile was more reliable using 16S rRNA marker gene. If a bacterial species is not present in the database, the mapping strategy will give us the closest sequence resulting to an inaccurate taxonomic profile, such as we have seen for the Staphylococcus pseudintermedius isolate.
Here, we aimed to taxonomically profile two complex and uncharacterized microbial communities from dog skin (chin and dorsal) using the two different markers and comparing the mapping strategy (Minimap2 and rrn database) with the WIMP workflow (NCBI database).
For chin samples of healthy dogs, we found a high abundance of Pseudomonas species followed by other genus with lower abundances such as Erwinia and Pantoea. Focusing on Pseudomonas, at the species level we were able to detect that the most abundant species was Pseudomonas koreensis, followed by Pseudomonas putida and Pseudomonas fluorescens (Figure 4A and Supplementary Table 1). On the other hand, dorsal skin samples were dominated by bacteria from the genera Stenotrophomonas, Sanguibacter, and Bacillus. We reached species level for Stenotrophomonas rhizophila and Sanguibacter keddieii. It should be noted that Glutamicibacter arilaitensis is the same species as Arthrobacter arilaitensis, but is the up-to-date nomenclature35 (Figure 4B and Supplementary Table 1). For both skin sample replicates, the results of the most abundant species converged and allowed for characterizing this complex low-biomass microbial community at the species level.
Finally, analyzing the dorsal skin samples, we also detected the presence of contamination from the previous nanopore run. We sequenced dorsal skin samples twice: one with a barcode previously used for sequencing the HM-783D mock community and another one with a new barcode (Table 2). We were able to detect mock community representatives within the re-used barcode (Figure 5). Some of them were found only in the sample that was using the re-used barcode (Sample_1); others were also present in the skin sample, such as Bacillus cereus or Staphylococcus aureus. In total, this contamination from the previous run was representing ~6% of the sample composition.
Full-length 16S rRNA and the rrn operon revealed the microbiota composition of the bacterial isolate, the mock communities and the complex skin samples, even at the genus and species level. Although Nanopore sequencing has a high error rate (average accuracy for the S. pseudintermedius isolate: 89%), we compensated this low accuracy with longer fragments to assess the taxonomy of several bacterial communities. In general, the longer the marker, the higher the taxonomical resolution both when using mapping software, such as Minimap2, or taxonomy classifiers such as WIMP in EPI2ME cloud.
When using EPI2ME (WIMP with NCBI database), the amplicons from the S. pseudintermedius isolate were assigned to the correct bacterial species in ~98% and ~68% of the cases, using rrn operon and 16S rRNA operon, respectively. In a previous study, Moon and collaborators used the full-length 16S rRNA gene for characterizing an isolate of Campylobacter fetus and the marker assigned the species correctly for ~89% of the sequences using EPI2ME23. The ratio of success on the correct assignment at species level depends on the species itself and its degree of sequence similarity in the selected marker gene. Within the Staphylococcus genus, the 16S rRNA gene presents the highest similarity (around ~97%) when compared to other genetic markers36. On the other hand, we observed that using the mapping strategy (through Minimap2) could lead to a wrong assigned species if the interrogated bacterium has not any representative on the chosen database. This strategy provides faster results than EPI2ME, but it needs an accurate comprehensive and representative database.
Analyses of the mock communities allowed us to detect whether our approach represented the actual bacterial composition reliably. Moreover, with the HM-783D staggered mock community –with some low abundant species– we were able to detect the sensitivity of both approaches. When using the 16S rRNA marker gene, we were able to detect all bacterial members of both mock communities. However, when using the rrn operon, some of the low-abundant species were not detected. The likely reason is that we obtained a lower number of reads for this marker, up to one magnitude. Mock communities also allowed us to detect the potential biases of our primer sets for both markers, since some of the species detected were over- and under-represented. Actinomyces odontolyticus and Rhodobacter sphaeroides seem to not amplify properly, neither with 16S rRNA gene, nor the rrn operon. Previous studies also detected the same pattern for these specific bacteria even when using or comparing different primer sets16,21. Overall, the 16S rRNA primer set seemed less biased than rrn operon. When using the rrn operon, E. coli and S. aureus were overrepresented, whereas others were underrepresented, suggesting that the primers should be improved for universality.
Focusing on the dog chin samples, we could detect that it was mostly Pseudomonas species that colonized: P. koreensis, P. putida, and P. fluorecens were the main representatives. Recently, Meason-Smith and collaborators found Pseudomonas species associated with malodor in bloodhound dogs37. However, these were not the main bacteria found within the skin site tested, but were in low abundance, differing from what we have found here. On the other hand, Riggio and collaborators detected Pseudomonas as one of the main genera in canine oral microbiota in the normal, gingivitis and periodontitis groups38. However, the Pseudomonas species were not the same ones that we have detected here. It should be noted that we had characterized these chin samples (and others) with 16S V1-V2 amplicons in a previous study27, where we found some mutual exclusion patterns for Pseudomonadaceae family. This taxon showed an apparent “invasive pattern”, which could be mainly explained for the recent contact of the dog with an environmental source that contained larger bacterial loads before sampling27. Thus, our main hypothesis is that the Pseudomonas species detected on dog chin came from the environment, since they have been previously isolated from environments such as soil or water sources39,40.
The most abundant species in dog dorsal skin samples were Stenotrophomonas rhizophila, Bacillus cereus, Sanguibacter keddieii, Sporosarcina psychrophila, Achromobacter xylosidans and Glutamicibacter arilaitensis. None of these specific bacterial species had previously been associated with healthy skin microbiota in human or dogs. Some of them have an environmental origin, such as Stenotrophomonas rhizophila, which is mainly associated with plants41; or Sporosarcina psychrophila, which is widely distributed in terrestrial and aquatic environments42. The Bacillus cereus main reservoir is also the soil, although it can be a commensal of root plants and guts of insects, and can also be a pathogen for insects and mammals43. Overall, environmental-associated bacteria have already been associated with dog skin microbiota and are to be expected, since dogs constantly interact with the environment27.
Regarding Stenotrophomonas in human microbiota studies, Flores et al. found that this genus was enriched in atopic dermatitis patients that were responders to emollient treatment44. However, previous studies on this skin disease found Stenotrophomonas maltophila associated to the disease rather than Stenotrophomonas rhizophila45. Achromobacter xylosoxidans has been mainly associated with different kind of infections, as well as skin and soft tissue infections in humans46. However, both dogs included in this pool were healthy and with representatives of both genus/species, a fact that reinforces the need to study the healthy skin microbiome before associating some species at the taxonomic level to disease. The other abundant bacteria detected on dog skin have been isolated in very different scenarios: Sanguibacter keddieii from cow milk and blood47,48; and Glutamicibacter arilaitensis (formerly Arthrobacter arilaitensis) is commonly isolated in cheese surfaces35,49.
Finally, some of the technical parameters used should be improved for better performance in future studies. In most cases we did not obtain enough DNA mass to begin with the indicated number of molecules for rrn operon amplicons. Thus, the flowcell contained an underrepresentation of rrn operon amplicons when compared to the full-length 16S rRNA gene. Moreover, in barcodes that contained rrn operon amplicons, a great percentage of reads were lost due to an inaccurate sequence size (~1,500 bp). One possible solution could be running each marker gene in different runs, so multiplexing samples with the same size amplicon to avoid underrepresentation of the larger one. When assessing chimera in mock samples using the specific mock database, we detected that the 16S rRNA gene formed a higher percentage of chimeras than rrn operon. Some options to improve that fact would include lowering PCR cycles performed. Better adjusting the laboratory practices would allow an increased DNA yield that meets the first quality control steps.
To conclude, both full-length 16S rRNA and the rrn operon retrieved the microbiota composition from simple and complex microbial communities, even from the low-biomass samples such as dog skin. Taxonomy assignment down to species level was obtained, although it was not always feasible due to: i) sequencing errors; ii) high similarity of the marker chosen within some genera; and iii) an incomplete database. For an increased resolution at the species level, the rrn operon would be the best choice. Further studies should be aiming to obtain reads with higher accuracy. Some options would include using the 1D2 kit of Oxford Nanopore Technologies, the new basecallers or the new flow cells with R10 pores. Finally, studies comparing marker-based strategies with metagenomics will determine the most accurate marker for microbiota studies in low-biomass samples.
The datasets analyzed during the current study are available in the NCBI Sequence Read Archive, under the Bioproject accession number PRJNA495486: https://identifiers.org/bioproject/PRJNA495486.
This work was supported by two grants awarded by Generalitat de Catalunya (Industrial Doctorate program, 2013 DI 011 and 2017 DI 037).
Supplementary Table 1. Taxa found on skin microbiota of healthy dogs.
List of all the taxa and their relative abundances found on dog skin microbiota samples (chin and dorsal skin). Results for both marker genes tested and for both approaches (Minimap2 + rrn database and WIMP with NCBI database).
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Partly
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Not applicable
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Partly
References
1. Shin J, Lee S, Go M, Lee S, et al.: Analysis of the mouse gut microbiome using full-length 16S rRNA amplicon sequencing. Scientific Reports. 2016; 6 (1). Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Molecular ecology of microbial systems
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: neuroinfection, encephalitis
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
References
1. Volden R, Palmer T, Byrne A, Cole C, et al.: Improving nanopore read accuracy with the R2C2 method enables the sequencing of highly multiplexed full-length single-cell cDNA. Proceedings of the National Academy of Sciences. 2018; 115 (39): 9726-9731 Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Genomics, long read sequencing, microbiome assembly
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Partly
Are sufficient details of methods and analysis provided to allow replication by others?
Partly
If applicable, is the statistical analysis and its interpretation appropriate?
Not applicable
Are all the source data underlying the results available to ensure full reproducibility?
Partly
Are the conclusions drawn adequately supported by the results?
Partly
Competing Interests: I am a co-owner of DNASense ApS (www.dnasense.com)
Reviewer Expertise: microbial biotechnology, nanopore sequencing
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Partly
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Human microbiome, Microbial genomics, Nanopore sequencing, Computational biology, Metagenomics
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | |||||
---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | |
Version 2 (revision) 01 Aug 19 |
read | ||||
Version 1 06 Nov 18 |
read | read | read | read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)