Molecular characterization of a new highly divergent Mobala related arenavirus isolated from Praomys sp. rodents

Simo Tchetgna, Huguette; Descorps-Declère, Stephane; Selekon, Benjamin; Kwasiborski, Aurelia; Vandenbogaert, Mathias; Manuguerra, Jean-Claude; Gessain, Antoine; Caro, Valérie; Nakouné, Emmanuel; Berthet, Nicolas

doi:10.1038/s41598-021-88046-5

Download PDF

Article
Open access
Published: 13 May 2021

Molecular characterization of a new highly divergent Mobala related arenavirus isolated from Praomys sp. rodents

Huguette Simo Tchetgna^1,2,
Stephane Descorps-Declère³,
Benjamin Selekon⁴,
Aurelia Kwasiborski⁵,
Mathias Vandenbogaert⁵,
Jean-Claude Manuguerra⁵,
Antoine Gessain^6,7,
Valérie Caro⁵,
Emmanuel Nakouné⁴^na1 &
…
Nicolas Berthet^2,5^na1

Scientific Reports volume 11, Article number: 10188 (2021) Cite this article

1412 Accesses
2 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Arenaviruses represent a family of viruses that are naturally present in rodents belonging to subfamily Murinae, Neotominae or Sigmodontinae. Except for Lassa virus, little information is available on other Old-World arenaviruses. Here, we describe strain AnRB3214, a virus isolated from a presumed Praomys sp. rodent in the Central African Republic in 1981 and assigned to Ippy virus based on antigenic similarity. The strain was simultaneously sequenced on Illumina NovaSeq 6000 and MinION Mk1B devices and analysed with various bioinformatics tools. We show that the best genome coverage and depth were obtained with the Kaiju and Minimap2 classification and identification tools, on either the MinION or the Illumina reads. The genetic analysis of AnRB3214 fragments showed 68% to 79% similarity with the Mobala and Gairo mammarenaviruses at the nucleic acid level. Strain AnRB3214 had a truncated nucleoprotein smaller than that of other Old World arenaviruses. Molecular clock analysis suggests that this strain diverged from Mobala virus at least 400 years ago. Finally, this study illustrates the importance of genomics in the identification of archived viruses and expands on the diversity of African arenaviruses, because strain AnRB3214 is either a variant or a close relative of Mobala virus, and not Ippy virus.

Genome sequencing identifies “Limestone Canyon virus” as Montaño virus (Hantaviridae: Orthohantavirus montanoense) circulating in brush deermice in New Mexico

Article Open access 04 April 2024

Metagenomic analysis of viromes in tissues of wild Qinghai vole from the eastern Tibetan Plateau

Article Open access 14 October 2022

Metavirome of 31 tick species provides a compendium of 1,801 RNA virus genomes

Article Open access 05 January 2023

Background

Arenaviruses are enveloped, single-stranded RNA viruses belonging to family Arenaviridae, order Bunyavirales. With approximately 43 species, arenaviruses are subdivided into four genera (Antennavirus, Hartmanivirus, Reptarenavirus and Mammarenavirus) that infect fishes, snakes or mammals¹. Arenaviruses typically have ambisense, bisegmented RNA genomes. Each RNA segment encodes two non-overlapping open reading frames (ORFs) of opposite polarity. The small (S) 3500 nt genomic segment encodes the nucleocapsid protein (NP) and a glycoprotein precursor (GPC), which later matures into two membrane glycoproteins, GP1 and GP2. The large (L) ~ 7000 nt genomic segment codes for the RNA-dependent RNA polymerase (RdRp) and exceptionally a zinc-binding protein (Z) in the Mammarenavirus and Reptarenavirus genera^2,3. Interestingly, Antennavirus has a trisegmented RNA genome, with NP and GPC being on two distinct segments⁴.

Mammarenaviruses are geographically segregated into two monophyletic groups based on genomic features and antigenic properties. The New World (NW) arenaviruses, represented by the Tacaribe virus complex, are prevalent in North and Latin America; the Old World (OW) arenaviruses, represented by the Lassa-Lymphocytic choriomeningitis virus (LCMV) complex, are found in Africa^3,5. The natural reservoirs of mammarenaviruses are generally rodent subfamilies Murinae, Neotominae and Sigmodontinae, in which they establish persistent infections with no overt pathology for the animal host^6,7. These viruses seem to be associated with specific rodent host species, although many host-switching events have been recorded, which implies that virus-host co-evolution may not be as tightly linked as previously thought^6,8,9. Most mammalian arenaviruses are non-pathogenic to humans, but seven are recognized as haemorrhagic fever viruses including the Lassa and Lujo viruses in Africa and the Sabia, Machupo, Chapare, Junin and Guaranito viruses in Latin America^10,11. LCMV, which has a worldwide distribution, can cause neurological disease in immune-compromised individuals¹². Except for Lassa virus, knowledge of OW arenavirus diversity, evolution and involvement in human diseases is sparse. Several arenaviruses of unknown pathogenic potential have been described in African rodents such as Gbagroube, Kodoko and Menekre viruses in West Africa and Mopeia, Morogoro, Luna, Merino Walk, Mariental and Okahandja viruses in southern Africa². In Central Africa, wild and domestic rodents act as reservoirs of numerous parechoviruses, rhabdoviruses, phleboviruses, flaviviruses, picornaviruses, astroviruses, paramyxoviruses, orthopoxviruses and arenaviruses, some of which are directly responsible for human diseases, including monkeypox, Mokola virus infection or lymphocytic choriomenigitis^13,14,15.

Few arenaviruses are known in Central African rodents; however, LCMV, Souris, Mobala and Ippy viruses have been described in Mus, Praomys, Mastomys and Arvicanthis rodents in Gabon, Cameroon and the Central African Republic (CAR)^13,16,17,18. Seroconversion has been highlighted in the CAR for Mobala virus infection in humans in locations where the virus has been isolated from Praomys rodents^17,19. Interestingly, seroprevalence in humans seems to reflect virus isolation in its reservoir host. Indeed, for a prevalence of 7.7% and 3.2% in Praomys sp. rodents in the Bouboui and Gomoka villages, respectively, using the indirect fluorescent antibody test (IFAT), the prevalence in humans ranges from 3.1 to 3.8%; however, in Botambi, with a prevalence of 2.2% in the reservoir, no antibodies have been detected in the human population¹⁹. Seroconversion in humans for Souris and Ippy viruses remains to be determined.

Next-generation sequencing (NGS) platforms have been a breakthrough in many scientific fields. In virology, NGS has facilitated a better understanding of the diversity and evolutionary history of viruses, the discovery of new viruses and the rapid response to outbreaks through diagnosis of the causative agents^4,20,21,22. The leading NGS technologies, among which Illumina, are based on the incorporation of terminating nucleotides in a DNA polymerization process. These technologies, which produce very accurate and high-quality results, are widely used. However, they are usually cumbersome and only available in well-equipped, specialized laboratory facilities that can provide all the necessary utilities, such as a stable electric supply and internet connection. On the other hand, MinION uses a different approach in which a specific nanopore protein is used to recognize a k-mer in the sequence of interest via a disturbance in the basal electrical current. This innovation led to the miniaturization of the sequencing device and the ability to perform real-time sequence analysis^23,24. This portable tool is gaining interest because it brings more flexibility to sequencing, albeit with a high error rate, and can, therefore, be used in remote regions as a diagnosis tool or in response to outbreaks as seen in the recent Lassa, Ebola and Zika epidemics^{22,25,26,27,28,29}. Despite its shortcomings, MinION sequencers can be implemented in resource-limited settings such as the CAR as a first-line tool to respond instantly to emergencies or to analyse infectious samples subject to transportation restrictions.

Here, we describe a divergent arenavirus, the AnRB3214 strain isolated from wild Praomys rodents in the CAR in 1981. The strain was sequenced using the MinION Mk1B (Oxford Nanopore Technologies, Oxford, UK) and Illumina NovaSeq 6000 (Integragen, Ivry, France) devices and analysed with various bioinformatics pipelines.

Methods

Description of the strain

Strain AnRB3214, initially identified as Ippy virus, was isolated from a presumed Praomys sp. rodent caught near Botambi, Ombella-M’poko prefecture, CAR in October 1981 (Fig. 1). The initial isolation procedure was performed by intracerebral infection of two-day-old newborn mice up to the sixth passage. The viral isolate was then freeze-dried (lyophilized) for long-term conservation at room temperature.

Virus isolation

A volume of 0.2 mL of 1X PBS-diluted lyophilized virus was used to inoculate a litter of eight newborn mice (24–72 h old). Only mice that died after a 7-day follow-up were harvested and their brains were collected, crushed and suspended in a 1X PBS solution, and then filtered. This filtered brain suspension was used to infect fresh confluent Vero E6 cells culture in Dulbecco’s Modified Eagle’s Medium (DMEM, Gibco, Life Technologies, Carlsbad, California, USA) supplemented with 2% Foetal Calf Serum (FCS, Gibco, Life Technologies) and a 1% v/v antimicrobial solution (10,000 units/mL of penicillin, 10,000 µg/mL of streptomycin and 25 µg/mL of amphotericin B, Gibco, Life Technologies). The cell cultures were monitored for 7 days for cytopathic effects using an inverted optical microscope. The brain suspensions and cell culture supernatant were then stored at -80 °C until further use.

Nucleic acid extraction

Total RNA was extracted from the culture supernatant using the QIAamp Viral RNA Mini kit (Qiagen, Hilden, Germany) according to the manufacturer's protocol. These RNAs were then quantified using the Qubit RNA HS Assay kit (Invitrogen, Life Technologies, Carlsbad, California, USA) and stored at -80 °C.

Preparation of libraries for Illumina and MinION sequencing

The first cDNA strand was prepared using SuperScript III reverse transcriptase and random hexamers (Invitrogen, Life Technologies, Carlsbad, California, USA) on total RNA. The second cDNA strand was then prepared using the NEBNext Ultra II Non-Directional RNA Second Strand Synthesis Module kit (New England BioLabs, Hitchin, Hertfordshire, UK) according to the manufacturer's instructions. All final repair and adapter ligation steps were based on the use of the NEBNext Ultra II RNA Library Prep Kit for Illumina (New England BioLabs, Hitchin, Hertfordshire, UK). All DNA purification steps were performed using Agencourt AMPure XP beads (Beckmann Coulter, Woerden, Netherlands). The library quality was checked on the 2100 BioAnalyzer instrument (Agilent Technologies, Santa Clara, California, USA) and then sequenced on Illumina NovaSeq 6000 sequencer (Integragen, Ivry, France) to obtain 2 × 150 bp paired-end reads. In parallel, the double-stranded cDNA obtained after the end repair step was used to prepare the MinION library with the PCR Barcoding kit (SQK-PBK004) (Oxford Nanopore Technologies. Oxford, Oxfordshire, UK) according to the manufacturer's instructions. The library was multiplexed with five other samples and loaded on a R9.4 flow cell (FLO-MIN106) and sequenced on the MinION Mk1B device in 37 h. The raw sequencing data were collected using ONT MinKNOW software (version 19.05.0).

Bioinformatics analysis

The quality of the raw reads from the Illumina sequencing was first assessed, filtered, and then trimmed using CLC Genomics workbench v10.0.1 (Qiagen, Hilden, Germany). All trimmed reads were then de novo assembled using SPAdes v3.10 to obtain two contigs corresponding to the S and L segments of an arenavirus genome. All reads were mapped back on the two contigs previously obtained using Geneious Prime (Biomatters, Auckland, New Zealand). All the reads from the Illumina sequencing that have been mapped were grouped together in a constituted dataset referred to as "AnRB-3214-Illumina". Three subsets, derived from the AnRB-3214-Illumina dataset, were constituted by collecting random reads from the raw fastq files in such a way that the first dataset represented 10% of the initial reads, the second 1%, and the last 0.1%. In parallel, the raw data obtained from MinION sequencing were acquired in real-time in files containing 4000 sequences in FAST5 format. These files were base-called and demultiplexed using Guppy (version 3.4.1). As carried out for the Illumina raw reads, the MinION raw reads were mapped on the obtained sequences of the two segments using Minimap2³⁰. All the reads that have been mapped were grouped in a dataset referred to as "AnRB-3214-MinION". From this dataset, three sub-datasets were constituted in the same way as previously. Viral diversity was determined from the two above-described datasets (AnRB-3214-MinION and AnRB-3214-Illumina) and their three sub-datasets using three different taxonomic classification tools: Kaiju³¹, Centrifuge³², and Kraken2³³. These tools use reference databases in which the sequence of our AnRB-3214 strain was absent. In parallel with the use of these three taxonomic classification tools, two other traditional approaches were used for the identification of viral reads. The first approach consisted in a homology search from reference sequences using the BLASTN tool whereas the second approach was to map the reads on other reference sequences using Minimap2. To use these approaches, the genomes of viruses closest to our AnRB3214 strain were screened for in GenBank and identified as Mobala virus (DQ328876, AY342390) and Gairo virus (KJ855308, KJ855307). Then, we screened the AnRB-3214-MinION and AnRB-3214-Illumina datasets against these Mobala and Gairo virus sequences using the BLASTN (https://blast.ncbi.nlm.nih.gov/Blast.cgi) and Minimap2 tools. Hits produced by both methods were reformatted in fastq files and remapped to obtain bam files for both BLASTN and Minimap2. Bam files were converted into consensus sequences by applying BCFtools on the results of SAMtools mpileup.

Phylogenetic analysis and molecular clock estimation

S and L segment nucleic-acid sequences as well as the coding sequences of the NP, GPC, RdRp and Z genes were aligned using ClustalO in Unipro Ugene version 34.0³⁴ and manually edited. Then, phylogenetic trees were inferred for each segment using the maximum-likelihood (ML) method implemented in MEGA X (version 10.1.1) under the GTR + I + Γ4 nucleotide substitution model as determined by the best model finder in MEGA X³⁵. The node supports were estimated using 1000 bootstrap replicates.

Concurrently, a molecular clock was tested on concatenated S and L segments using Bayesian inference. S + L concatenates of genomic sequences were aligned using MAFFT v7³⁶ and manually edited with AliView³⁷. Starting from this alignment, ML and Bayesian analyses were conducted. A global ML evolutionary tree was reconstructed using IQ-TREE³⁸ according to a GTR + F + I + Γ4 substitution model, recommended by the “ModelFinder” model fitter included IQ-TREE. Branch robustness was assessed using the ultrafast bootstrap approximation with 1000 bootstrap replicates³⁹. The resulting ML tree was visualized and edited in FigTree (https://github.com/rambaut/figtree/releases/tag/v1.4.4). TempEst v1.5 was used for investigating the temporal signal of our molecular phylogeny⁴⁰. Analysis of the extreme values of residuals (tangential deviation from the regression line) permitted us to spot problematic sequences. Those genomes were discarded from the following steps. BEAST v1.10.4 was used for Bayesian Monte Carlo Markov chain (MCMC) analysis to estimate the divergence time of nodes⁴¹. We retained the simplest model: strict clock and constant population size with the HKY substitution model and four gamma categories. BEAST runs were completed checking chain convergence and sufficient sampling of the posterior space (ESS > 200). The final chronogram was generated using TreeAnnotator v1.10.4⁴¹.

Ethical statement

The study was approved by the institutional committee of the Institute Pasteur of Bangui. For the experiments realised out of CAR, material transfer agreements were established with regards to national and international policies. All the experiments carried on mice were done in accordance with relevant guidelines and regulations on the use of laboratory animals including the ARRIVE guidelines.

Results

Bioinformatics analysis

De novo assembly of the raw reads from Illumina sequencing allowed us to obtain two sequences with lengths of 3390 nt and 7390 nt for the S and L fragments, respectively. Then, mapping back all the raw reads identified a total of 9,275,654 reads corresponding to the arenavirus. The mean depth was of 1902 × and 155 × for the S and L fragments, respectively. Mapping also made it possible to identify a total of 77,133 reads from MinION sequencing. After mapping MinION data on these two sequences, the mean coverage was 18 × and 3x. The taxonomic assignment using three different classification tools was performed only on the dataset that contained only reads corresponding to our virus. Taxonomic assignments showed that the percentage of reads from the AnRB-3214-Illumina dataset assigned as belonging to the Arenaviridae family ranged from 0.02 to 0.4% whereas for reads belonging to the AnRB-3214-MinION dataset, taxonomic assignment percentages were 0.01% and 0.88% for Centrifuge³² and Kaiju³¹, respectively (Table 1).

Table 1 Percentages of reads identified as an arenavirus using different classification tools. The highest number of reads classified as belonging to arenaviruses were obtained with Kaiju tool for both MinION and Illumina libraries.

Full size table

Moreover, the percentage of reads from the computer-diluted Illumina datasets assigned as belonging to the Arenaviridae family only varied from 0.050 to 0.070% and from 0.020 to 0.024% for Kraken2³³ and Centrifuge, respectively. The variation was also low with Kaiju (0.45 to 0.46%) except for the 0.1% subset for which no reads were assigned. Excluding the 0.1% subset in which no MinION reads could be classified, the percentages in the other datasets ranged from 0.03 to 1.04% for computer-diluted datasets from MinION sequencing (Table 1). In parallel to these three classification tools, two other tools were used to identify viral reads, a mapping tool (Minimap2³⁰) and a homology search tool (BLASTN) using the sequence of the closest variant available in the GenBank (Mobala virus). From the AnRB-3214-Illumina dataset, a total of 26,726 reads (0.29%) were identified with Minimap2, but only 2364 (0.025%) were identified with BLASTN. Finally, although only 0.45% of the reads were classified with Kaiju, almost 100% of both the S and L segments were covered. The second best coverage was obtained with Minimap2 with 95% (573 positions uncovered), but using either BLASTN or the other two classification tools (Kraken2 and Centrifuge), the coverage percentages were lower, varying between 17% (BLASTN) and 45% (Kraken2) and were concentrated on only certain regions of the different fragments (Fig. 2).

Genome analysis

We obtained two genomic segments, an S segment of 3390 nt and an L segment of 7391 nt, each segment encoding two non-overlapping open reading frames (ORFs) in an ambisense organization typical of arenaviruses. The S segment encodes GPC [position 146 to 1618] and NP [position 3190 to 1646] and the L segment encodes the Z protein [position 153 to 458] and the RdRp protein [position 7256 to 570].

Nucleotide sequences and the deduced amino acid sequence of AnRB3214 strain were compared with those of other OW arenaviruses species. The S and L segments of the AnRB3214 strain are phylogenetically very distant from the other described Ippy virus strain DakAnB188d^42,43, with respectively 64% and 56% nucleic acid similarity, evidence that they belong to two distinct virus species. Although the two strains were identified in two locations in the CAR from Arvicanthis sp. (DakAnB188d) and Praomys sp. (AnRB3214) rodents almost 10 years apart, the AnRB3214 strain was classified as an Ippy virus due to cross-reactivity with DakAnB188d. Serum-based assays were widely used for the classification of viruses in the 1970-1980s and viruses were historically assigned to antigenic groups (Group A, Group B) given their antigenic relationships.

Additionally, for the S segment, at the nucleic acid level, the AnRB3214 strain displayed 79%, 72%, 70% and 68% similarity with Mobala virus, Gairo virus, Luna virus and Morogoro virus, respectively (Table 2). Similar trends were observed when comparing the L segment, where these viruses are the closest relatives with however divergences of 22%, 32%, 38% and 39%, respectively. Gairo, Luna and Morogoro viruses have been isolated from Mastomys natalensis in Tanzania and Zambia and the Mobala virus has been identified in Bouboui village in Praomys sp. from the CAR one year prior to the isolation of strain AnRB3214 in Botambi, another village of the same prefecture. When considering each protein in their respective segments, we observed an increase in the level of conservation. At the amino acid level, there was a high level of conservation for GPC with 94%, 83% and 82% of similarity with Mobala, Gairo, Luna, and Morogoro viruses (Table 2). The level of similarity with the other arenaviruses was low for the NP, Z and RdRp proteins. The Z protein of AnRB3214 was less conserved at the protein level than at the nucleic level and showed higher homology with that of Mobala virus, but less than 70% similarity with other close relatives, such as the Gairo and Luna viruses. Furthermore, the AnRB3214 NP appears to have been truncated of 43 to 78 amino acids (aa) at the 5′ end during the evolution of the virus. The AnRB3214 strain NP protein contains 515 aa, but OW arenavirus NP length ranges from 558 to 593 aa^44,45. Indeed, when compared to NP of OW arenaviruses like Lassa, Mopeia, Mobala, Gairo and Luna, the NP deletion of AnRB3214 corresponds to the first 53 aa of these viruses. This feature was already observed at the nucleic acid level with the NP gene length of 1545 nt with the other arenavirus NP genes ranging from 1674 to 1779 nt.

Table 2 Similarities of strain AnRB3214 with other Old-World arenaviruses at the nucleic acid (NA) and amino acid (AA) level and accession number.

Full size table

Phylogenetic trees were built for each segment and coding region using references available in GenBank. All the tree topologies confirmed the conclusions drawn from the pairwise comparison of the sequences. For example, strain AnRB3214 belongs to the cluster comprising the Mopeia, Morogoro, Luna, Gairo and Mobala viruses (Fig. 3).

The AnRB3214 strain is genetically very divergent from the Ippy virus DakAnB188d strain, because they are found in two distant groups in the phylogenetic trees. Moreover, strain AnRB3214 shares a most recent common ancestor (MRCA) with Mobala virus. Molecular clock analysis shows that the two viruses diverged from one another around the year 1600 AD with a substitution rate of 4.28 × 10^–4 subt/site/year (1374–1885, 95% highest posterior density (HPD)) (Fig. 4).

Discussion

In this study, the AnRB3214 strain was sequenced on two sequencing platforms, Illumina and MinION, and we used a diverse set of taxonomic classification and mapping tools to retrieve arenavirus-specific reads from the obtained datasets. We obtained 77,133 and 9,275,654 reads specific to arenaviruses from the MinION and Illumina datasets, respectively. It is no surprise that we obtained less reads from MinION sequencing as this technology produces longer reads compared to Illumina. Additionally, the high amount of reads obtained during our experiment is representative of the type of sample that was used (here, cell culture supernatant) compared with direct sequencing of biological samples in which the viruses are present at low abundance with high levels of the host nucleic acid background^26,46. When using reference viral genome to recover arenaviruses reads in our AnRB3213 dataset, the best read identifications were obtained with Kaiju and Minimap2. Few, if any, studies have compared tools for the classification of reads, either from long reads (MinION) or short reads (Illumina) for the analysis of divergent viruses. Minimap was initially developed for long reads, but has been updated to work with smaller paired-end reads, through a genuine management of gaps³⁰. Furthermore, Minimap2 has been shown to be a good tool for the alignment of nanopore reads, with a higher speed than GraphMap although with comparable accuracy⁴⁷. Contrary to Kraken2 and Centrifuge which primarily classify nucleotides, Kaiju uses amino acid sequences instead of genomic sequences for metagenomic classification. This feature allows Kaiju to overcome the limitations introduced by the redundancy of the genetic code and therefore make it suitable for investigating divergent organisms³¹. For more insight on our dataset and sequencing approaches, randomly generated subsets corresponding to 10%, 1%, and 0.1% of the overall initial outputs were generated for subsequent analysis to simulate a situation similar to that of a primary biological sample, i.e. from infected rodent reservoirs with low viral load. We showed that, except for the 0.1% dilution subset using Kaiju, arenavirus reads were detected by all the classification tools, although at varying rates. However, this test also demonstrated the limitations that can arise in the detection of such viruses in their reservoirs because the corresponding reads will be present in limited amounts.

OW mammarenaviruses are zoonotic viruses hosted by rodents of subfamily Murinae, which includes approximately 129 genera and 584 species. Members of about eight rodent genera are recognized as harbouring arenavirus species across the African continent^6,7. For example, Praomys rodent species have a very extended geographic range and can be found across Sub-Saharan Africa. Ippy virus, Souris virus and Mobala virus have been isolated from Praomys rodents and Praomys clade rodents host almost all the OW arenaviruses⁴⁸. Here, we described the AnRB3214 strain which was initially identified as an Ippy virus in 1981 in Botambi, CAR based on serological cross-reactivities. We demonstrated that strain AnRB3214 is distinct from the only other Ippy strain described to date, strain DakAnB188d, which also originates from the CAR. This classification was certainly due to cross-reactivity between the two viruses, because immunological assays were often used for virus identification and taxonomic assignment in the 1980s. Furthermore, “Ippy virus” has already been referred to as a complex comprising at least two viral species. Immunological microcompliment fixation data rapidly corroborated the presence of at least two distinct strains of Ippy virus, with low cross-reaction level, one of which was found in an Arvicanthis sp. rodent and the other in a Praomys sp. rodent⁷. With the description of the AnRB3214 strain, we confirm the huge arenavirus diversity in CAR rodents and the importance of the molecular characterization of viruses stored in biobanks.

Species delineation among arenaviruses is based on many criteria, including antigenic cross-reactivity, geographic distribution, host species and divergence at the amino acid level⁴⁹. For OW arenaviruses, the interspecies amino acid cut-off divergences vary according to the protein: 8.6–20.4% (GPC), 10.2–21.5% (NP), 24.2–30.2% (Z) and 20.8–36.6% (L)⁴⁹. Even so, some authors consider 12% divergence at the amino acid level in the NP enough to distinguish between two OW arenaviruses^50,51. However, the latter criterion raises significant limitations, especially when considering the intra-species diversity of Lassa virus and NW arenaviruses^48,52. Compared with Mobala virus at the amino acid level, strain AnRB3214 showed 6%, 13%, 17% and 17% genetic distance for the GPC, RdRp, Z and NP genes, respectively. Mobala virus and strain AnRB3214 were both isolated from unspecified Praomys rodents in the CAR in 1980 and 1981, respectively. Given the wide diversity of rodent species found in the forest areas of Central Africa, their morphological identification is often difficult and may sometimes require the use of molecular tools. Unfortunately, the absence of reference sequences for few rodent species mean that they cannot be identified⁵³. Furthermore, Praomys rodents, which are subdivided into at least 20 putative species, are morphologically similar to Mastomys and Hylomyscus rodents^54,55. Given that all these rodent species are abundant in CAR and that it is sometimes impossible to distinguish them on morphological criteria alone, it is difficult to confirm that strain AnRB3214 was isolated with certainty from a rodent of the Praomys genus. Mobala virus has also been isolated in Botambi, the same village from where strain AnRB3214 originates. Moreover, field and experimental evidence suggest that co-infections of a rodent species by two different arenaviruses are possible, although it has not yet been reported in the literature^6,54. It is therefore difficult to conclude whether strain AnRB3214 is distinct from Mobala virus or not, especially because no further information is available for either virus. Molecular clock analysis under the strict clock model shows that the MRCA of strain AnRB3214 and Mobala virus can be dated back to year 1600 AD, almost 400 years ago. OW arenaviruses are thought to have originated nearly 3000–7000 years ago, as illustrated by LCMV speciation events which are estimated to have occurred around 1000 to 5000 years ago and the spread of Lassa virus from Nigeria to Mali and Ivory Coast some 1000–2000 years ago^2,55,56,57. However, molecular clock analysis within Lassa virus lineages shows more contemporary virus speciation events, of less than 100 years up to 700 years^58,59, similar to what was observed for strain AnRB3214. Additionally, the substitution rate deduced for strain AnRB3214 matches that of other arenaviruses, ranging from 3.3 × 10^–4 to 6.3 × 10^–4 subt/site/year, rates similar to those for Gbagroube virus, LCMV or Lassa virus^55,59,60. Hence the circumstances that have led to the diversification of Mobala virus and strain AnRB3214 are probably similar to those observed within the Lassa virus lineages.

The AnRB3214 NP protein seems to be shorter than that of Mobala virus or the other OW arenaviruses, as it shows a ~ 53 aa deletion at the beginning of the N-terminal domain. Arenavirus NP proteins are essential for many aspects of the viral life cycle, such as the formation of the ribonucleoprotein complex when associated with RdRp, and the interference with the host immune system^45,61. The N- and C-terminal domains have been shown to be the main NP regions directly involved in NP-NP oligomerisation, ribonucleoprotein complex formation and RNA synthesis. In comparison with the results obtained through an amino acid mutagenesis experiments in the N-terminus using the Tacaribe virus, a NW arenavirus^44,62, we can conclude that the deletion observed in the AnRB3214 NP protein may not impact its biological function. However, further investigations are needed to understand the origin and consequences of this truncated nucleoprotein on viral particles.

More studies are needed to understand the implications of strain AnRB3214 for public health, especially in the CAR where civil unrest and political instability force the population to resort to feeding on forest-dwelling rodents. Although the prevalence of antibodies against Mobala virus in human population was associated with its prevalence in the rodent reservoirs^17,19, cross reactivity with arenavirus strain AnRB3214 cannot be rule out.

Conclusion

OW arenavirus diversity remains to be explored. Here, we revealed that strain AnRB3214, classified as an Ippy virus, is clearly a species close or identical to Mobala virus. This article shows once again the usefulness of molecular tools for taxonomic assignment, especially considering archived viral strains obtained using classical virology tools and classified based on antigenic similarities. In the CAR, 22 viral isolates identified as Ippy virus have been reported from Lemniscomys striatus, Praomys spp., Arvicanthis spp. and Mastomys spp. rodents, of which 16 were isolated from Praomys spp.^7,63. As seen in our study, these strains can represent distinct arenaviruses, especially because a specific arenavirus is usually associated with a single host species. Therefore, this study opens new ways to characterize the remaining CAR arenavirus strains, and especially those classified as Ippy virus and improve knowledge on the ecology of OW arenaviruses.

References

Radoshitzky, S. R. et al. ICTV Virus Taxonomy Profile: Arenaviridae. J. Gen. Virol. 100, 1200–1201 (2019).
Article CAS PubMed Google Scholar
Pontremoli, C., Forni, D. & Sironi, M. Arenavirus genomics: novel insights into viral diversity, origin, and evolution. Curr. Opin. Virol. 34, 18–28 (2019).
Article CAS PubMed Google Scholar
Charrel, R. N., de Lamballerie, X. & Emonet, S. Phylogeny of the genus Arenavirus. Curr. Opin. Microbiol. 11, 362–368 (2008).
Article PubMed Google Scholar
Shi, M. et al. Redefining the invertebrate RNA virosphere. Nature 540, 539–543 (2016).
Article CAS PubMed ADS Google Scholar
Wolff, H., Lange, J. V. & Webb, P. A. Interrelationships among arenaviruses measured by indirect immunofluorescence. Intervirology 9, 344–350 (1978).
Article CAS PubMed Google Scholar
Gonzalez, J. P., Emonet, S., de Lamballerie, X. & Charrel, R. Arenaviruses. Curr. Top. Microbiol. Immunol. 315, 253–288 (2007).
CAS PubMed PubMed Central Google Scholar
Salazar-Bravo, J., Ruedas, L. A. & Yates, T. L. Mammalian reservoirs of arenaviruses. Curr. Top. Microbiol. Immunol. 262, 25–63 (2002).
CAS PubMed Google Scholar
Forni, D. et al. Ancient Evolution of Mammarenaviruses: Adaptation via Changes in the L Protein and No Evidence for Host-Virus Codivergence. Genome Biol. Evol. 10, 863–874 (2018).
Article PubMed PubMed Central Google Scholar
Bowen, M. D., Peters, C. J. & Nichol, S. T. Phylogenetic analysis of the Arenaviridae: patterns of virus evolution and evidence for cospeciation between arenaviruses and their rodent hosts. Mol. Phylogenet Evol. 8, 301–316 (1997).
Article CAS PubMed Google Scholar
Fichet-Calvet, E. & Rogers, D. J. Risk maps of Lassa fever in West Africa. PLoS Negl. Trop. Dis. 3, e388 (2009).
Article PubMed PubMed Central Google Scholar
Briese, T. et al. Genetic detection and characterization of Lujo virus, a new hemorrhagic fever-associated arenavirus from southern Africa. PLoS Pathog. 5, e1000455 (2009).
Article PubMed PubMed Central CAS Google Scholar
Bonthius, D. J. Lymphocytic choriomeningitis virus: an underrecognized cause of neurologic disease in the fetus, child, and adult. Semin. Pediatr. Neurol. 19, 89–95 (2012).
Article PubMed PubMed Central Google Scholar
Nadine, N. et al. Evidence of lymphocytic choriomeningitis virus (LCMV) in domestic mice in Gabon: risk of emergence of LCMV encephalitis in Central Africa. J. Virol. 89, 1456–1460 (2015).
Article CAS Google Scholar
Doty, J. B. et al. Assessing monkeypox virus prevalence in small mammals at the human-animal interface in the Democratic Republic of the Congo. Viruses 9, 283 (2017).
Article PubMed Central CAS Google Scholar
Nakoune, E. et al. A Nosocomial outbreak of human monkeypox in the Central African Republic. Open Forum Infect. Dis. 4, ofx168 (2017).
Article PubMed PubMed Central Google Scholar
Gonzalez, J. P. et al. An arenavirus isolated from wild-caught rodents (Pramys species) in the Central African Republic. Intervirology 19, 105–112 (1983).
Article CAS PubMed Google Scholar
Gonzalez, J. P., McGomick, J. B., Saluzzo, J. F. & Georges, A. J. Les fièvres hémorragiques africaines d’origine virale: contribution à leur étude en République Centrafricaine. Cahiers ORSTOM Série Entomologie Médicale et Parasitologie 21, 119–130 (1983).
Google Scholar
Olayemi, A. et al. Arenavirus diversity and phylogeography of Mastomys natalensis Rodents, Nigeria. Emerg. Infect. Dis. 22, 694–697 (2016).
Article CAS PubMed PubMed Central Google Scholar
Georges, A. J. et al. Antibodies to Lassa and Lassa-like viruses in man and mammals in the Central African Republic. Trans. R. Soc. Trop. Med. Hyg. 79, 78–79 (1985).
Article CAS PubMed Google Scholar
Shi, M. et al. The evolutionary history of vertebrate RNA viruses. Nature 556, 197–202 (2018).
Article CAS PubMed ADS Google Scholar
Hetzel, U. et al. Isolation, identification, and characterization of novel arenaviruses, the etiological agents of boid inclusion body disease. J. Virol. 87, 10918–10935 (2013).
Article CAS PubMed PubMed Central Google Scholar
Quick, J. et al. Real-time, portable genome sequencing for Ebola surveillance. Nature 530, 228–232 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Loman, N. J. & Watson, M. Successful test launch for nanopore sequencing. Nat. Methods 12, 303–304 (2015).
Article CAS PubMed Google Scholar
Loman, N. J. et al. Performance comparison of benchtop high-throughput sequencing platforms. Nat. Biotechnol. 30, 434–439 (2012).
Article CAS PubMed Google Scholar
Mongan, A. E., Tuda, J. S. B. & Runtuwene, L. R. Portable sequencer in the fight against infectious disease. J. Hum. Genet. 65, 35–40 (2020).
Article PubMed Google Scholar
Faria, N. R. et al. Mobile real-time surveillance of Zika virus in Brazil. Genome Med. 8, 97 (2016).
Article PubMed PubMed Central Google Scholar
Kafetzopoulou, L. E. et al. Metagenomic sequencing at the epicenter of the Nigeria 2018 Lassa fever outbreak. Science 363, 74–77 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Wawina-Bokalanga, T. et al. Complete genome sequence of a new Ebola virus strain isolated during the 2017 Likati outbreak in the Democratic Republic of the Congo. Microbiol. Resour. Announc. 8, e00360 (2019).
Article PubMed PubMed Central Google Scholar
Gardy, J. L. & Loman, N. J. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat. Rev. Genet. 19, 9–20 (2018).
Article CAS PubMed Google Scholar
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Article CAS PubMed PubMed Central Google Scholar
Menzel, P., Ng, K. L. & Krogh, A. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat. Commun. 7, 11257 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Kim, D., Song, L., Breitwieser, F. P. & Salzberg, S. L. Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res. 26, 1721–1729 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wood, D. E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol. 20, 257 (2019).
Article CAS PubMed PubMed Central Google Scholar
Okonechnikov, K., Golosova, O. & Fursov, M. Unipro UGENE: a unified bioinformatics toolkit. Bioinformatics 28, 1166–1167 (2012).
Article CAS PubMed Google Scholar
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 35, 1547–1549 (2018).
Article CAS PubMed PubMed Central Google Scholar
Nakamura, T., Yamada, K. D., Tomii, K. & Katoh, K. Parallelization of MAFFT for large-scale multiple sequence alignments. Bioinformatics 34, 2490–2492 (2018).
Article CAS PubMed PubMed Central Google Scholar
Larsson, A. AliView: a fast and lightweight alignment viewer and editor for large datasets. Bioinformatics 30, 3276–3278 (2014).
Article CAS PubMed PubMed Central Google Scholar
Minh, B. Q. et al. Corrigendum to: IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 37, 2461 (2020).
Article PubMed PubMed Central Google Scholar
Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Article CAS PubMed Google Scholar
Rambaut, A., Lam, T. T., Max Carvalho, L. & Pybus, O. G. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2, vew007 (2016).
Article PubMed PubMed Central Google Scholar
Suchard, M. A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4, vey016 (2018).
Article PubMed PubMed Central Google Scholar
Swanepoel, R. et al. Identification of Ippy as a Lassa-fever-related virus. Lancet 1, 639 (1985).
Article CAS PubMed Google Scholar
Digoutte, J. P. Annual report of Institut Pasteur, Bangui, Central African Republic 59 (1970).
Levingston Macleod, J. M. et al. Identification of two functional domains within the arenavirus nucleoprotein. J. Virol. 85, 2012–2023 (2011).
Article CAS PubMed Google Scholar
Pinschewer, D. D., Perez, M. & de la Torre, J. C. Role of the virus nucleoprotein in the regulation of lymphocytic choriomeningitis virus transcription and RNA replication. J. Virol. 77, 3882–3887 (2003).
Article CAS PubMed PubMed Central Google Scholar
Quick, J. et al. Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples. Nat. Protoc. 12, 1261–1276 (2017).
Article CAS PubMed PubMed Central Google Scholar
Senol Cali, D., Kim, J. S., Ghose, S., Alkan, C. & Mutlu, O. Nanopore sequencing technology and tools for genome assembly: computational analysis of the current state, bottlenecks and future directions. Brief Bioinform. 20, 1542–1559 (2019).
Article PubMed CAS Google Scholar
Emonet, S. F., de la Torre, J. C., Domingo, E. & Sevilla, N. Arenavirus genetic diversity and its biological implications. Infect. Genet. Evol. 9, 417–429 (2009).
Article CAS PubMed PubMed Central Google Scholar
Emonet, S., Lemasson, J. J., Gonzalez, J. P., de Lamballerie, X. & Charrel, R. N. Phylogeny and evolution of old world arenaviruses. Virology 350, 251–257 (2006).
Article CAS PubMed Google Scholar
Bowen, M. D. et al. Genetic diversity among Lassa virus strains. J. Virol. 74, 6992–7004 (2000).
Article CAS PubMed PubMed Central Google Scholar
Radoshitzky, S. R. et al. Past, present, and future of arenavirus taxonomy. Arch. Virol. 160, 1851–1874 (2015).
Article CAS PubMed Google Scholar
Weaver, S. C. et al. Guanarito virus (Arenaviridae) isolates from endemic and outlying localities in Venezuela: sequence comparisons among and within strains isolated from Venezuelan hemorrhagic fever patients and rodents. Virology 266, 189–195 (2000).
Article CAS PubMed Google Scholar
Gryseels, S. et al. Role of Wildlife in Emergence of Ebola Virus in Kaigbono (Likati), Democratic Republic of the Congo, 2017. Emerg. Infect. Dis. 26, 2205–2209 (2020).
Article CAS PubMed PubMed Central Google Scholar
Fulhorst, C. F., Ksiazek, T. G., Peters, C. J. & Tesh, R. B. Experimental infection of the cane mouse Zygodontomys brevicauda (family Muridae) with guanarito virus (Arenaviridae), the etiologic agent of Venezuelan hemorrhagic fever. J. Infect. Dis. 180, 966–969 (1999).
Article CAS PubMed Google Scholar
Coulibaly-N’Golo, D. et al. Novel arenavirus sequences in Hylomyscus sp. and Mus (Nannomys) setulosus from Cote d’Ivoire: implications for evolution of arenaviruses in Africa. PLoS ONE 6, e20893 (2011).
Article PubMed PubMed Central ADS CAS Google Scholar
Albarino, C. G. et al. High diversity and ancient common ancestry of lymphocytic choriomeningitis virus. Emerg. Infect. Dis. 16, 1093–1100 (2010).
Article PubMed PubMed Central Google Scholar
Andersen, K. G. et al. Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus. Cell 162, 738–750 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fichet-Calvet, E. et al. Spatial and temporal evolution of Lassa virus in the natural host population in Upper Guinea. Sci. Rep. 6, 21977 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Ehichioya, D. U. et al. Current molecular epidemiology of Lassa virus in Nigeria. J. Clin. Microbiol. 49, 1157–1161 (2011).
Article PubMed PubMed Central Google Scholar
Zhang, L. et al. Isolation and genomic characterization of lymphocytic choriomeningitis virus in ticks from northeastern China. Transbound Emerg. Dis. 65, 1733–1739 (2018).
Article CAS PubMed Google Scholar
Martinez-Sobrido, L., Giannakas, P., Cubitt, B., Garcia-Sastre, A. & de la Torre, J. C. Differential inhibition of type I interferon induction by arenavirus nucleoproteins. J. Virol. 81, 12696–12703 (2007).
Article CAS PubMed PubMed Central Google Scholar
D’Antuono, A., Loureiro, M. E., Foscaldi, S., Marino-Buslje, C. & Lopez, N. Differential contributions of tacaribe arenavirus nucleoprotein N-terminal and C-terminal residues to nucleocapsid functional activity. J. Virol. 88, 6492–6505 (2014).
Article PubMed PubMed Central CAS Google Scholar
Meunier, D. Y., McCormick, J. B., Georges, A. J., Georges, M. C. & Gonzalez, J. P. Comparison of Lassa, Mobala, and Ippy virus reactions by immunofluorescence test. Lancet 1, 873–874 (1985).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We acknowledge Dr Carolyn Engel-Gautier for editing the English in the manuscript. This study was supported by the World Organisation for Animal health (OIE) through the European Union (capacity building and surveillance for Ebola virus disease; EBO-SURSY, project reference: FOOD/2016/379-660).This project was also supported by the Chinese Academy of Sciences, a Shanghai Municipal Science and Technology Major Project (Grant No. 2019SHZDZX02) and External Cooperation Program of Chinese Academy of Sciences (Grant No. 153211KYSB20160001).

Author information

These authors equally contributed: Emmanuel Nakouné and Nicolas Berthet.

Authors and Affiliations

Centre for Research in Infectious Diseases, Yaoundé, Cameroon
Huguette Simo Tchetgna
The Center for Microbes, Development and Health, CAS Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai – Chinese Academy of Sciences, Discovery and Molecular Characterization of Pathogens, Shanghai, 200031, China
Huguette Simo Tchetgna & Nicolas Berthet
Center of Bioinformatics, Biostatistics and Integrative Biology (C3BI), Institut Pasteur, Paris, France
Stephane Descorps-Declère
Institut Pasteur de Bangui, Bangui, Central African Republic
Benjamin Selekon & Emmanuel Nakouné
Cellule d’Intervention Biologique d’Urgence, Institut Pasteur, Unité Environnement et Risques Infectieux, Paris, France
Aurelia Kwasiborski, Mathias Vandenbogaert, Jean-Claude Manuguerra, Valérie Caro & Nicolas Berthet
Unité d’Epidémiologie et Physiopathologie des Virus Oncogènes, Département de Virologie, Institut Pasteur, Paris, France
Antoine Gessain
Centre National de Recherche Scientifique (CNRS) UMR3569, Paris, France
Antoine Gessain

Authors

Huguette Simo Tchetgna
View author publications
You can also search for this author in PubMed Google Scholar
Stephane Descorps-Declère
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Selekon
View author publications
You can also search for this author in PubMed Google Scholar
Aurelia Kwasiborski
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Vandenbogaert
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Claude Manuguerra
View author publications
You can also search for this author in PubMed Google Scholar
Antoine Gessain
View author publications
You can also search for this author in PubMed Google Scholar
Valérie Caro
View author publications
You can also search for this author in PubMed Google Scholar
Emmanuel Nakouné
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Berthet
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.S.T., S.D.D. and N.B.: wrote the manuscript; S.D.D., H.S.T. and M.V.: performed the bioinformatic analyses; B.S. and A.K.: performed the experiments; J.C.M., A.G. and V.C.: reviewed and edited the manuscript; E.N. and N.B.; developed and designed the study.

Corresponding author

Correspondence to Nicolas Berthet.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Simo Tchetgna, H., Descorps-Declère, S., Selekon, B. et al. Molecular characterization of a new highly divergent Mobala related arenavirus isolated from Praomys sp. rodents. Sci Rep 11, 10188 (2021). https://doi.org/10.1038/s41598-021-88046-5

Download citation

Received: 29 December 2020
Accepted: 06 April 2021
Published: 13 May 2021
DOI: https://doi.org/10.1038/s41598-021-88046-5

This article is cited by

Predicting the evolution of the Lassa virus endemic area and population at risk over the next decades
- Raphaëlle Klitting
- Liana E. Kafetzopoulou
- Simon Dellicour
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.