Nearly Complete Genome Sequence of an Echovirus 30 Strain from a Cluster of Aseptic Meningitis Cases in California, September 2017

We report the nearly complete genome sequence of a human enterovirus, a strain of echovirus 30, obtained from a cerebrospinal fluid specimen from a teenaged patient with aseptic meningitis in September 2017.

E choviruses are members of the Enterovirus B species of the Enterovirus (EV) genus in the Picornaviridae family of nonenveloped, single-stranded, positive-sense RNA viruses. Echoviruses were named from the acronym enteric cytopathic human orphan virus at the time of their discovery in the 1950s but were later found to be associated with respiratory illness, hand-foot-and-mouth disease, and aseptic meningitis, similar to other enteroviruses (1).
According to the California Code of Regulations, meningitis cases are reportable to the California Department of Public Health (CDPH) within 1 day of identification of etiology (2). In the fall of 2017, a cluster of aseptic meningitis cases from a northern California high school were reported to the CDPH. The Viral and Rickettsial Disease Laboratory (VRDL) at the CDPH detected EV from 19 of 30 patients (63%) by real-time reverse transcription-PCR (RT-PCR), as previously described (3). We generated and analyzed partial capsid (viral protein 1 [VP1]) sequences using methods developed by Minnaar et al. (4). Fifteen of 19 (79%) EV-positive patients were confirmed to have echovirus 30 (E-30), using cerebrospinal fluid (CSF) samples. This cluster of E-30 meningitis cases is similar to previously reported E-30 aseptic meningitis cases (5,6) in symptoms and epidemiology.
Here, we report a nearly complete genome sequence from one of the E-30-positive CSF specimens. The CSF was processed by centrifugation, 0.45-m filtration, and nuclease treatment prior to extraction using the NucliSENS easyMAG system (bioMérieux, Durham, NC) (7). The extracted nucleic acids were then treated with DNase to yield RNA, which was subjected to random reverse transcription and PCR (7). The next-generation sequencing (NGS) library was prepared using a Nextera XT kit and sequenced on a MiSeq platform 300-cycle paired-end run (Illumina, San Diego, CA). The NGS data were analyzed using an in-house Centers for Disease Control and Prevention (CDC) pipeline which involves the removal of host sequences using bowtie2/2.3.3.1, primer removal, low-quality (below Q20) and read length (Ͻ50 nucleotides) filtering using cutadapt 1.18, read duplication removal using a Dedup.py script, de novo assembly using SPAdes 3.7 default parameters, and BLAST search of the resultant contigs (8). There were a total of 141,329 postprocessing FASTQ reads. The final consensus genome was inspected and annotated using Geneious v10.0.9 (9). The contig was built from 15,712 reads, assembled to an E-30 reference genome (GenBank accession number JX976773), and deemed nearly complete by comparison to the reference, and the termini were determined as part of the protocol (7). The total GC content is 48.3% for 7,155 bases. The average read coverage was 260-fold for the E-30 genome.
The genome sequence was designated E-30 USA/2017/CA-RGDS-1005. Its VP1 sequence was confirmed by the CDC Picornavirus Laboratory to be nearly identical to those of E-30s identified in an aseptic meningitis outbreak that occurred in the fall of 2017 in Nevada; it also has greater than 99% nucleotide identity to the VP1 sequences of E-30 strains from the southern United States identified by the CDC in May 2017 (GenBank accession numbers MG584831 and MG584832), as measured using the online version of blastn (https://blast.ncbi.nlm.nih.gov/Blast.cgi). The genome sequence of E-30 USA/2017/CA-RGDS-1005 shares less than 89% nucleotide identity (NI) and less than 98% amino acid identity (AI) with other publicly available E-30 sequences. The sequence contains the complete protein-coding region, with short sections in the untranslated regions (UTRs) missing due to a lack of read coverage (approximately 182 and 90 nucleotides of the 5= and 3= UTRs, respectively). The enterovirus polyprotein can be divided into one structural (P1-capsid) and two nonstructural (P2 and P3) regions. The polyprotein regions of the E-30 genome reported here share 96%, 88%, and 84% NI (P1, P2, and P3, respectively) with other E-30 sequences in GenBank.
Data availability. The E-30 sequence of USA/2017/CA-RGDS-1005 has been deposited in GenBank under the accession number MK238483. The quality-filtered FASTQ reads have been deposited in the Sequence Read Archive with the run accession number SRR10082176.

ACKNOWLEDGMENTS
The contributions of the California Department of Public Health Viral and Rickettsial Disease Laboratory were supported in part by the Epidemiology and Laboratory Capacity for Infectious Diseases Cooperative Agreement number 6 NU50CK000410 from the U.S. Centers for Disease Control and Prevention. This work was partly funded by federal appropriations to the Centers for Disease Control and Prevention, through the Advanced Molecular Detection Initiative line item.
The findings and conclusions in this report are those of the author(s) and do not necessarily represent the official position of the Centers for Disease Control and Prevention.