The complete mitochondrial genome of Peleteria iavana (Diptera, Tachinidae) in Guizhou, China, and its phylogenetic placement

Abstract The mitochondrial genome of the tachinid fly Peleteria iavana (Wiedemann, 1819), which belongs to the family Tachinidae, was obtained for the first time using high-throughput sequencing techniques. The complete mitochondrial genome is 15,697 bp in size and consists of 13 protein-coding genes (PCGs), two ribosomal RNA genes, 22 transfer RNAs and a non-coding control region. The nucleotide composition biases A and T, the overall A + T percentage is up to 78.9% of the entire mitogenome. A phylogenetic analysis of 30 species within the family Tachinidae suggested that P. iavana is most closely related to (Janthinomyia sp.+Lydina aenea). The P. iavana mitochondrial genome will be a fundamental resource for understanding the molecular phylogenetic relationships of the species-rich subfamily Tachininae of Tachinidae.


Introduction
The Tachininae is the second most speciose subfamily in Tachinidae, with 2,746 described species worldwide (O'Hara et al. 2020). As a parasitoid, it is an important natural enemy in most terrestrial ecological communities (Stireman et al. 2006). Genus Peleteria is a representative member of Tachininae, including 4 subgenera and 124 species in the world, while 2 species occur in Guizhou Province, China (O'Hara et al. 2009;2020). Peleteria iavana (Wiedemann, 1819) spreads throughout the Palearctic, Oriental, Afrotropical and Australasian Regions (O'Hara and Henderson 2020), and it is considered a native species of China (O'Hara et al. 2009). In this study, the mitogenome data of P. iavana is sequenced and annotated as the first mitogenome of genus Peleteria, and the phylogenetic placement of P. iavana in the family Tachinidae is explored.
To reveal the phylogenetic position of P. iavana, the dataset of 13 PCGs of previously reported 29 tachinid species were downloaded and their GenBank accession numbers were provided in Figure 3 (Nelson et al. 2012;Shao et al. 2012;Zhao et al. 2013;Li et al. 2017;Hou et al. 2018Hou et al. , 2019Pei et al. 2019;Seo et al. 2019;Luo et al. 2021;Yan et al. 2021;Li et al. 2022;Wang, Zhi, Yao et al. 2022). Additionally, Fucellia costalis from Anthomyiidae and Sarcophaga ruficornis from Sarcophagidae were chosen as outgroups. Preliminary multiple sequence alignments were performed using the Clustal W in MEGAX 10.2.6 (Kumar et al. 2018). The phylogenetic tree was reconstructed in the maximum-likelihood (ML) method with 1,000 bootstrap repeats using MEGA X 10.2.6 (Kumar et al. 2018).  protein-coding gene, control region, rRNA gene, tRNA gene.

Results
The 4.43 Gb raw data was yielded by the high-throughput sequencing techniques with 12,926,828 reads, and the 4.29 Gb (96.84%) clean data was obtained after filtering. The complete mitochondrial genome of P. iavana was assembled using 594,117 reads, and the average depth of coverage was assessed at 4561.6 X (Supplementary Figure 1). The average coverage depth for two regions nucleotides 14,853-14,977 (A þ T ratio: 94.4%) and nucleotides 15,147-15,390 (A þ T ratio: 93.5%) was 503.1 X (max: 635 X, min: 426 X) and 306.8 X (max: 632 X, min: 138 X), respectively. The reason that the two regions had relatively low coverage depth in the mitochondrial genome may be that the control region had a higher AT ratio and more repeat sequences. The complete circular mitogenome of P. iavana (GenBank accession number: OM156425.1) was 15,697 bp in length, with the base composition as follows: A (40.5%), T (38.4%), G (9.0%), and C (12.1%). The nucleotide composition was AT-biased (A þ T ratio: 78.9%). It contained 13 protein-coding genes (PCGs), two rRNA genes, 22 tRNA genes and one non-coding control region, which was a typical configuration of the tachinid mitogenome ( Figure 2). Four PCGs, two rRNA genes and eight tRNA genes were encoded by the light strand, while others were located on the heavy strand.
As shown in Figure 3, a phylogenetic analysis based on 13 PCGs revealed that P. iavana and (Janthinomyia sp.þLydina aenea) clustered together. Additionally, Hamaxiella brunnescens is closely related to ((CylindromyiaþDexiinae)þMacquartia spp.). Therefore, the subfamily Tachininae was not assigned to be a monophyletic group.

Conclusion
In the present study, we first reported the mitochondrial genome of P. iavana. It was a closed circular molecule of 15,697 bp in length, containing 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and a control region. The phylogenetic analysis suggested that Tachinidae was a monophyletic group and subfamily Tachininae has not formed a monophyly. P. iavana and (Janthinomyia sp.þLydina aenea) clustered together in the basal position. Hamaxiella brunnescens of tribe Palpostomatini from subfamily Tachininae was the sister group of (Dexiinae þ Cylindromyini), the result was consistent with the previous study on the phylogeny of Tachinidae based on the morphology and molecular data (Stireman et al. 2019). Eventually, as the sister group to Hamaxiella brunnescens, four species of genus Macquartia clustered together with high supporting values, the phylogenetic tree showed a similarity to the phylogenetic relationships of the Tachinidae in Li et al. (Li et al. 2022). These results provided an important basis for further studies on the mitochondrial genomics and phylogenetics of the family Tachinidae.

Ethical approval
Ethical approval (No. 1900074) was granted by the Animal Care Welfare Committee of Guizhou Medical University for the study.

Author contributions
According to International Committee of Medical Journal Editors (ICMJE) criteria, all authors including Peng Zhang, Yan Zhi, Chunchun Guo, Chuntian Zhang and Jiayu Liu participated in the conception and design of the work, acquisition and analysis of data, drafting and revising the work, and final approval of the version to be published. Also, all authors agreed to be accountable for all aspects of the work.

Disclosure statement
No potential conflict of interest was reported by the author(s).

Data availability statement
The genome sequence data that support the findings of this study are openly available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov/ under the accession no. OM156425.1. The associated BioProject, SRA and Bio-Sample numbers are PRJNA800680, SRR18030147 and SAMN25276555, respectively.