Abstract
The first reference genome assembly of the moss Physcomitrellopsis africana, a rare narrow endemic terricolous species from southeastern coastal forests of South Africa, is presented here. Phylogenetically, Physcomitrellopsis africana bridges the major evo-devo moss models Physcomitrium patens and Funaria hygrometrica, which diverged from their common ancestor 60-80 million years ago. The Physcomitrellopsis africana genome was assembled with both long Nanopore reads (73x) and short Illumina reads (163x). The 440 Mb assembly comprises 2,396 contigs and 23,493 protein-coding genes (BUSCO: C:96.0%[D:13.9%]), including two unique genes of putative microbial origin absent in close relatives. While the informatic approaches to genome assembly are becoming more standardized, best practices for contamination detection are less defined. The long reads sequenced for the Physcomitrellopsis africana genome contained approximately 12% contamination originating from microbial sources. This study describes the informatic processes employed to distinguish contaminants from candidate horizontal gene transfer events. Following the assembly and annotation, examination of whole genome duplication, and patterns of gene family expansion and contraction, were conducted. The genome bears signatures of two whole genome duplications shared with Physcomitrium patens and F. hygrometrica. Comparative analyses of gene family evolution revealed contractions associated with the light harvesting regulatory network in Physcomitrellopsis africana in comparison to Physcomitrium patens and F. hygrometrica. This first high-quality African bryophyte genome provides insights into genome evolution and HGT in an understudied moss lineage.
Article Summary The first draft genome of the rare moss, Physcomitrellopsis africana, endemic to South Africa’s southeastern coastal forests, is presented here. The 440 Mb genome fills a critical phylogenetic gap, enabling the first comparative analysis of the genomes of three moss species that diverge over the last 60-80 million years. The analysis uncovered 23,493 genes and provides new insights into genome evolution and gene family expansion/contraction in mosses. Rigorous contaminant filtering also identified two genes uniquely acquired through horizontal gene transfer.
Competing Interest Statement
The authors have declared no competing interest.