Research PaperNeuropeptide discovery in Proasellus cavaticus: Prediction of the first large-scale peptidome for a member of the Isopoda using a publicly accessible transcriptome
Introduction
Peptides represent the largest single class of compounds used by animals for paracrine/endocrine signaling, particularly within and from the nervous system (e.g., [1]). For more than half a century, crustaceans have been used to elucidate the basic principles underlying neuropeptidergic signaling, and much work has focused on the identification and characterization of native neuropeptides in these animals using a variety of techniques (e.g., [2], [3]). Most early studies employed biochemical isolation and Edman degradation to purify and characterize individual neuropeptides of interest (e.g., [4], [5], [6], [7], [8], [9], [10]), or used targeted molecular cloning for neuropeptide discovery (e.g., [11], [12], [13], [14]), with a peptidome for a species built over time by adding new discoveries in a one-by-one fashion. More recently, mass spectrometry, in particular accurate mass matching and de novo tandem mass spectrometric sequencing, has become a common means for neuropeptide identification (e.g., [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25]); mass spectral approaches have the capacity for rapid peptidome elucidation when sufficient starting material is available for analysis (e.g., [19], [26], [27], [28], [29], [30], [31], [32], [33]). However, for species that are small, rare and/or geographically inaccessible, it is often not practical to obtain the large pools of tissue needed for biochemical- and/or mass spectral-based neuropeptide identification. For these animals, in silico genome/transcriptome mining with subsequent bioinformatics peptide structural prediction provides an alternative means for neuropeptide discovery, as genomes and transcriptomes can often be produced using tissue from a single individual as the starting material.
Over the past several years, genome/transcriptome mining has been used extensively to predict peptidomes for a wide variety of crustacean species (e.g., [34], [35], [36], [37], [38], [39], [40], [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [51], [52], [53], [54]), including many for which biochemical and/or mass spectral approaches would be highly impractical due to their geographic inaccessibility and/or small size, e.g., Speleonectes c.f. tulumensis, an anchialine cave dwelling remipede [35], Euphausia crystallorophias, a species of Antarctic krill [51], and multiple planktonic/parasitic members of the Copepoda [36], [38], [41]. In fact, via this approach, large peptidomes have now been described for at least one member of most of the major taxa that comprise the Crustacea, one notable exception being isopods.
The Isopoda is a crustacean order within the class Malacostraca and superorder Peracarida; also included in the Malacostraca, among other taxa, are the Amphipoda, another order within the Peracarida, and the Decapoda and Euphausiacea, two orders within the superorder Eucarida (e.g., [55]). Over 10,000 species of isopods, many endemic, are currently known, with members of this taxon occurring globally and existing in essentially all possible aquatic and terrestrial habitats (e.g., [56], [57], [58]). The reproductive strategy seen in isopods, which typically involves internal fertilization, brooding of the young in a ventral pouch, direct development, small brood size, and often a limited number of lifetime broods, is likely a major contributor to the high degree of endemism and habitat specialization seen in this group of crustaceans (e.g., [58]). Interestingly, and despite the variations in physiology/behavior that must be used by members of the Isopoda that have colonized and succeeded in radically different habitats, little work has focused on identifying the native neuropeptides present in members of crustacean order (e.g., [46], [59], [60], [61], [62], [63], [64]), which undoubtedly are key players in modulating the physiological and behavioral control systems of these animals.
Here, using a publicly deposited transcriptome for Proasellus cavaticus (BioProject No. PRJEB14193; T. Lefebure, direct GenBank submission), a subcentemeter, blind, pigmentless, subterranean ground water dwelling member of the Isopoda [65], the first peptidome for an isopod has been deduced via in silico transcriptome mining and bioinformatics peptide prediction. Specifically BLAST searches of the P. cavaticus transcriptome were conducted using known arthropod neuropeptide precursors as the query sequences. Via this strategy, 49 putative peptide-encoding transcripts were identified, allowing for the prediction of the mature structures of a 171-sequence peptidome. The P. cavaticus peptidome contains members of 28 known arthropod neuropeptide families/subfamilies, as well as a large number of linker/precursor-related sequences whose structures do not place them into any generally recognized peptide group, and which may or may not be bioactive. Comparisons of the peptide isoforms predicted for P. cavaticus with those identified previously from members of the Amphipoda, Decapoda and Euphausiacea show that the structures of many peptide isoforms are broadly conserved in malacostracans, a finding that suggests that many peptides may serve conserved physiological roles within this crustacean class.
Section snippets
Database searches
Database searches were conducted on or before June 24, 2017 using methods modified from a well-vetted protocol (e.g., [32], [33], [34], [35], [36], [37], [38], [39], [40], [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [66], [67], [68], [69], [70], [71], [72], [73], [74], [75], [76], [77], [78], [79], [80], [81]). Specifically, the database of the online program tblastn (National Center for Biotechnology Information, Bethesda, MD; http://blast.ncbi.nlm.nih.gov/Blast.cgi) was set to
Results
Forty-one peptide families were searched for within the publicly accessible P. cavaticus TSA dataset (Table 1). In the interest of space, only those searches that resulted in the identification of putative precursor-encoding transcripts are described here, with the data presented in alphabetical order based on family name. All proteins listed as “full-length” exhibit a functional signal sequence (including a “start” methionine) and are flanked on their C-terminus by a stop codon. Proteins
Peptide discovery in Proasellus cavaticus
Decreasing costs and improved technology for high-throughput nucleotide sequencing has led to growing molecular resources (i.e., genomes and transcriptomes) for a variety of crustacean species (e.g., [34], [51], [79], [81], [103], [104], [105], [106], [107], [108], [109], [110], [111], [112], [113], [114], [115], [116], [117], [118], [119]). These datasets have greatly facilitated gene discovery in the Crustacea, including the identification of sequences encoding the precursors for peptide
Funding
This work was supported by the National Science Foundation (grant number IOS-1353023) and the Cades Foundation of Honolulu, Hawaii.
Acknowledgements
This article is SOEST Contribution No. 10219.
References (119)
- et al.
Orcokinin: a novel myotropic peptide from the nervous system of the crayfish, Orconectes limosus
Peptides
(1992) - et al.
Molecular cloning of crustacean pigment dispersing hormone precursor
Biochem. Biophys. Res. Commun.
(1992) - et al.
Molecular cloning of crustacean red pigment concentrating hormone precursor
Biochem. Biophys. Res. Commun.
(1993) - et al.
Mass spectrometric characterization of crustacean hyperglycemic hormone precursor-related peptides (CPRPs) from the sinus gland of the crab, Cancer productus
Peptides
(2005) - et al.
Mass spectrometric identification of pEGFYSQRYamide: a crustacean peptide hormone possessing a vertebrate neuropeptide Y (NPY)-like carboxy-terminus
Gen. Comp. Endocrinol.
(2007) - et al.
High-mass-resolution direct-tissue MALDI-FTMS reveals broad conservation of three neuropeptides (APSGFLGMRamide, GYRKPPFNGSIFamide and pQDLDHVFLRFamide) across members of seven decapod crustaean infraorders
Peptides
(2007) - et al.
Molecular and mass spectral identification of the broadly conserved decapod crustacean neuropeptide pQIRYHQCYFNPISCF: the first PISCF-allatostatin (Manduca sexta- or C-type allatostatin) from a non-insect
Gen. Comp. Endocrinol.
(2010) - et al.
Mass spectrometric elucidation of the neuropeptidome of a crustacean neuroendocrine organ
Peptides
(2012) - et al.
Mass spectrometric characterization of the neuropeptidome of the ghost crab Ocypode ceratophthalma (Brachyura, Ocypodidae)
Gen. Comp. Endocrinol.
(2013) - et al.
Neuropeptidomic analysis of the brain and thoracic ganglion from the Jonah crab Cancer borealis
Biochem. Biophys. Res. Commun.
(2003)