Pathogenic Protist Transmembranome database (PPTdb): a web-based platform for searching and analysis of protist transmembrane proteins

Lee, Chi-Ching; Huang, Po-Jung; Yeh, Yuan-Ming; Chen, Sin-You; Chiu, Cheng-Hsun; Cheng, Wei-Hung; Tang, Petrus

doi:10.1186/s12859-019-2857-7

Volume 20 Supplement 13

Selected articles from the 8th Translational Bioinformatics Conference: Bioinformatics

Research
Open access
Published: 24 July 2019

Pathogenic Protist Transmembranome database (PPTdb): a web-based platform for searching and analysis of protist transmembrane proteins

Chi-Ching Lee^1,2,
Po-Jung Huang^2,3,
Yuan-Ming Yeh²,
Sin-You Chen¹,
Cheng-Hsun Chiu^2,4,
Wei-Hung Cheng⁵ &
…
Petrus Tang^4,5

BMC Bioinformatics volume 20, Article number: 382 (2019) Cite this article

1510 Accesses
1 Citations
2 Altmetric
Metrics details

Abstract

Background

Pathogenic protist membrane transporter proteins play important roles not only in exchanging molecules into and out of cells but also in acquiring nutrients and biosynthetic compounds from their hosts. Currently, there is no centralized protist membrane transporter database published, which makes system-wide comparisons and studies of host-pathogen membranomes difficult to achieve.

Results

We analyzed over one million protein sequences from 139 protists with full or partial genome sequences. Putative transmembrane proteins were annotated by primary sequence alignments, conserved secondary structural elements, and functional domains. We have constructed the PPTdb (Pathogenic Protist Transmembranome database), a comprehensive membrane transporter protein portal for pathogenic protists and their human hosts. The PPTdb is a web-based database with a user-friendly searching and data querying interface, including hierarchical transporter classification (TC) numbers, protein sequences, functional annotations, conserved functional domains, batch sequence retrieving and downloads. The PPTdb also serves as an analytical platform to provide useful comparison/mining tools, including transmembrane ability evaluation, annotation of unknown proteins, informative visualization charts, and iterative functional mining of host-pathogen transporter proteins.

Conclusions

The PPTdb collected putative protist transporter proteins and offers a user-friendly data retrieving interface. Moreover, a pairwise functional comparison ability can provide useful information for identifying functional uniqueness of each protist. Finally, the host and non-host protein similarity search can fulfill the needs of comprehensive studies of protists and their hosts. The PPTdb is freely accessible at http://pptdb.cgu.edu.tw.

Background

Bioactive molecules that cross through the extracellular barrier mainly rely on channels, pores, or energy-consuming pumps composed of transporter proteins [1]. Transporters are specifically necessary for parasitic protists to salvage essential molecules for survival, such as nucleotides/ nucleosides, carbohydrates, and amino acids, from the human host [2,3,4,5,6,7,8]. In addition, membrane transporters also play a role in the drug resistance of protists [9]. These pivotal roles of transporters for parasitism strongly prompt parasitologists to characterize protist transporters in order to extend knowledge in protist pathogenesis and innovate new treatment strategies. Indeed, more than 50% of therapeutic drugs targeted to receptors or transporters on the membrane, emphasizing the importance of membrane proteins in disease control [10]. However, it is difficult to characterize the biological functions of membrane proteins because of uncertain experimental procedures [11]. Thus, a functional prediction based on protein sequence would be helpful to concentrate on the interested membrane proteins. Although whole genome sequencing of important human pathogenic protists has been conducted during the last decade, information about protist transporters has been lacking. Therefore, the goal of this study was to build the human Pathogenic Protist Transmembranome database (PPTdb) to provide classification and system-wide comparison to accelerate the exploration of protist transporters.

Several databases collecting transporter information have been constructed as a resource of known and putative transporter proteins. The Transporter Classification Database (TCDB) is the only classification organization for transporters that documents at least 1000 transporter families based on function and phylogeny derived from published studies [12]. TransportDB 2.0 has recorded potential transporters from ~ 2700 sequenced genomes using the following criteria: (1) primary amino acid sequence identity, and (2) predicted structural homology and topology [13]. PPTdb classification was based on protein sequence alignment to known human transporters, coupled with transmembrane domain prediction and gene ontology (GO) categorization to increase the accuracy of transporter annotation in sequenced protist genomes.

Advanced applications such as expression profiles and polymorphisms of transporters are included in the Human Transporter Database (HTD) [14]. The Yeast Transporter Protein database (YTPdb) contains membrane topology, post-translational modifications, and a “wiki-like” freely updatable platform [15]. The PPTdb is not only a resource but also an interactive functional search engine for potential protist transporters. The query section of PPTdb is composed of sequence identity to known human transporters and conserved transporter characteristics. The summary of a PPTdb search consists of a pairwise functional comparison of protist to human transporters that distinguishes either protist-specific transporters or human homologs. Furthermore, a functional comparison between two protists can be examined to highlight the commonalities between or uniqueness of transporters in each organism. An iterative search panel is another option for multiple functional queries in order to obtain more specific targets.

As the first transporter collection for protists, the PPTdb has globally analyzed more than one million protein transporters collected from sub-databases of the EupathDB family and from the NCBI genome archive. The classification of potential protist transporters depends on the sequence similarity to human host transporters, putative transmembrane domains, and GO functional groupings. The user-friendly interface allows one to use different strengths to filter out the desired transporters and to conduct inter-species comparisons to humans and other protists. Additionally, the iterative functional mining allows for the entry of more than one keyword to find possible transporters within one round of searching. Comparing to previous databases, PPTdb offers an easy-to-use data querying interface, such as human homolog gene filter, iterative functional mining, and PPTdb also contains regularly updated gene and amino acid data by a back-end automatically data processing pipeline. The PPTdb thus provides a platform for parasitologists to understand the field of transporters and discover new therapeutic targets in important protists by comparing human homologous genes and protist uniqueness transporter genes.

Construction and content

Sequence characterization and annotation

The data processing pipeline was mainly built by service-side command-line PHP and a suite of in-house developed text-processing and data retrieval modules which were implemented by Bash scripting language.

To gather the general and taxonomic information of each organism, we developed an automatic data retrieving and processing pipeline. Customized shell and PHP scripts were used to extract general and taxonomic information from NCBI taxonomy data portal. The taxonomy data were used to build the hierarchical species selection module on the front page of the website. To ensure that all the annotations were annotated using the same software environment, we re-annotated all the protein sequences based on functional and secondary structure information, such as transmembrane prediction by TMHMM, and functional annotations by PROSITE, Pfam, and Gene Ontology.

Web-interface and database architecture

The PPTdb is a database providing real-time user interaction functionalities. Its user interface was implemented by HTML5, jQuery, and Ajax which can be opened by most modern web browsers. Functional annotations, species categories, sequence alignments and general information were stored in a relational database implemented by MySQL, which guarantees short latency and instant responses by online queries.

The PPTdb collected all protein sequence BLASTp results; traditional SQL queries such as ‘join’ and ‘nested-query’ require significant waiting time. To minimize the data querying time, we designed an SQL and PHP two-step conditional query methodology. The first step was called SQL query. Information for each species was separately stored in a MySQL data table, which can reduce SQL querying time, and multiple species queries could be executed in parallel. User-selected gene lists sent from the web interface were ignored in the first stage query and passed on to step 2, and all genes fitting the functional and sequence similarity conditions were selected from the database. The second step was the server-side gene query. The gene lists accepted from step 1 were stored in memory by creating a memory-cached search. According to our tests, this strategy could be executed at least twice as fast as a nested SQL query.

Amino acid sequences of all protist proteins are stored as FASTA files in our file-based database. A set of sequence-retrieving programs is used to extract user-selected gene sequences from web-based queries.

Database contents

The PPTdb collected all known protist-annotated protein sequences from the EupathDB and NCBI genome archive [16,17,18]. There were 139 protist genomes composed of Entamoeba and Acanthamoeba (11 species), Cryptosporidium (12 species), Giardia (6 species), Microsporidia (26 species), Piroplasma (8 species), Plasmodium (23 species), Toxoplasma (26 species), Trichomonas (one species), and Trypanosomatidae (26 species). All the human transporter protein sequences were retrieved from the TCDB data portal which provides the comprehensive annotations of transporter sequences, including functional classifications. To construct human transporter sequences for pathogen and host studies, we selected all human transporter sequences based on the annotation records of TCDB. Table 1 shows the summary of database statistics.

Table 1 Database statistics of PPTdb

Full size table

The PPTdb serves as a knowledge portal, an online functional mining platform, and a cross-species comparison tool specific to protist transporter proteins and human hosts. It is freely accessible at http://pptdb.cgu.edu.tw. Figure 1 shows the analysis workflow and data mining processes of the PPTdb.

The database had collected 1,055,827 protein sequences which were annotated based on the same annotating software and parameters, including 465,391 transmembrane domains, 4429 Pfams, 1064 superfamilies, 1896 GO terms, 668 PROSITE patterns, and 695 PROSITE profiles. Figure 2 represents the interactive web interface of the PPTdb.

Utility and discussion

Transmembrane domain filter

The PPTdb annotated all proteins of all collected protists by TMHMM, which is the most widely used transmembrane domain (TM) prediction tool. Users could select transmembrane candidate proteins of interest with a specific number of TMs, such as a six-TM potassium channel, or a single TM domain for an alpha-helix. Using the same TMs could easily identify proteins with similar structures or biological functions, which was easier than searching an entire dataset.

One-click potential transporter gene finder

We collected all 16,478 transporter proteins downloaded from the TCDB and annotated them by Gene Ontology terms to construct a transporter protein functional ontology dataset. Proteins collected in the TCDB share approximately 1300 GO terms (Additional file 1). Functional ontology terms were used to identify putative transporter proteins including functionally clarified and hypothetical proteins. Comparing to existing protist resources and collecting the feedbacks of testing members, the one-click potential transporter finder may be the most straightforward biological function-based gene finder for protists, of which annotations and sequences had not been comprehensively completed.

Sequence homology search for human transporters

Primary sequence alignment is an effective method to identify proteins as human homologs. Protists and their human-host homolog genes could be used on studies of drug design and host-parasite interactions, because most functional elements share similar sequence identities, such as secondary structural domains and conserved functional elements. The PPTdb executed all-against-all amino acid sequence alignment by BLASTp on proteins of protist organisms and human transporter proteins adopted from TCDB. It also provided a sequence homology search interface allowing the user to preset the search criteria, including sequence identity, similarity, and a ratio of alignment length versus length of target or query proteins. A higher ratio indicates similarity to human proteins. The search result was visualized by a Venn diagram helping users to select or rule out human homology proteins. Users could click the checkbox on the Venn diagram to select protist-unique proteins, human transporter homology proteins, or both. Genes fitting the above criteria were summarized in functional component charts, and listed in Type-n-Search dynamic tables on the bottom side of the web page. Moreover, the table delivered transporter classification (TC) IDs for the human protein homologs which were identified by BLASTp alignment.

Functional component charts

The selected putative transporter genes using the above filters were summarized using functional component charts by delivering the top 10 domains/components, allowing users to identify the most dominant functional elements from the genomic point of view. There were five functional component charts: GO, Pfam, PROSITE pattern, PROSITE profile, and superfamily. All of them were highly associated with biological functions.

Dynamic type-n-search table

All the listed data including genes and GO terms were delivered by a Type-n-Search table, allowing users to narrow down the selection data. The data rows could be sorted by ascending or descending value (text data were sorted by alphabetical order, while the numeric data were sorted by number) by clicking the header in the data table. The keyword search box located in the top-right corner of the table could perform partial string matching for all columns in the table.

Download page for sequence and annotation retrieval

The PPTdb provided a data retrieval tool for protein sequences and annotations targeting genes selected by the functional filters and homology search tool mentioned above. Download by annotations delivers the gene ID and annotations. Download by BLAST results contained all the primary sequence similarity results of users’ selected genes against all protein sequences recorded in the TransportDB. Download by sequences provided all amino acid sequences formatted in a FASTA text file. All three download links were dynamically generated by the user modifying any search parameters above. Files were zipped in a tar.gz format which could be unzipped by conventional file compression tools such as 7-zip in Windows and MacOS or tar in Linux operating systems.

Iterative functional mining

Most protist resources provided basic search functionalities including a gene ID/name or keyword search. However, if a user wanted to search by specific biological function such as “calcium channel”, it was not easy to ensure that everyone can type the correct term. The PPTdb provided a real-time GO description search. Users could type only a few characters of the description, and the system searched our back-end database in the background and returned suggested terms by the Type-n-Search data table. In contrast to other auto-filled search boxes (such as Google search), the user could only select one suggested option. In our iterative functional mining interface, users could do the secondary search using the query box in the top-right corner of the data table to narrow down the result from hundreds of returned elements. All the suggested elements could be added into a collecting box which was similar to a shopping cart in the online shopping website.

Unlike other search interfaces that only allowed the user to use single terms for database searching, the collecting box supports multiple iterations of searches. Items returned from the database of every search could be put into the collecting box together (Fig. 2c). Moreover, items put into the collecting box could also be removed by clicking the “x” button on the left-hand side of each item (Fig. 2b). For example, users could use “calcium channel”, “ion channel”, and “voltage-gated channel” as keywords to query the database in three independent searches and add all the items into the collecting box. There were 96 GO descriptions associated with “ion channel”; then, the secondary search could be used to remove all 9 “ligand-gated” items in the search results. Finally, all 90 items associated with “ion channel” except 9 “ligand-gated” items could be added into the collecting box. The collecting box serves as a functional filter that could be coupled with the previously mentioned TM filter, the one-click potential transporter finder, and the human transporter homolog filter (Fig. 2b). These four filters comprise the “iterative evidence-driven putative transporter system mining” workflow of PPTdb.

Pairwise functional compositional comparisons

Interspecies comparisons of current protist resources focused on the number of genes, transporter classes, or sequence similarities. However, constraint-based comparison, which allowed users to pre-define specific search parameters or select a subset from the whole genome, was still lacking in protist resources. The PPTdb provided pairwise functional element comparison through the Venn diagrams. Users could select unique (left- and right-hand side) or shared (intersecting region of the Venn diagram) functional elements. After that, the Type-n-Search data table delivered genes that contain selected functional elements for users’ further detailed investigations.

Data retrieving and automatically update functionality

There were several automatically updating scripts to guarantee the latest’s data of PPTdb, including data retrieving, gene annotation, data processing and database renewwing. PPTdb were consisted by two exactly the same virtual machines – the stable version for public use and the standby version for latest data integration. Once all the data was completely updated and deposited into PPTdb’s database, the standby version would go on-line. And the stable version will go off-line for next data updating.

Comparison to other protist transporter resources

Currently, there was no published transporter-system database specifically designed for protists. Two general-purpose transporter protein databases (TCDB and TransportDB) could be used for protist transporter system studies. However, both databases collected a limited number of protists, which could not provide sufficient resources for protist transporter protein studies. The TCDB contained data for more than ten thousand transporter proteins and provided a transporter classification (TC) system recognized by the International Union of Biochemistry and Molecular Biology (IUBMB), generating systematical functional categories. However, there were only 75 protists in this database and fewer than 300 protist transporters. TransportDB provided clean and comprehensive data resources for transporter systems targeting to sequenced genomes which contained ~ 2500 bacteria species; however, there were fewer than 40 eukaryotic species (and only 11 protists) in the latest published version.

Both TCDB and TransportDB provided interfaces for transporter protein studies; however, neither of them offered mining tools for protist versus human homolog gene searches, protist to protist comparison, or further protist-associated studies. Table 2 listed the major differences between PPTdb, TCDB, and TransportDB from the protist research point of view.

Table 2 A comparison table between PPTdb and other protist transporter resources

Full size table

Iterative functional mining workflow: an example of use

Horizontal gene transfer events were known as evolutionary driving forces of eukaryotes [19]. For example, nucleotide transporter (NTT) gene acquisition was reported as a major evolutionary innovation of Microsporidia which were intracellular parasites of animals and human [20].

As a proof of our user-friendly interface, sequences of putative NTTs were identified from seven Microsporidia species using PPTdb’s iterative functional mining workflow. By using traditional one-way search interface that only allowed single query once a time, users must enter an exactly correct term, for example, “nucleotide transport” returned only 1 GO descriptions in A. Algerae PRA339. This was because words between “nucleotide” and “transport” could not be searched by the general query method. In iterative functional mining, one could first search word “nucleotide” which returned all GO descriptions which contained keyword “nucleotide”. Then, the type-n-search data table of PPTdb allowed a secondary refining search by entering “transport” to narrow down search results. This two-way search interface offered a high flexibility especially on multiple keyword combinations which could not be searched by one-way search, for example, nucleotide-sugar transmembrane transporter activity (GO:0005338), guanine nucleotide transmembrane transporter activity (GO:0001409), and nucleotide transmembrane transporter activity (GO:0015215). Moreover, the type-n-search table instantly returned database query candidates. Users could obtain putative results by just entering a few characters instead of entire searching keywords. For example, entering “transport” would return all the candidate entries including “transport”, “transporter”, and “transmembrane transporter” which could be act as a search guidance to inform users that there was more than one “transport” associated keywords in the GO description database.

In this example, 22 putative transporters were identified by those nucleotide transport associated GO descriptions from seven Microsporidia species (Table 3). Interestingly, more than half of the results are hypothetical proteins. The protist uniqueness genes and human transporter homologs could be easily separated from the search. Finally, we could download all the FASTA sequences by clicking the Download link of the web page. Tools such as MAFFT [21] and ClustalW [22] could do the multiple sequence alignment and deliver the phylogenetic tree of these sequences. Detailed information of steps would be found on the demonstration page of PPTdb (http://pptdb.cgu.edu.tw/demo.php).

Table 3 Search entries^a of NTTs in seven Microsoporidia species

Full size table

Specific search strategies for putative transporter proteins

PPTdb offered the precisely functional search instead of general keyword search used by general purposed databases such as Entrez Gene [17, 18], UniProt [23], and EuPathDB [16]. In addition to keyword search, PPTdb also provided specialized pre-set search filters for putative transporter proteins, including one-click-potential transporter genes, number of transmembrane domains, and iterative functional search boxes. Table 4 illustrated a search term “nucleotide” for putative transporter proteins with 1 transmembrane domain of species Acanthamoeba castellanii str. Neff. It was complicated to mimic all possible search movements for every user of these databases that provided several advanced search tools. The most straightforward search strategy was the keyword search and filters provided by each database. The search would be set if the database offers pre-set filters, such as number of transmembrane domains. However, the keyword search was used without pre-set filters, such as putative transporter proteins. The result showed that the functional filter and iterative search functionalities could make the search more user-friendly than other general purposed databases. All the searched genes Ids from PPTdb could be downloaded in a text format and then be executed more detailed data retrieving processes, such as genetic information from Entrez Gene database, protein 3D structures and pathways from UniProt database.

Table 4 Search comparisons PPTdb and several general purpose genomic databases

Full size table

Future work

PPTdb collected all the putative transporter protist proteins and provides a user-friendly data querying interface. The next goal is to collect the human validated transporter proteins by offering a system for putative and validated proteins comparison. The collection of 3D structures of protist transporters will be the goal as well. We believe these future works will make PPTdb more useful on potential protist associated treatment or drug development.

Conclusions

The PPTdb had been specifically designed for protist transporter system studies and provides a data query portal, an online comparison tool, and a flexible functional search interface. For all putative, hypothetical, or curated protist proteins, the PPTdb provided functional annotations. The PPTdb also offered a straightforward protist-human homology search interface for pathogen and host studies.

Abbreviations

BLASTp:: Basic local alignment search tool for protein
GO:: Gene Ontology
HTD:: Human Transporter Database
HTML5:: HyperText markup language 5
IUBMB:: International Union of Biochemistry and Molecular Biology
MySQL:: My structured query language
NCBI:: National Center for Biotechnology Information
NTT:: Nucleotide transporter
PHP:: PHP hypertext preprocessor
PPTdb:: Pathogenic Protist Transmembranome database
TC:: Transporter classification
TCDB:: Transporter classification database
TM:: Transmembrane domain
TMHMM:: Transmembrane Helices; Hidden Markov Model
YTPdb:: Yeast Transporter Protein Database

References

Dean P, Major P, Nakjang S, Hirt RP, Embley TM. Transport proteins of parasitic protists and their role in nutrient salvage. Front Plant Sci. 2014;5:153.
Article Google Scholar
Vasudevan G, Carter NS, Drew ME, Beverley SM, Sanchez MA, Seyfang A, Ullman B, Landfear SM. Cloning of Leishmania nucleoside transporter genes by rescue of a transport-deficient mutant. Proc Natl Acad Sci U S A. 1998;95:9873–8.
Article CAS Google Scholar
Tsaousis AD, Kunji ERS, Goldberg AV, Lucocq JM, Hirt RP, Embley TM. A novel route for ATP acquisition by the remnant mitochondria of Encephalitozoon cuniculi. Nature. 2008;453:553–6.
Article CAS Google Scholar
Tjaden J, Haferkamp I, Boxma B, Tielens AGM, Huynen M, Hackstein JHP. A divergent ADP/ATP carrier in the hydrogenosomes of Trichomonas gallinae argues for an independent origin of these organelles. Mol Microbiol. 2004;51:1439–46.
Article CAS Google Scholar
Landfear SM. Glucose transporters in parasitic protozoa. Methods Mol Biol. 2010;637:245–62.
Article CAS Google Scholar
Blume M, Hliscs M, Rodriguez-Contreras D, Sanchez M, Landfear S, Lucius R, Matuschewski K, Gupta N. A constitutive pan-hexose permease for the Plasmodium life cycle and transgenic models for screening of antimalarial sugar analogs. FASEB J. 2011;25:1218–29.
Article CAS Google Scholar
Inbar E, Schlisselberg D, Suter Grotemeyer M, Rentsch D, Zilberstein D. A versatile proline/alanine transporter in the unicellular pathogen Leishmania donovani regulates amino acid homoeostasis and osmotic stress responses. Biochem J. 2013;449:555–66.
Article CAS Google Scholar
Shaked-Mishan P, Suter Grotemeyer M, Yoel-Almagor T, Holland N, Zilberstein D, Rentsch D. A novel high-affinity arginine transporter from the human parasitic protozoan Leishmania donovani. Mol Microbiol. 2006;60:30–8.
Article CAS Google Scholar
Fidock DA, Nomura T, Talley AK, Cooper RA, Dzekunov SM, Ferdig MT, Ursos LM, Sidhu AB, Naudé B, Deitsch KW, Su XZ, Wootton JC, Roepe PD, Wellems TE. Mutations in the P. falciparum digestive vacuole transmembrane protein PfCRT and evidence for their role in chloroquine resistance. Mol Cell. 2000;6:861–71.
Article CAS Google Scholar
Rask-Andersen M, Almén MS, Schiöth H. Trends in the exploitation of novel drug targets. Nat Rev Drug Discov. 2011;10:579–90.
Article CAS Google Scholar
Carpenter EP, Beis K, Cameron AD, Iwata S. Overcoming the challenges of membrane protein crystallography. Curr Opin Struct Biol. 2008;18:581–6.
Article CAS Google Scholar
Saier MH, Reddy VS, Tsu BV, Ahmed MS, Li C, Moreno-Hagelsieb G. The transporter classification database (TCDB): recent advances. Nucleic Acids Res. 2016;44:D372–9.
Article CAS Google Scholar
Elbourne LDH, Tetu SG, Hassan KA, Paulsen IT. TransportDB 2.0: a database for exploring membrane transporters in sequenced genomes from all domains of life. Nucleic Acids Res. 2017;45:D320–4.
Article CAS Google Scholar
Ye A, Liu QR, Li CY, Zhao M, Qu H. Human transporter database: comprehensive knowledge and discovery tools in the human transporter genes. PLoS One. 2014;9:e88883.
Article Google Scholar
Brohée S, Barriot R, Moreau Y, André B. YTPdb: a wiki database of yeast membrane transporters. Biochim Biophys Acta. 2010;1798:1908–12.
Article Google Scholar
Aurrecoechea C, Barreto A, Basenko EY, Brestelli J, Brunk BP, Cade S, Crouch K, Doherty R, Falke D, Fischer S, Gajria B, Harb OS, Heiges M, Hertz-Fowler C, Hu S, Iodice J, Kissinger JC, Lawrence C, Li W, Pinney DF, Pulman JA, Roos DS, Shanmugasundram A, Silva-Franco F, Steinbiss S, Stoeckert CJ, Spruill D, Wang H, Warrenfeltz S, Zheng J. EuPathDB: the eukaryotic pathogen genomics database resource. Nucleic Acids Res. 2017;45:D581–91.
Article CAS Google Scholar
Sayers EW, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, Dicuccio M, Edgar R, Federhen S, Feolo M, Geer LY, Helmberg W, Kapustin Y, Landsman D, Lipman DJ, Madden TL, Maglott DR, Miller V, Mizrachi I, Ostell J, Pruitt KD, Schuler GD, Sequeira E, Sherry ST, Shumway M, Sirotkin K, Souvorov A, Starchenko G, Tatusova TA, Wagner L, Yaschenko E, Ye J. Database resources of the National Center for biotechnology information. Nucleic Acids Res. 2009;37:D5–15.
Article CAS Google Scholar
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2009;37:D26–31.
Article CAS Google Scholar
Schönknecht G, Weber APM, Lercher MJ. Horizontal gene acquisitions by eukaryotes as drivers of adaptive evolution. Bioessays. 2014;36:9–20.
Article Google Scholar
Dean P, Sendra KM, Williams TA, Watson AK, Major P, Nakjang S, Kozhevnikova E, Goldberg AV, Kunji ERS, Hirt RP, Embley TM. Transporter gene acquisition and innovation in the evolution of microsporidia intracellular parasites. Nat Commun. 2018;9:1709.
Article CAS Google Scholar
Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2017;30:3059.
Google Scholar
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–80.
Article CAS Google Scholar
Bateman A. UniProt: a hub for protein information. Nucleic Acids Res. 2015;43:D204–12.
Article Google Scholar

Download references

Acknowledgement

Not applicable

Funding

This study was supported by grants from the Chang Gung Memorial Hospital Research Funding (CMRPD1G0541-1G0543) and Ministry of Science and Technology, Taiwan (MOST 106–2221-E-182-068). Publication of this article was sponsored by the grants MOST 107–2320-B-182-021-MY3 and BMRP056.

Availability of data and materials

The web site of PPTdb is freely accessible at http://pptdb.cgu.edu.tw.

About this supplement

This article has been published as part of BMC Bioinformatics Volume 20 Supplement 13, 2019: Selected articles from the 8th Translational Bioinformatics Conference: Bioinformatics. The full contents of the supplement are available online at https://bmcbioinformatics.biomedcentral.com/articles/supplements/volume-20-supplement-13.

Author information

Authors and Affiliations

Department and Graduate Institute of Computer Science and Information Engineering, Chang Gung University, Taoyuan, Taiwan
Chi-Ching Lee & Sin-You Chen
Genomic Medicine Core Laboratory, Chang Gung Memorial Hospital, Linkou, Taiwan
Chi-Ching Lee, Po-Jung Huang, Yuan-Ming Yeh & Cheng-Hsun Chiu
Department of Biomedical Sciences, Chang Gung University, Taoyuan, Taiwan
Po-Jung Huang
Molecular Infectious Disease Research Center, Chang Gung Memorial Hospital, Linkou, Taiwan
Cheng-Hsun Chiu & Petrus Tang
Department of Parasitology, College of Medicine, Chang Gung University, Taoyuan, Taiwan
Wei-Hung Cheng & Petrus Tang

Authors

Chi-Ching Lee
View author publications
You can also search for this author in PubMed Google Scholar
Po-Jung Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yuan-Ming Yeh
View author publications
You can also search for this author in PubMed Google Scholar
Sin-You Chen
View author publications
You can also search for this author in PubMed Google Scholar
Cheng-Hsun Chiu
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Hung Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Petrus Tang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Software, CC.L., PJ.H. and YM.Y.; Validation, CC.L., PJ.H. and YM.Y.; Investigation, CC.L., SY.C. and WH.C.; Resources, CC.L.; Data Curation, CC.L. and SY.C.; Writing-Original Draft Preparation, CC.L. and WH.C.; Writing-Review & Editing, CH.C., WH.C. and P.T.; Visualization, CC.L., PJ.H. and YM.Y.; Supervision, P.T. All authors have read and approved the final manuscript.

Corresponding authors

Correspondence to Wei-Hung Cheng or Petrus Tang.

Ethics declarations

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

GO terms used in the PPTdb (XLSX 45 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Lee, CC., Huang, PJ., Yeh, YM. et al. Pathogenic Protist Transmembranome database (PPTdb): a web-based platform for searching and analysis of protist transmembrane proteins. BMC Bioinformatics 20 (Suppl 13), 382 (2019). https://doi.org/10.1186/s12859-019-2857-7

Download citation

Published: 24 July 2019
DOI: https://doi.org/10.1186/s12859-019-2857-7

Selected articles from the 8th Translational Bioinformatics Conference: Bioinformatics

Pathogenic Protist Transmembranome database (PPTdb): a web-based platform for searching and analysis of protist transmembrane proteins

Abstract

Background

Results

Conclusions

Background

Construction and content

Sequence characterization and annotation

Web-interface and database architecture

Database contents

Utility and discussion

Transmembrane domain filter

One-click potential transporter gene finder

Sequence homology search for human transporters

Functional component charts

Dynamic type-n-search table

Download page for sequence and annotation retrieval

Iterative functional mining

Pairwise functional compositional comparisons

Data retrieving and automatically update functionality

Comparison to other protist transporter resources

Iterative functional mining workflow: an example of use

Specific search strategies for putative transporter proteins

Future work

Conclusions

Abbreviations

References

Acknowledgement

Funding

Availability of data and materials

About this supplement

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional file

Additional file 1:

Rights and permissions

About this article

Cite this article

Share this article

BMC Bioinformatics

Contact us