Abstract
Membrane proteins provide a significant part in cellular activities. The role of membrane proteins is inevitable in drug interactions and in all living organisms. Membrane protein classification is used to identify the relationships between proteins. With the help of amino acid composition, proteins get classified. A novel protein classification scheme is proposed using Tri-code Embedding vector. This proposed method forms triplet subgroups which are assigned with unique code words. Then a triplet subgroup is formed from the amino acid subgroup which is provided as input to the Bidirectional Long Short-Term Memory (BiLSTM) and SoftMax layer for classification. Two data sets are utilized and classified, with 7582 membrane proteins and 4684 membrane proteins. The results are investigated applying the self-consistency test, the Mathew’s correlation coefficient and the independent data set. Moreover, the proposed method shows its improvement in protein classification process in terms of accuracy, specificity, sensitivity, precision, recall and fmeasure. Thus, the proposed scheme provides an effective protein classification scheme that incorporates the optimistic features of deep learning. The results depict that overall accuracy obtained for data set1 is 99.48% and for data set2 is 99.87%. The proposed method achieves the highest overall classification accuracy with minimum execution time when compared to the other methods.
Similar content being viewed by others
Data availability
Open Source Dataset.
References
Jayapriya K, Mary NAB (2019) Employing a novel 2-gram subgroup intra pattern (2GSIP) with stacked auto encoder for membrane protein classification. Mol Biol Rep 46:2259–2272. https://doi.org/10.1007/s11033-019-04680-3
Thonghin Nopnithi, VasileiosKargas Jack Clews, Robert, (2018) Cryo-electron microscopy of membrane proteins. Journal of Methods 147:176–186
Golmohammadi SK, Kurgan L, Crowley B and Reformat M (2007) Classification of cell membrane. FBIT 07 Proceedings of the 2007 Frontiers in the convergence of Bioscience and information technology, pp.153-158
Chou C, Elrod DW (1999) Prediction of membrane protein types and subcellular locations, Proteins: Proteins: Structure. Function, and Genetics 34:137–53
Liu H, Wang M, Chou K-C (2005) Low-frequency Fourier spectrum for predicting membrane protein types. Biochemical and Biophysical Research Communications 336(3):737–739
Sandaruwan PD, Wannige CT An improved deep learning model for hierarchical classification of protein families. PLoS ONE 16(10): e0258625. https://doi.org/10.1371/journal.pone.0258625
Ali Farman, Haya Maqsood (2005) Classification of membrane protein types using Voting Feature Interval in combination with Chou׳s Pseudo Amino Acid Composition. Journal of Theoretical Biology 384:78–83
Nazar Z, El-Hajj W (2010) Predicting membrane protein type using inter-domain linker knowledge. In: BIOCOMP, pp 209–214
Pandy-Szekeres G, Munk C, Tsonkov TM, Mordalski S, Harpsoe K, Hauser AS, Bojarski AJ, Gloriam DE (2018) GPCRdb in 2018: Adding GPCR structure models and ligands. Nucleic Acids Res. 46:D440–D446
Srinivasu PN, SivaSai JG, Ijaz MF, Bhoi AK, Kim W, Kang JJ (2021) Classification of Skin Disease Using Deep Learning Neural Networks with MobileNet V2 and LSTM. Sensors 21:2852. https://doi.org/10.3390/s21082852
Alphonse AS, Mary NA, Starvin MS (2020) Classification of membrane protein using Tetra Peptide Pattern. Analytical Biochemistry. 1(606)
Ani Brown Mary N, Robert Singh A, Athisayamani S (2020) Banana leaf diseased image classification using novel HEAP auto encoder (HAE) deep learning. Multimed Tools Appl 79, 30601–30613. https://doi.org/10.1007/s11042-020-09521-1
Ani Brown Mary N, Dejey Dharma Coral reef image/video classification employing novel octa-angled pattern for triangular sub region and pulse coupled convolutional neural network (PCCNN). Multimed Tools Appl 77, 31545–31579 (2018). https://doi.org/10.1007/s11042-018-6148-5
Kumar Y, Koul A, Singla R et al (2022) Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-021-03612-z
Vulli A, Srinivasu PN, Sashank MSK, Shafi J, Choi J, Ijaz MF (2022) Fine-Tuned DenseNet-169 for breast cancer metastasis prediction using FastAI and 1-Cycle Policy. Sensors 22:2988
Srinivasu PN, Ijaz MF, Shafi J & Wozniak M, Radha S (2022) 6G driven fast computational networking framework for healthcare applications. In: IEEE Access. https://doi.org/10.1109/ACCESS.2022.3203061
Kumar M, Verma K, Kumar A, Ijaz MF, Rawat DB (2022) ANAF-IoMT: a novel architectural framework for IoMT enabled smart healthcare system by enhancing security based on RECC-VC. IEEE Trans Industrial Inform
Pradhan NR, Singh AP, Verma S, Kavita Kaur N, Roy DS, Shafi J, Wozniak M, Ijaz MF (2022) A novel blockchain-based healthcare system design and performance benchmarking on a multi-hosted testbed. Sensors 22(9):3449
Ali S, El-Sappagh S, Ali F, Imran M, Abuhmed T (2022) Multitask deep learning for cost-effective prediction of patient's length of stay and readmission state using multimodal physical activity sensory data. IEEE J Biomed Health Inform. https://doi.org/10.1109/JBHI.2022.3202178
El-Rashidy N, Abuhmed T, Alarabi L, El-Bakry HM, Abdelrazek S, Ali F, El-Sappagh S (2022) Sepsis prediction in intensive care unit based on genetic feature optimization and stacked deep ensemble learning. Neural Comput Appl 34(5):3603–3632
Parashar J, Kushwah VS, Rai M (2023) Determination Human Behavior Prediction Supported by Cognitive Computing-Based Neural Network. In: Kumar R, Verma AK, Sharma TK, Verma OP, Sharma S. (eds) Soft Computing: Theories and Applications. Lecture Notes in Networks and Systems, vol 627. Springer, Singapore. https://doi.org/10.1007/978-981-19-9858-4_36.
Dey RK, Das AK (2023) Modified term frequency-inverse document frequency based deep hybrid framework for sentiment analysis. Multimed Tools Appl. 82:32967-32990.https://doi.org/10.1007/s11042-023-14653-1
Dey RK, Das AK (2022) A Simple Strategy for Handling 'NOT' Can Improve the Performance of Sentiment Analysis. In: Das AK, Nayak J, Naik B, Vimal S, Pelusi D (eds) Computational Intelligence in Pattern Recognition. CIPR 2022. Lecture Notes in Networks and Systems, vol 480. Springer, Singapore. https://doi.org/10.1007/978-981-19-3089-8_25
Chou KC, Elrod DW (1999) Prediction of membrane protein types and subcellular locations, Proteins: Struct. Funct. Bioinfor. 34(1):137–153
Chou KC, Shen HB (2007) MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM. Biochem. Biophys. Res. Commun. 360(2):339–345
Chou KC (2001) Prediction of protein cellular attributes using pseudo-amino acid composition, Proteins: Struct. Funct. Bioinfor. 43(3):246–255
Wan S, Mak MW, Kung SY (2015) Mem-mEN: predicting multi-functional types of membrane proteins by interpretable elastic nets. IEEE ACM Trans. Comput. Biol. Bioinf 13(4):706–718
Han Guo-Sheng, Zu-Guo Yu, Anh Vo (2014) A two-stage SVM method to predict membrane protein types by incorporating amino acid classifications and physicochemical properties into a general form of Chou’s PseAAC. Journal of Theoretical Biology 344:31–39
Wan S, Mak MW, Kung SY (2016) Benchmark data for identifying multi-functional types of membrane proteins. Data. Brief. 8:105–107
Guo L, Wang S, Li M, Cao Z (2019) Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning. BMC Bioinf. 20(25):1–7
Wang H, Ding Y, Tang J, Guo F (2020) Identification of membrane protein types via multivariate information fusion with Hilbert-Schmidt independence criterion. Neurocomputing. 28(383):257–69
Hopf TA, Colwell LJ, Sheridan R, Rost B, Sander C, Marks DS (2012) Three-dimensional structures of membrane proteins from genomic sequencing. Cell. 149(7):1607–21
He Y, Wang S (2022) SE-BLTCNN: A Channel Attention Adapted Deep Learning Model Based on PSSM for Membrane Protein Classification. Computational Biology and Chemistry. 6
Wang T, Xia T, Hu XM (2010) Geometry preserving projections algorithm for predicting membrane protein types. J. Theor. Biol. 262(2):208–213
Anishetty S, Pennathur G, Anishetty R (2002) Tripeptide analysis of protein structures. BMC Struct. Biol. 2(1):9
Cai YD, Ricardo PW, Jen CH, Chou KC (2004) Application of SVM to predict membrane protein types. J. Theor. Biol. 226(4):373–376
Wang M, Yang J, Xu ZJ, Chou KC (2005) SLLE for predicting membrane protein types. Journal of Theoretical Biology. 232(1):7–15
Arif M, Hayat M, Jan Z (2018) iMem-2LSAAC: A two-level model for discrimination of membrane proteins and their types by extending the notion of SAAC into chou’s pseudo amino acid composition. Journal of Theoretical Biology. 7(442):11–21
Chou KC, Shen HB (2007) MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM. Biochemical and biophysical research communications. 360(2):339–45
Cai YD, Zhou GP, Jen CH, Lin SL, Chou KC (2004) Identify catalytic triads of serine hydrolases by support vector machines. J. Theor. Biol. 228(4):551–557
Golmohammadi SK, Kurgan L, Crowley B, Reformat M (2007) Classification of cell membrane proteins, IEEE. Frontiers In Ihe Convergence Of Bioscience And Information Technologies, pp 153–158
Shen HB, Chou KC (2005) Using optimized evidence-theoretic K-nearest neighbor classifier and pseudo-amino acid composition to predict membrane protein types. Biochem. Biophys. Res. Commun. 334(1):288–292
Zhao X, Zou Q, Liu B, Liu X (2014) Exploratory predicting protein folding model with random forest and hybrid features. Current Proteomics. 11(4):289–99
Hayat M, Khan A (2012) MemHyb: predicting membrane protein types by hybridizing SAAC and PSSM. Journal of theoretical biology. 7(292):93–102
Wang J, Li Y, Wang Q, You X, Man J, Wang C, Gao X (2012) ProClusEnsem: predicting membrane protein types by fusing different modes of pseudo amino acid composition. Computers in biology and medicine. 42(5):564–74
Mary, Ani Brown N, Dharma D (2017) Coral reef image classification employing improved LDP for feature extraction. J Visual Comm Image Representation 49:225-242
Alphonse, Sherly A, Ani Brown Mary N (2023) Classification of anti-oxidant proteins using novel physiochemical and conjoint-quad (PCQ) feature composition. Multimedia Tools Appl 1-27
Funding
No financial support for this research, authorship, and/or publication of this article.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethics approval
All applicable international, national, and/or institutional guidelines for the care and use of animals were followed. This manuscript does not contain any studies with human participants or animals performed by any of the three authors.
Consent to participate
I understand that my contribution will be confidential and that there will be no personal identification in the data that I agree to allow to be used in the study.
Consent to publish
Authors should make sure to also seek consent from individuals to publish their data prior to submitting their manuscript to this journal.
Conflicts of interest
No conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gomathi, S., Ram, K.N. & Mary, N.A.B. Triplet encoded sequence based membrane protein classification using BiLSTM. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-19010-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11042-024-19010-4