Skip to main content
Log in

Triplet encoded sequence based membrane protein classification using BiLSTM

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Membrane proteins provide a significant part in cellular activities. The role of membrane proteins is inevitable in drug interactions and in all living organisms. Membrane protein classification is used to identify the relationships between proteins. With the help of amino acid composition, proteins get classified. A novel protein classification scheme is proposed using Tri-code Embedding vector. This proposed method forms triplet subgroups which are assigned with unique code words. Then a triplet subgroup is formed from the amino acid subgroup which is provided as input to the Bidirectional Long Short-Term Memory (BiLSTM) and SoftMax layer for classification. Two data sets are utilized and classified, with 7582 membrane proteins and 4684 membrane proteins. The results are investigated applying the self-consistency test, the Mathew’s correlation coefficient and the independent data set. Moreover, the proposed method shows its improvement in protein classification process in terms of accuracy, specificity, sensitivity, precision, recall and fmeasure. Thus, the proposed scheme provides an effective protein classification scheme that incorporates the optimistic features of deep learning. The results depict that overall accuracy obtained for data set1 is 99.48% and for data set2 is 99.87%. The proposed method achieves the highest overall classification accuracy with minimum execution time when compared to the other methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Data availability

Open Source Dataset.

References

  1. Jayapriya K, Mary NAB (2019) Employing a novel 2-gram subgroup intra pattern (2GSIP) with stacked auto encoder for membrane protein classification. Mol Biol Rep 46:2259–2272. https://doi.org/10.1007/s11033-019-04680-3

    Article  Google Scholar 

  2. Thonghin Nopnithi, VasileiosKargas Jack Clews, Robert, (2018) Cryo-electron microscopy of membrane proteins. Journal of Methods 147:176–186

    Article  Google Scholar 

  3. Golmohammadi SK, Kurgan L, Crowley B and Reformat M (2007) Classification of cell membrane. FBIT 07 Proceedings of the 2007 Frontiers in the convergence of Bioscience and information technology, pp.153-158

  4. Chou C, Elrod DW (1999) Prediction of membrane protein types and subcellular locations, Proteins: Proteins: Structure. Function, and Genetics 34:137–53

    Google Scholar 

  5. Liu H, Wang M, Chou K-C (2005) Low-frequency Fourier spectrum for predicting membrane protein types. Biochemical and Biophysical Research Communications 336(3):737–739

    Article  Google Scholar 

  6. Sandaruwan PD, Wannige CT  An improved deep learning model for hierarchical classification of protein families. PLoS ONE 16(10): e0258625. https://doi.org/10.1371/journal.pone.0258625

  7. Ali Farman, Haya Maqsood (2005) Classification of membrane protein types using Voting Feature Interval in combination with Chou׳s Pseudo Amino Acid Composition. Journal of Theoretical Biology 384:78–83

    Article  Google Scholar 

  8. Nazar Z, El-Hajj W (2010) Predicting membrane protein type using inter-domain linker knowledge. In: BIOCOMP, pp 209–214

  9. Pandy-Szekeres G, Munk C, Tsonkov TM, Mordalski S, Harpsoe K, Hauser AS, Bojarski AJ, Gloriam DE (2018) GPCRdb in 2018: Adding GPCR structure models and ligands. Nucleic Acids Res. 46:D440–D446

    Article  Google Scholar 

  10. Srinivasu PN, SivaSai JG, Ijaz MF, Bhoi AK, Kim W, Kang JJ (2021) Classification of Skin Disease Using Deep Learning Neural Networks with MobileNet V2 and LSTM. Sensors 21:2852. https://doi.org/10.3390/s21082852

    Article  Google Scholar 

  11. Alphonse AS, Mary NA, Starvin MS (2020) Classification of membrane protein using Tetra Peptide Pattern. Analytical Biochemistry. 1(606)

    Article  Google Scholar 

  12. Ani Brown Mary N, Robert Singh A,  Athisayamani S (2020) Banana leaf diseased image classification using novel HEAP auto encoder (HAE) deep learning. Multimed Tools Appl 79, 30601–30613. https://doi.org/10.1007/s11042-020-09521-1

  13. Ani Brown Mary N, Dejey Dharma Coral reef image/video classification employing novel octa-angled pattern for triangular sub region and pulse coupled convolutional neural network (PCCNN). Multimed Tools Appl 77, 31545–31579 (2018). https://doi.org/10.1007/s11042-018-6148-5

  14. Kumar Y, Koul A, Singla R et al (2022) Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-021-03612-z

    Article  Google Scholar 

  15. Vulli A, Srinivasu PN, Sashank MSK, Shafi J, Choi J, Ijaz MF (2022) Fine-Tuned DenseNet-169 for breast cancer metastasis prediction using FastAI and 1-Cycle Policy. Sensors 22:2988

    Article  Google Scholar 

  16. Srinivasu PN, Ijaz MF, Shafi J & Wozniak M, Radha S (2022) 6G driven fast computational networking framework for healthcare applications. In: IEEE Access. https://doi.org/10.1109/ACCESS.2022.3203061

  17. Kumar M, Verma K, Kumar A, Ijaz MF, Rawat DB (2022) ANAF-IoMT: a novel architectural framework for IoMT enabled smart healthcare system by enhancing security based on RECC-VC. IEEE Trans Industrial Inform

  18. Pradhan NR, Singh AP, Verma S, Kavita Kaur N, Roy DS, Shafi J, Wozniak M, Ijaz MF (2022) A novel blockchain-based healthcare system design and performance benchmarking on a multi-hosted testbed. Sensors 22(9):3449

    Article  Google Scholar 

  19. Ali S, El-Sappagh S, Ali F, Imran M, Abuhmed T (2022) Multitask deep learning for cost-effective prediction of patient's length of stay and readmission state using multimodal physical activity sensory data. IEEE J Biomed Health Inform. https://doi.org/10.1109/JBHI.2022.3202178

  20. El-Rashidy N, Abuhmed T, Alarabi L, El-Bakry HM, Abdelrazek S, Ali F, El-Sappagh S (2022) Sepsis prediction in intensive care unit based on genetic feature optimization and stacked deep ensemble learning. Neural Comput Appl 34(5):3603–3632

    Article  Google Scholar 

  21. Parashar J, Kushwah VS, Rai M (2023) Determination Human Behavior Prediction Supported by Cognitive Computing-Based Neural Network. In: Kumar R, Verma AK, Sharma TK, Verma OP, Sharma S. (eds) Soft Computing: Theories and Applications. Lecture Notes in Networks and Systems, vol 627. Springer, Singapore. https://doi.org/10.1007/978-981-19-9858-4_36.

  22. Dey RK, Das AK (2023) Modified term frequency-inverse document frequency based deep hybrid framework for sentiment analysis. Multimed Tools Appl. 82:32967-32990.https://doi.org/10.1007/s11042-023-14653-1

  23. Dey RK, Das AK (2022) A Simple Strategy for Handling 'NOT' Can Improve the Performance of Sentiment Analysis. In: Das AK, Nayak J, Naik B, Vimal S, Pelusi D (eds) Computational Intelligence in Pattern Recognition. CIPR 2022. Lecture Notes in Networks and Systems, vol 480. Springer, Singapore. https://doi.org/10.1007/978-981-19-3089-8_25

  24. Chou KC, Elrod DW (1999) Prediction of membrane protein types and subcellular locations, Proteins: Struct. Funct. Bioinfor. 34(1):137–153

    Article  Google Scholar 

  25. Chou KC, Shen HB (2007) MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM. Biochem. Biophys. Res. Commun. 360(2):339–345

    Article  Google Scholar 

  26. Chou KC (2001) Prediction of protein cellular attributes using pseudo-amino acid composition, Proteins: Struct. Funct. Bioinfor. 43(3):246–255

    Article  Google Scholar 

  27. Wan S, Mak MW, Kung SY (2015) Mem-mEN: predicting multi-functional types of membrane proteins by interpretable elastic nets. IEEE ACM Trans. Comput. Biol. Bioinf 13(4):706–718

    Article  Google Scholar 

  28. Han Guo-Sheng, Zu-Guo Yu, Anh Vo (2014) A two-stage SVM method to predict membrane protein types by incorporating amino acid classifications and physicochemical properties into a general form of Chou’s PseAAC. Journal of Theoretical Biology 344:31–39

    Article  Google Scholar 

  29. Wan S, Mak MW, Kung SY (2016) Benchmark data for identifying multi-functional types of membrane proteins. Data. Brief. 8:105–107

    Article  Google Scholar 

  30. Guo L, Wang S, Li M, Cao Z (2019) Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning. BMC Bioinf. 20(25):1–7

    Google Scholar 

  31. Wang H, Ding Y, Tang J, Guo F (2020) Identification of membrane protein types via multivariate information fusion with Hilbert-Schmidt independence criterion. Neurocomputing. 28(383):257–69

    Google Scholar 

  32. Hopf TA, Colwell LJ, Sheridan R, Rost B, Sander C, Marks DS (2012) Three-dimensional structures of membrane proteins from genomic sequencing. Cell. 149(7):1607–21

    Article  Google Scholar 

  33. He Y, Wang S (2022) SE-BLTCNN: A Channel Attention Adapted Deep Learning Model Based on PSSM for Membrane Protein Classification. Computational Biology and Chemistry. 6

    Article  Google Scholar 

  34. Wang T, Xia T, Hu XM (2010) Geometry preserving projections algorithm for predicting membrane protein types. J. Theor. Biol. 262(2):208–213

    Article  Google Scholar 

  35. Anishetty S, Pennathur G, Anishetty R (2002) Tripeptide analysis of protein structures. BMC Struct. Biol. 2(1):9

    Article  Google Scholar 

  36. Cai YD, Ricardo PW, Jen CH, Chou KC (2004) Application of SVM to predict membrane protein types. J. Theor. Biol. 226(4):373–376

    Article  MathSciNet  Google Scholar 

  37. Wang M, Yang J, Xu ZJ, Chou KC (2005) SLLE for predicting membrane protein types. Journal of Theoretical Biology. 232(1):7–15

    Article  MathSciNet  Google Scholar 

  38. Arif M, Hayat M, Jan Z (2018) iMem-2LSAAC: A two-level model for discrimination of membrane proteins and their types by extending the notion of SAAC into chou’s pseudo amino acid composition. Journal of Theoretical Biology. 7(442):11–21

    Article  MathSciNet  Google Scholar 

  39. Chou KC, Shen HB (2007) MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM. Biochemical and biophysical research communications. 360(2):339–45

    Article  Google Scholar 

  40. Cai YD, Zhou GP, Jen CH, Lin SL, Chou KC (2004) Identify catalytic triads of serine hydrolases by support vector machines. J. Theor. Biol. 228(4):551–557

    Article  Google Scholar 

  41. Golmohammadi SK, Kurgan L, Crowley B, Reformat M (2007) Classification of cell membrane proteins, IEEE. Frontiers In Ihe Convergence Of Bioscience And Information Technologies, pp 153–158

  42. Shen HB, Chou KC (2005) Using optimized evidence-theoretic K-nearest neighbor classifier and pseudo-amino acid composition to predict membrane protein types. Biochem. Biophys. Res. Commun. 334(1):288–292

    Article  Google Scholar 

  43. Zhao X, Zou Q, Liu B, Liu X (2014) Exploratory predicting protein folding model with random forest and hybrid features. Current Proteomics. 11(4):289–99

    Article  Google Scholar 

  44. Hayat M, Khan A (2012) MemHyb: predicting membrane protein types by hybridizing SAAC and PSSM. Journal of theoretical biology. 7(292):93–102

    Article  MathSciNet  Google Scholar 

  45. Wang J, Li Y, Wang Q, You X, Man J, Wang C, Gao X (2012) ProClusEnsem: predicting membrane protein types by fusing different modes of pseudo amino acid composition. Computers in biology and medicine. 42(5):564–74

    Article  Google Scholar 

  46. Mary,  Ani Brown N,  Dharma D (2017) Coral reef image classification employing improved LDP for feature extraction. J Visual Comm Image Representation 49:225-242

  47. Alphonse, Sherly A, Ani Brown Mary N (2023) Classification of anti-oxidant proteins using novel physiochemical and conjoint-quad (PCQ) feature composition. Multimedia Tools Appl 1-27

Download references

Funding

No financial support for this research, authorship, and/or publication of this article.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Nithish Ram.

Ethics declarations

Ethics approval

All applicable international, national, and/or institutional guidelines for the care and use of animals were followed. This manuscript does not contain any studies with human participants or animals performed by any of the three authors.

Consent to participate

I understand that my contribution will be confidential and that there will be no personal identification in the data that I agree to allow to be used in the study.

Consent to publish

Authors should make sure to also seek consent from individuals to publish their data prior to submitting their manuscript to this journal.

Conflicts of interest

No conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gomathi, S., Ram, K.N. & Mary, N.A.B. Triplet encoded sequence based membrane protein classification using BiLSTM. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-19010-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11042-024-19010-4

Keywords

Navigation