Abstract
The biological sequences that occur in DNA, RNA and proteins can be considered as strings formed over the well defined chemical alphabets. Such gene sequences form structure based on the complementary pair and the structures can be interpreted as languages. Matrix insertion-deletion system has been introduced a few years back that modelled several bio-molecular structures occur at intramolecular and intermolecular level. In this paper, we identify some structures that are frequently noticed during RNA folding process such as double bulge loop, extended internal loop, triple stem and loop and we give the corresponding formal language representation to such structures. Further, we model the structures using Matrix insertion-deletion systems. This work is pioneering to give the language representation and modelling the structures of RNA folding process using formal grammar.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Brendel, V., Busse, H.G.: Genome structure described by formal languages. Nucleic Acids Res. 12(5), 2561–2568 (1984)
Brown, M., Wilson, C.: RNA Pseudoknot modelling using intersections of stochastic CFG with applications to database search. In: Proceedings of the Pacific Symposium on Biocomputing, Hawaii, USA, pp. 109–125 (1995)
Cai, L., Russell, L., Wu, Y.: Stochastic modelling of RNA pseudoknotted structures: a grammatical approach. Bioinformatics 19(1), 66–73 (2003)
Galiukschov, B.S.: Semicontextual grammars (in Russian). Matem. Logica i Matem. Lingvistika, pp. 38–50 (1981)
Head, T.: Formal language theory and DNA: an analysis of the generative capacity of specific recombinant behaviors. Bull. Math. Biol. 49(6), 737–750 (1987)
Kari, L., Thierrin, G.: Contextual insertions/deletions and computability. Inf. Comput. 131(1), 47–61 (1996)
Kari, L.: On insertion and deletion in formal languages. Ph.D. Thesis, University of Turku (1991)
Kuppusamy, L., Mahendran, A., Krishna, S.N.: Matrix insertion-deletion systems for bio-molecular structures. In: Natarajan, R., Ojo, A. (eds.) ICDCIT 2011. LNCS, vol. 6536, pp. 301–312. Springer, Heidelberg (2011). doi:10.1007/978-3-642-19056-8_23
Lakshmanan, K., Anand, M., Clergerie, E.V.: Modelling intermolecular structures and defining ambiguity in gene sequences using matrix insertion-deletion systems. In: Enguix, G.B., Dahl, V., Dolores Jimenez Lopez, M. (eds.) Biology, Computation and Linguistics, New Interdisciplinary Paradigms, pp. 71–85. IOS Press, Amsterdam (2011)
Mamitsuka, H., Abe, N.: Prediction of beta-sheet structures using stochastic tree grammars. In: Proceedings of Fifth Workshop on Genome Informatics, pp. 19–28. Universal Academy Press, Yokohama (1994)
Pan, T., Sosnick, T.: RNA folding during transcription. Annu. Rev. Biophys. Biomol. Struct. 35, 161–175 (2006)
Petre, I., Verlan, S.: Matrix insertion-deletion systems. Theoret. Comput. Sci. 456, 80–88 (2012)
Rivas, E., Eddy, S.R.: The language of RNA: a formal grammar that includes pseudoknots. Bioinformatics 16(4), 334–340 (2000)
Rozenberg, G., Salomaa, A.: Handbook of Formal Languages. Springer, New York (1996). doi:10.1007/978-3-642-59126-6
Searls, D.B.: Representing genetic information with formal grammars. In: Proceedings of the National Conference on Artificial Intelligence, Saint Paul, Minnesota, pp. 386–391 (1988)
Searls, D.B.: The computational linguistics of biological sequences. In: Hunter, L. (ed.) Artificial Intelligence and Molecular Biology, pp. 47–120. AAAI Press, Menlo Park (1993)
Searls, D.B.: Formal grammars for intermolecular structures. In: First International IEEE Symposium on Intelligence and Biological Systems, Washington, USA, pp. 30–37 (1995)
Searls, D.B.: The language of genes. Nature 420(6912), 211–217 (2002)
Uemura, Y., Hasegawa, A., Kobayashi, S., Yokomori, T.: TAG for RNA structure prediction. Theoret. Comput. Sci. 210(2), 277–303 (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Mahendran, A., Kuppusamy, L. (2017). Formal Language Representation and Modelling Structures Underlying RNA Folding Process. In: Arumugam, S., Bagga, J., Beineke, L., Panda, B. (eds) Theoretical Computer Science and Discrete Mathematics. ICTCSDM 2016. Lecture Notes in Computer Science(), vol 10398. Springer, Cham. https://doi.org/10.1007/978-3-319-64419-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-64419-6_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-64418-9
Online ISBN: 978-3-319-64419-6
eBook Packages: Computer ScienceComputer Science (R0)