Geometric approach to string analysis for biosequence classification

Boris Brimkov

doi:10.1515/jib-2014-252

Open Access. Published by De Gruyter, October 18, 2016

From the journal Journal of Integrative Bioinformatics

Summary

Tools that effectively analyze and compare sequences are of great importance in various areas of applied computational research, especially in the framework of molecular biology. In the present paper, we introduce simple geometric criteria based on the notion of string linearity and use them to compare DNA sequences of various organisms, as well as to distinguish them from random sequences. Several other theoretical and statistical results are outlined as well.

Our experiments reveal a substantial difference between biosequences and random sequences: the former deviate from linearity much more than the latter. They also show a general trend of increasing deviation from linearity from primitive to biologically complex organisms.

Published Online: 2016-10-18

Published in Print: 2014-12-01

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
