The discipline of Algebraic Dynamic Programming is a powerful method to design and implement versatile pattern matching algorithms on sequences; here we consider mixed sequence and secondary structure motifs in RNA. A recurring challenge when designing new pattern matchers is to provide a statistical analysis of pattern significance. We demonstrate that by the use of so-called canonical pattern descriptions, the expected number of hits on a sequence of length n can be computed a priori, using the pattern matcher itself. This provides a systematic way to calibrate the specificity of pattern matching algorithms. The technique is exemplified by examples using IRE and SECIS elements.
Licensed
Unlicensed
Requires Authentication
Matching and Significance Evaluation of Combined Sequence-Structure Motifs in RNA
From the journal Zeitschrift für Physikalische Chemie
Showing a limited preview of this publication:
Published Online: 2008-04-21
Published in Print: 2002-02