New Stopping Criteria for Segmenting DNA Sequences

Wentian Li
Phys. Rev. Lett. 86, 5815 – Published 18 June 2001
PDFExport Citation

Abstract

We propose a solution on the stopping criterion in segmenting inhomogeneous DNA sequences with complex statistical patterns. This new stopping criterion is based on Bayesian information criterion in the model selection framework. When this criterion is applied to telomere of S. cerevisiae and the complete sequence of E. coli, borders of biologically meaningful units were identified, and a more reasonable number of domains was obtained. We also introduce a measure called segmentation strength which can be used to control the delineation of large domains. The relationship between the average domain size and the threshold of segmentation strength is determined for several genome sequences.

  • Received 15 September 2000

DOI:https://doi.org/10.1103/PhysRevLett.86.5815

©2001 American Physical Society

Authors & Affiliations

Wentian Li

  • Laboratory of Statistical Genetics, Box 192, Rockefeller University, 1230 York Avenue, New York, New York 10021

References (Subscription Required)

Click to Expand
Issue

Vol. 86, Iss. 25 — 18 June 2001

Reuse & Permissions
Access Options
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review Letters

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×