CSIR Central

Statistical analysis of counts and spacing of consistent repeating patterns in a set of homologous DNA sequences

IR@NEERI: National Environment Engineering Research Institute

View Archive Info
 
 
Field Value
 
Title Statistical analysis of counts and spacing of consistent repeating patterns in a set of homologous DNA sequences
 
Creator Raje, D V
Purohit, H J
Lijnzaad, P
Singh, R N
 
Subject Environmental Biotechnology
 
Description Unusual patterns in nucleic acid or protein sequences are often suspected for their biological relevance. Repeating patterns of nucleotides are one such type and are typically searched in large genome sequences. In this exercise, our interest is to look for repeating patterns, which are conserved in a set of homologous DNA sequences, not only in terms of their counts/occurrences,but also their spacing/seperating distances. We refer to such patterns as consistent repeating patterns. It becomes desirable to know the probability of multiple occurence of pattern in sequences and whether the spacing due to occurrences of pattern in the sequence exhibits any statistically significant property. The information derived through statistical analysis may help in planning experiments or even raise new queries that may require attention to better understand the molecular mechanisms. A case study with four hundred 16S rDNA sequences resulted into nine most consistent repeaing patterns. The statistical significance of counts of these patterns was studied using Poisson approximation. The spacing analysis of patterns was carried with recourse to uniform probability distribution. The analysis revealed that most of the patterns show significant clustering, with one pattern occuring thrice and evenly dispersed in a sequence. The significance of occurrence and spacing of repeating patterns raised a few queries explanation, perhaps through experimentation.
 
Date 2006-09-25
 
Type Article
PeerReviewed
 
Format application/pdf
 
Identifier http://neeri.csircentral.net/23/1/Raje1.pdf
Raje, D V and Purohit, H J and Lijnzaad, P and Singh, R N (2006) Statistical analysis of counts and spacing of consistent repeating patterns in a set of homologous DNA sequences. Current Science, 91 (6). pp. 789-795.
 
Relation http://neeri.csircentral.net/23/