Article ID Journal Published Year Pages File Type
1960047 Biophysical Journal 2005 10 Pages PDF
Abstract

We are interested in applying the principles of information theory to structural biology calculations. In this article, we explore the information content of an important computational procedure: sequence alignment. Using a reference state developed from exhaustive sequences, we measure alignment statistics and evaluate gap penalties based on first-principle considerations and gap distributions. We show that there are different gap penalties for different alphabet sizes and that the gap penalties can depend on the length of the sequences being aligned. In a companion article, we examine the information content of molecular force fields.

Related Topics
Life Sciences Biochemistry, Genetics and Molecular Biology Biochemistry
Authors
, ,