Computational Molecular Biology Lecture - Protein Sequence Database Searches

4pm Wednesday, April 9, 2008
Swig Boardroom
115 Waterman Street, room 241
Providence, RI 02912
Google Map

"Protein Sequence Database Searches Using Compositionally Adjusted Amino Acid Substitution Matrices"

Stephen Altschul, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health

...To what extent are such adjusted matrices of utility for general purpose protein database searches? Using standard test platforms, we compared a standard matrix to compositionally-adjusted matrices, with relative entropy left unconstrained, or constrained in various ways. We found that constraining the relative entropy of the compositionally adjusted matrix to a fixed value in the new compositional context generally produced the best results. We also found that if the sequences compared are not known to have strong compositional biases, then it is still on average advantageous to use an adjusted matrix when the sequences satisfy certain simple length or compositional inequalities. Applying these findings to general-purpose database searches can lead to a significant improvement in retrieval performance, with a minimal increase in execution time.

Related Items (2)

Companies / Organizations (2)

Is this your event?
Please help us to keep the information accurate and up-to-date.
Email us your edits, additions, and deletions. Thank you.