If you are seeing this message, you may be experiencing temporary network problems. Please wait a few minutes and refresh the page. If the problem persists, you may wish to report it to your local Network Manager.
It is also possible that your web browser is not configured or not able to display style sheets. In this case, although the visual presentation will be degraded, the site should continue to be functional. We recommend using the latest version of Microsoft or Mozilla web browser to help minimise these problems.
Wiley InterScience | ||||||||||||||||
![]() FEBS JournalVolume 272 Issue 20, Pages 5101 - 5109 Published Online: 7 Oct 2005 Journal compilation © 2010 Federation of European Biochemical Societies Published on behalf of the Federation of European Biochemical Societies
Abstract | References | Full Text: HTML, PDF (Size: 151K) | Related Articles | Citation Tracking MINIREVIEW Protein database searches using compositionally adjusted substitution matrices Copyright 2005 FEBS KEYWORDS
blast
• BLOSUM • compositional adjustment • protein database searches • substitution matrices ABSTRACTAlmost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long-standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions. Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein–protein version of blast. (Received 25 May 2005, accepted 4 August 2005) |
|
|
|