TY - GEN
T1 - Algorithms for structural comparison and statistical analysis of 3D protein motifs
AU - Chen, Brian Y.
AU - Fofanov, Viacheslav Y.
AU - Kristensen, David M.
AU - Kimmel, Marek
AU - Lichtarge, Olivier
AU - Kavraki, Lydia E.
PY - 2005
Y1 - 2005
N2 - The comparison of structural subsites in proteins is increasingly relevant to the prediction of their biological function. To address this problem, we present the Match Augmentation algorithm (MA). Given a structural motif of interest, such as a functional site, MA searches a target protein structure for a match: the set of atoms with the greatest geometric and chemical similarity. MA is extremely efficient because it exploits the fact that the amino acids in a structural motif are not equally important to function. Using motif residues ranked on functional significance via the Evolutionary Trace (ET), MA prioritizes its search by initially forming matches with functionally significant residues, then, guided by ET, it augments this partial match stepwise until the whole motif is found. With this hierarchical strategy, MA runs considerably faster than other methods, and almost always identifies matches in homologs known to have cognate functional sites. Second, in order to interpret matches, we further introduce a statistical method using nonparametric density estimation of the frequency distribution of structural matches. Our results show that the hierarchy of functional importance within structural motifs speeds up the search within targets, and points to a new method to score their statistical significance.
AB - The comparison of structural subsites in proteins is increasingly relevant to the prediction of their biological function. To address this problem, we present the Match Augmentation algorithm (MA). Given a structural motif of interest, such as a functional site, MA searches a target protein structure for a match: the set of atoms with the greatest geometric and chemical similarity. MA is extremely efficient because it exploits the fact that the amino acids in a structural motif are not equally important to function. Using motif residues ranked on functional significance via the Evolutionary Trace (ET), MA prioritizes its search by initially forming matches with functionally significant residues, then, guided by ET, it augments this partial match stepwise until the whole motif is found. With this hierarchical strategy, MA runs considerably faster than other methods, and almost always identifies matches in homologs known to have cognate functional sites. Second, in order to interpret matches, we further introduce a statistical method using nonparametric density estimation of the frequency distribution of structural matches. Our results show that the hierarchy of functional importance within structural motifs speeds up the search within targets, and points to a new method to score their statistical significance.
UR - http://www.scopus.com/inward/record.url?scp=15944389002&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=15944389002&partnerID=8YFLogxK
M3 - Conference contribution
C2 - 15759639
AN - SCOPUS:15944389002
SN - 9812560467
SN - 9789812560469
T3 - Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005
SP - 334
EP - 345
BT - Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005
T2 - 10th Pacific Symposium on Biocomputing, PSB 2005
Y2 - 4 January 2005 through 8 January 2005
ER -