TY - JOUR
T1 - Cavity scaling
T2 - Automated refinement of cavity-aware in protein function prediction
AU - Chen, Brian Y.
AU - Bryant, Drew H.
AU - Fofanov, Viacheslav Y.
AU - Kristensen, David M.
AU - Cruess, Amanda E.
AU - Kimmel, Marek
AU - Lichtarge, Olivier
AU - Kavraki, Lydia E.
PY - 2007/4
Y1 - 2007/4
N2 - Algorithms for geometric and chemical comparison of protein substructure can be useful for many applications in protein function prediction. These motif matching algorithms identify matches of geometric and chemical similarity between well-studied functional sites, motifs, and substructures of functionally uncharacterized proteins, targets. For the purpose of function prediction, the accuracy of motif matching algorithms can be evaluated with the number of statistically significant matches to functionally related proteins, true positives (TPs), and the number of statistically insignificant matches to functionally unrelated proteins, false positives (FPs). Our earlier work developed cavity-aware motifs which use motif points to represent functionally significant atoms and C-spheres to represent functionally significant volumes. We observed that cavity-aware motifs match significantly fewer FPs than matches containing only motif points. We also observed that high-impact C-spheres, which significantly contribute to the reduction of FPs, can be isolated automatically with a technique we call Cavity Scaling. This paper extends our earlier work by demonstrating that C-spheres can be used to accelerate point-based geometric and chemical comparison algorithms, maintaining accuracy while reducing runtime. We also demonstrate that the placement of C-spheres can significantly affect the number of TPs and FPs identified by a cavity-aware motif. While the optimal placement of C-spheres remains a diffcult open problem, we compared two logical placement strategies to better understand C-sphere placement.
AB - Algorithms for geometric and chemical comparison of protein substructure can be useful for many applications in protein function prediction. These motif matching algorithms identify matches of geometric and chemical similarity between well-studied functional sites, motifs, and substructures of functionally uncharacterized proteins, targets. For the purpose of function prediction, the accuracy of motif matching algorithms can be evaluated with the number of statistically significant matches to functionally related proteins, true positives (TPs), and the number of statistically insignificant matches to functionally unrelated proteins, false positives (FPs). Our earlier work developed cavity-aware motifs which use motif points to represent functionally significant atoms and C-spheres to represent functionally significant volumes. We observed that cavity-aware motifs match significantly fewer FPs than matches containing only motif points. We also observed that high-impact C-spheres, which significantly contribute to the reduction of FPs, can be isolated automatically with a technique we call Cavity Scaling. This paper extends our earlier work by demonstrating that C-spheres can be used to accelerate point-based geometric and chemical comparison algorithms, maintaining accuracy while reducing runtime. We also demonstrate that the placement of C-spheres can significantly affect the number of TPs and FPs identified by a cavity-aware motif. While the optimal placement of C-spheres remains a diffcult open problem, we compared two logical placement strategies to better understand C-sphere placement.
KW - Protein function
KW - Protein structure
KW - Refinement of protein motifs
UR - http://www.scopus.com/inward/record.url?scp=34347406147&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34347406147&partnerID=8YFLogxK
U2 - 10.1142/s021972000700276x
DO - 10.1142/s021972000700276x
M3 - Article
C2 - 17589966
AN - SCOPUS:34347406147
SN - 0219-7200
VL - 5
SP - 353
EP - 382
JO - Journal of Bioinformatics and Computational Biology
JF - Journal of Bioinformatics and Computational Biology
IS - 2 A
ER -