TY - GEN
T1 - Intrinsic evaluation of text mining tools may not predict performance on realistic tasks
AU - Caporaso, J. Gregory
AU - Deshpande, Nita
AU - Fink, J. Lynn
AU - Bourne, Philip E.
AU - Cohen, K. Bretonnel
AU - Hunter, Lawrence
PY - 2008
Y1 - 2008
N2 - Biomedical text mining and other automated techniques are beginning to achieve performance which suggests that they could be applied to aid database curators. However, few studies have evaluated how these systems might work in practice. In this article we focus on the problem of annotating mutations in Protein Data Bank (PDB) entries, and evaluate the relationship between performance of two automated techniques, a text-mining-based approach (MutationFinder) and an alignment-based approach, in intrinsic versus extrinsic evaluations. We find that high performance on gold standard data (an intrinsic evaluation) does not necessarily translate to high performance for database annotation (an extrinsic evaluation). We show that this is in part a result of lack of access to the full text of journal articles, which appears to be critical for comprehensive database annotation by text mining. Additionally, we evaluate the accuracy and completeness of manually annotated mutation data in the PDB, and find that it is far from perfect. We conclude that currently the most cost-effective and reliable approach for database annotation might incorporate manual and automatic annotation methods.
AB - Biomedical text mining and other automated techniques are beginning to achieve performance which suggests that they could be applied to aid database curators. However, few studies have evaluated how these systems might work in practice. In this article we focus on the problem of annotating mutations in Protein Data Bank (PDB) entries, and evaluate the relationship between performance of two automated techniques, a text-mining-based approach (MutationFinder) and an alignment-based approach, in intrinsic versus extrinsic evaluations. We find that high performance on gold standard data (an intrinsic evaluation) does not necessarily translate to high performance for database annotation (an extrinsic evaluation). We show that this is in part a result of lack of access to the full text of journal articles, which appears to be critical for comprehensive database annotation by text mining. Additionally, we evaluate the accuracy and completeness of manually annotated mutation data in the PDB, and find that it is far from perfect. We conclude that currently the most cost-effective and reliable approach for database annotation might incorporate manual and automatic annotation methods.
UR - http://www.scopus.com/inward/record.url?scp=40549141170&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=40549141170&partnerID=8YFLogxK
M3 - Conference contribution
C2 - 18229722
AN - SCOPUS:40549141170
SN - 9812776087
SN - 9789812776082
T3 - Pacific Symposium on Biocomputing 2008, PSB 2008
SP - 640
EP - 651
BT - Pacific Symposium on Biocomputing 2008, PSB 2008
T2 - 13th Pacific Symposium on Biocomputing, PSB 2008
Y2 - 4 January 2008 through 8 January 2008
ER -