BioC-compatible full-text passage detection for protein-protein interactions using extended dependency graph
Author(s) | Peng,Yifan | |
Author(s) | Arighi,Cecilia | |
Author(s) | Wu,Cathy H. | |
Author(s) | Vijay-Shanker,K. | |
Ordered Author | Yifan Peng, Cecilia Arighi, Cathy H. Wu and K. Vijay-Shanker | |
UD Author | Arighi, Cecilia Noemi; Wu, Cathy Huey-Hwa; Shanker, Vijay K | |
Date Accessioned | 2017-07-13T20:01:49Z | |
Date Available | 2017-07-13T20:01:49Z | |
Copyright Date | The Author(s) 2016 | |
Publication Date | 4/12/16 | |
Description | Publisher's PDF | |
Abstract | There has been a large growth in the number of biomedical publications that report experimental results. Many of these results concern detection of protein-protein interactions (PPI). In BioCreative V, we participated in the BioC task and Developmenteloped a PPI system to detect text passages with PPIs in the full-text articles. By adopting the BioC format, the output of the system can be seamlessly added to the biocuration pipeline with little effort required for the system integration. A distinctive feature of our PPI system is that it utilizes extended dependency graph, an intermediate level of representation that attempts to abstract away syntactic variations in text. As a result, we are able to use only a limited set of rules to extract PPI pairs in the sentences, and additional rules to detect additional passages for PPI pairs. For evaluation, we used the 95 articles that were provided for the BioC annotation task. We retrieved the unique PPIs from the BioGRID database for these articles and show that our system achieves a recall of 83.5%. In order to evaluate the detection of passages with PPIs, we further annotated Abstract and Results sections of 20 documents from the dataset and show that an f-value of 80.5% was obtained. To evaluate the generalizability of the system, we also conducted experiments on AIMed, a well-known PPI corpus. We achieved an f-value of 76.1% for sentence detection and an f-value of 64.7% for unique PPI detection. | |
Department | University of Delaware, Computer and Information Sciences University of Delaware, Center for Bioinformatics and Computational Biology | |
Citation | Peng, Y., Arighi, C., Wu, C. H., & Vijay-Shanker, K. (2016). BioC-compatible full-text passage detection for protein-protein interactions using extended dependency graph. Database-the Journal of Biological Databases and Curation, , baw072. doi:10.1093/database/baw072 | |
DOI | 10.1093/database/baw072 | |
ISSN | 1758-0463 | |
URL | http://udspace.udel.edu/handle/19716/21540 | |
Language | English | |
Publisher | Oxford University Press | |
dc.rights | CC BY 4.0 | |
dc.source | Database-the Journal of Biological Databases and Curation | |
dc.source.uri | https://academic.oup.com/database/article-lookup/doi/10.1093/database/baw072 | |
Title | BioC-compatible full-text passage detection for protein-protein interactions using extended dependency graph | |
Type | Article |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- BioC-compatible full-text passage detection for protein-protein interactions using extended dependency graph.pdf
- Size:
- 438.57 KB
- Format:
- Adobe Portable Document Format
- Description:
- Main article