Extraction of knowledge for micrornas and genes: extracting connections through association, involvement, and regulation

dc.contributor.authorGupta, Samir
dc.date.accessioned2021-11-17T18:23:43Z
dc.date.available2021-11-17T18:23:43Z
dc.date.issued2021
dc.date.updated2021-08-09T22:12:15Z
dc.description.abstractBiological entities such as genes, proteins, and microRNAs are critical players in various biological processes and diseases. The role of these entities on biological processes and diseases forms a significant part of biomedical knowledge bases. However, a large portion of this information is buried in scientific literature as unstructured text. This work is motivated by our belief that the development of relation extraction systems that capture the roles of such entities on different processes and diseases from literature is important and much needed. We hypothesize that connections between biological entities and concepts as stated in text can be captured by the extraction of a small number of relations, which we call CAIR relations: Connections through Association, Involvement, and Regulation. ☐ We have developed a general relation extraction framework that reduces the effort required for developing individual relation extraction (RE) systems. This framework is based on a structured representation called Extended Dependency Graph (EDG), which utilizes syntactic dependencies and information beyond syntax to capture thematic dependencies. Based on this framework, we developed a general CAIR relation extraction system to connect a bio-entity to associated concepts. To demonstrate the wide applicability of CAIR relations and the framework, we have developed several RE systems and text-mining applications including miRiaD, a tool to extract the role of microRNAs in diseases, and Phos2X, a tool to extract the functional impact of protein phosphorylation. Additionally, as a continuation of miRiaD development, we also developed DEXTER, a tool to extract microRNA’s differential expression level information in diseases, which covers a different aspect of microRNA-disease associations. Such differential expression statements are stated through comparative sentences, comparing expression levels in two different samples. Thus, we have developed a general system to identify comparison sentences and extract the various components (compared aspect, compared entities/scenarios, and scale of the comparison). Additionally, we extended DEXTER to also extract gene expression information. All the tools we have developed have been evaluated by comparing with human annotations and show high precision and recall.en_US
dc.description.advisorShanker, Vijay K.
dc.description.degreePh.D.
dc.description.departmentUniversity of Delaware, Department of Computer and Information Sciences
dc.identifier.doihttps://doi.org/10.58088/xn3m-v820
dc.identifier.unique1285525614
dc.identifier.urihttps://udspace.udel.edu/handle/19716/29411
dc.language.rfc3066en
dc.publisherUniversity of Delawareen_US
dc.relation.urihttps://login.udel.idm.oclc.org/login?url=https://www.proquest.com/dissertations-theses/extraction-knowledge-micrornas-genes-extracting/docview/2572616171/se-2
dc.subjectBioNLP
dc.subjectmicroRNA
dc.subjectNatural Language Processing
dc.subjectRelation Extraction
dc.subjectText mining
dc.titleExtraction of knowledge for micrornas and genes: extracting connections through association, involvement, and regulationen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Gupta_udel_0060D_14543.pdf
Size:
1.65 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.22 KB
Format:
Item-specific license agreed upon to submission
Description: