Utilizing image information for biomedical document classification

Author(s): Li, Pengyuan
Date Accessioned: 2022-03-22T11:54:42Z
Date Available: 2022-03-22T11:54:42Z
Publication Date: 2021
SWORD Update: 2022-01-21T17:03:30Z
Abstract: Biomedical research findings are typically disseminated through publications. To simplify access to domain-specific knowledge while supporting the research community, several biomedical databases devote significant effort to manual curation of the literature. The first step in the biocuration process is to identify, within a large volume of publications, the articles relevant to the specific area on which the database is focused, which is a labor-intensive and slow process. Thus, automatically identifying publications that are relevant to a specific topic is one of the fundamental tasks toward expediting the biocuration process and, in turn, biomedical research. Current methods for categorization of biomedical documents focus on textual contents, typically extracted from the title and the abstract. Notably, images and captions are often used in publications to convey pivotal information about research processes, experiments and results. In this thesis, we explore means for utilizing and integrating image information into biomedical document classification. To do that, we first develop a new and effective system for extracting figures and their captions from biomedical publications. The vast majority of extracted figures are compound images consisting of multiple panels, where each individual panel potentially conveys a different type of information. In order to use the image information from each individual panel, we propose an efficient and effective method to separate those compound images into their constituent panels. Last, we introduce a new biomedical document classification scheme that uses information derived from images and captions, in addition to titles and abstracts.
Advisor: Shatkay, Hagit
Degree: Ph.D.
Department: University of Delaware, Department of Electrical and Computer Engineering
DOI: https://doi.org/10.58088/jeww-df02
Unique Identifier: 1304832406
URL: https://udspace.udel.edu/handle/19716/30673
Language: en
Publisher: University of Delaware
URI: https://login.udel.idm.oclc.org/login?url=https://www.proquest.com/dissertations-theses/utilizing-image-information-biomedical-document/docview/2629299157/se-2?accountid=10457
Keywords: Biomedical document analysis; Biocuration process; Image information
Title: Utilizing image information for biomedical document classification
Type: Thesis
Files
Original bundle
Name: Li_udel_0060D_14801.pdf
Size: 12.28 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 2.22 KB
Format: Item-specific license agreed upon to submission