A unified framework for event related information seeking

Author(s)Lu, Kuang
Date Accessioned2021-03-25T12:18:01Z
Date Available2021-03-25T12:18:01Z
Publication Date2020
SWORD Update2021-02-08T17:03:55Z
AbstractEvents of different types and scopes are happening every day and influencing nearly every aspect of people's life. In order to obtain information about the events to understand them, search is usually the tool to use. Therefore, it is crucial to provide adequate support for event related information. News and microblogs are two popular information sources to be searched. In this work, we focus on providing effective search techniques for event related queries. The challenges of event related information seeking on these two sources, as well as the inadequacy of the existing methods in terms of overcoming the challenges, are discussed. Based on this, we propose a unified framework to assist event related search interests on news and microblogs that not only includes novel techniques to address the challenges, but also bridge the retrieval on them to better satisfy the search interests, meaning to leverage background information from news to refine the results on microblogs. ☐ On news retrieval, we argue the importance of background information and that it is essential to provide dedicated background information retrieval support. Through analyzing background information from temporal and semantic perspective, a time filter and an aspect based background retrieval model are proposed. Experiments on the TREC News Track data set illustrate that these two methods can work in concert to provide statistically significant improvements over a competitive baseline. Further analysis demonstrates the usefulness of the individual components of the model, such as aspect identification and aspect language model estimation. ☐ On microblog retrieval, due to the lack of long term search interest support and the unsatisfactory pseudo relevance feedback performance, a novel relevance signal called query collectivity is proposed, which measures relevance by using the collective presence of multiple query terms. For long term search interests, this measure is used to detect the time periods (e.g. days) when there is no relevant information and informs the system to not return any results. However, when the measure is employed for pseudo relevance feedback, expansion terms are selected in a conservative fashion, and the measure is able to only select them for queries when there is a potential of improving the retrieve effectiveness. Experiment results on TREC Microblog data sets show that the measure achieves advantageous performances on both tasks. ☐ Besides concentrating on enhancing the retrieval for different genres individually for event related searches, we also propose to bridge the two types of retrieval by leveraging the background aspects from news to re-rank the initial retrieval results on microblogs on a per aspect basis to provide an option of exploring microblogs in finer granularity. We carefully design an experiment and its results suggest that the news aspects can represent meaningful event related search interests and, with the help of language models of the aspects that can be obtained from the background news retrieval, initial microblog results can be effectively re-ranked for the aspects.en_US
AdvisorFang, Hui
DegreePh.D.
DepartmentUniversity of Delaware, Department of Electrical and Computer Engineering
DOIhttps://doi.org/10.58088/tnvb-bb15
Unique Identifier1243269035
URLhttps://udspace.udel.edu/handle/19716/28860
Languageen
PublisherUniversity of Delawareen_US
URIhttps://login.udel.idm.oclc.org/login?url=https://www.proquest.com/dissertations-theses/unified-framework-event-related-information/docview/2492271022/se-2?accountid=10457
KeywordsUnified frameworken_US
KeywordsEvent relateden_US
KeywordsInformation seekingen_US
TitleA unified framework for event related information seekingen_US
TypeThesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Lu_udel_0060D_14402.pdf
Size:
2.16 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.22 KB
Format:
Item-specific license agreed upon to submission
Description: