TY - GEN
T1 - Syntactic query models for restatement retrieval
AU - Balasubramanian, Niranjan
AU - Allan, James
PY - 2009
Y1 - 2009
AB - We consider the problem of retrieving sentence-level restatements. Formally, we define restatements as sentences that contain all or some subset of the information present in a query sentence. Identifying restatements is useful for several applications, such as multi-document summarization, document provenance, text reuse, and novelty detection. Spurious partial matches and term dependence become important issues for restatement retrieval in these settings. To address these issues, we focus on query models that capture relative term importance and sequential term dependence. In this paper, we build query models using syntactic information such as subject-verb-objects and phrases. Our experimental results on two different collections show that syntactic query models are consistently more effective than purely statistical alternatives.
UR - https://www.scopus.com/pages/publications/70350630415
U2 - 10.1007/978-3-642-03784-9_14
DO - 10.1007/978-3-642-03784-9_14
M3 - Conference contribution
AN - SCOPUS:70350630415
SN - 3642037836
SN - 9783642037832
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 143
EP - 155
BT - String Processing and Information Retrieval - 16th International Symposium, SPIRE 2009, Proceedings
T2 - 16th International Symposium on String Processing and Information Retrieval, SPIRE 2009
Y2 - 25 August 2009 through 27 August 2009
ER -