Skip to main navigation Skip to search Skip to main content

Beyond ranked lists in web search: Aggregating web content into topic pages

  • Microsoft USA

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

We investigate the automatic generation of topic pages as an alternative to the current Web search paradigm. Topic pages explicitly aggregate information across documents, filter redundancy, and promote diversity of topical aspects. We propose a novel framework for building rich topical aspect models and selecting diverse information from the Web. In particular, we use Web search logs to build aspect models with various degrees of specificity, and then employ these aspect models as input to a sentence selection method that identifies relevant and non-redundant sentences from the Web. Automatic and manual evaluations on biographical topics show that topic pages built by our system compare favorably to regular Web search results and to MDS-style summaries of the Web results on all metrics employed.

Original languageEnglish
Pages (from-to)509-534
Number of pages26
JournalInternational Journal of Semantic Computing
Volume4
Issue number4
DOIs
StatePublished - Dec 1 2010

Keywords

  • aspect model
  • query log
  • topic page
  • Web search

Fingerprint

Dive into the research topics of 'Beyond ranked lists in web search: Aggregating web content into topic pages'. Together they form a unique fingerprint.

Cite this