Skip to main navigation Skip to search Skip to main content

Selection of views to materialize in a data warehouse

  • Kirusa Inc.

Research output: Contribution to journalArticlepeer-review

197 Scopus citations

Abstract

A data warehouse stores materialized views of data from one or more sources, with the purpose of efficiently implementing decision-support or OLAP queries. One of the most important decisions in designing a data warehouse is the selection of materialized views to be maintained at the warehouse. The goal is to select an appropriate set of views that minimizes total query response time and the cost of maintaining the selected views, given a limited amount of resource, e.g., materialization time, storage space, etc. In this article, we have developed a theoretical framework for the general problem of selection of views in a data warehouse. We present polynomial-time heuristics for a selection of views to optimize total query response time under a disk-space constraint, for some important special cases of the general data warehouse scenario, viz.: 1) an AND view graph, where each query/view has a unique evaluation, e.g., when a multiple-query optimizer can be used to general a global evaluation plan for the queries, and 2) an OR view graph, in which any view can be computed from any one of its related views, e.g., data cubes. We present proofs showing that the algorithms are guaranteed to provide a solution that is fairly close to (within a constant factor ratio of) the optimal solution. We extend our heuristic to the general AND-OR view graphs. Finally, we address in detail the view-selection problem under the maintenance cost constraint and present provably competitive heuristics.

Original languageEnglish
Pages (from-to)24-43
Number of pages20
JournalIEEE Transactions on Knowledge and Data Engineering
Volume17
Issue number1
DOIs
StatePublished - Jan 2005

Keywords

  • Data warehouse
  • Materialization
  • View selection
  • Views

Fingerprint

Dive into the research topics of 'Selection of views to materialize in a data warehouse'. Together they form a unique fingerprint.

Cite this