Skip to main navigation Skip to search Skip to main content

A Scheduling Approach to Incremental Maintenance of Datalog Programs

  • Shikha Singh
  • , Sergey Madaminov
  • , Michael A. Bender
  • , Michael Ferdman
  • , Ryan Johnson
  • , Benjamin Moseley
  • , Hung Ngo
  • , Dung Nguyen
  • , Soeren Olesen
  • , Kurt Stirewalt
  • , Geoffrey Washburn
  • Williams College
  • Stony Brook University
  • Amazon.com, Inc.
  • Carnegie Mellon University
  • Relational AI
  • Infor Inc.

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

In this paper, we study the problem of incremental maintenance of Datalog programs and model it as a scheduling problem on DAGs. We design provably good time-and memory-efficient scheduling algorithms for (re)executing a Datalog program where some (but not necessarily all) of the inputs have changed. We prove that our schedulers, called LevelBased and LevelBased with lookahead, have asymptotically improved running time and space efficiency when compared with benchmark algorithms used in production at LogicBlox.The main result of the paper is a hybrid scheduler, which combines LevelBased with the production LogicBlox scheduler (or any other heuristic scheduler). The hybrid scheduler achieves strong worst-case guarantees and robustness without losing out on the best-case behavior of the production LogicBlox scheduler. Our experiments show that the hybrid scheduler results in similar or improved total execution times compared to LogicBlox scheduler, while consistently reducing the scheduling overhead-by as much as 50% on some datasets. This hybrid scheme requires little to no overhead but provides predictability and reliability, which are crucial in a commercial application such as LogicBlox.

Original languageEnglish
Title of host publicationProceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium, IPDPS 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages864-873
Number of pages10
ISBN (Electronic)9781728168760
DOIs
StatePublished - May 2020
Event34th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2020 - New Orleans, United States
Duration: May 18 2020May 22 2020

Publication series

NameProceedings - 2020 IEEE 34th International Parallel and Distributed Processing Symposium, IPDPS 2020

Conference

Conference34th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2020
Country/TerritoryUnited States
CityNew Orleans
Period05/18/2005/22/20

Keywords

  • DAG scheduling
  • databases
  • Datalog programs
  • incremental computing
  • incremental maintenance
  • LogicBlox
  • parallel task scheduling

Fingerprint

Dive into the research topics of 'A Scheduling Approach to Incremental Maintenance of Datalog Programs'. Together they form a unique fingerprint.

Cite this