Partial Materialized Views

Early access to partial query results is highly desirable during exploration of massive data sets. However, it is challenging to provide transactionally consistent, immediate partial results without significantly increasing queries’ run-to-completion time. To address this problem, this paper proposes a partial materialized view method to cache some of the most frequently accessed results rather than all the possible results. Compared to traditional materialized views, the proposed partial materialized views do not require maintenance during insertion into base relations, and have much smaller storage and maintenance overhead. Upon the arrival of a query, the RDBMS first searches the partial materialized view and returns to the user the cached partial results. Since a large portion of the partial materialized view is cached in memory, this usually finishes within a millisecond. Then the RDBMS continues to execute the query to find the remaining results. The proposed techniques can also be extended to rank query result tuples according to their popularity, which addresses the information overflow problem. The efficiency of our partial materialized view method is evaluated through a simulation study, a theoretical analysis, and an initial implementation in PostgreSQL.

By: Gang Luo

Published in: RC24089 in 2006

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

rc24089.pdf

Questions about this service can be mailed to reports@us.ibm.com .