Indexing Continual Range Queries for Efficient Stream Processing

A large number of continual range queries could be issued against numerical data streams, such as stock prices, sensor readings, temperatures, and others. To efficiently process these long-running queries, only the potentially relevant queries should be evaluated against the data. We develop a virtual construct-based query indexing approach to efficiently identifying the range queries that match each data object in the streams. A set of virtual constructs, e.g., intervals in 1D space or rectangular regions in 2D space, are predefined such that it is efficient to find all the virtual constructs containing any given data object. Each virtual construct has a unique ID and an associated query ID list. The query index is built as follows. Each range query is first decomposed into one or more virtual constructs. The query ID is then inserted into the query ID lists associated with those decomposed virtual constructs. Search becomes extremely efficient. For a given data object, we first find all virtual constructs covering it. Then, we report the matched queries from the ID lists associated with the covering virtual constructs.

By: Kun-Lung Wu, Shyh-Kwei Chen, Philip S. Yu

Published in: RC23249 in 2004


This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.


Questions about this service can be mailed to .