The Discrete Wavelet Transform (DWT) at the heart of the JPEG 2000 image compression system operates on user-definable tiles of the image, whereas the Discrete Cosine Transform (DCT) used in JPEG operates on fixed-size blocks. This difference reduces artificial blocking effects but can severely stress the memory system. We examine the interaction of the DWT and the memory hierarchy, modify the structure of the DWT computation and the layout of the image data to improve cache and TLB locality, and demonstrate significant performance improvements over a baseline implementation. Our optimized DWT implementation exhibits speedups of up to 4X over the DWT in a JPEG 2000 reference implementation.
By: Siddhartha Chatterjee, Christopher D. Brooks
Published in: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), vol. 1, pp. 797-800. Piscataway, NJ: IEEE, 2002.
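As a rough illustration of the kind of restructuring the abstract describes, the sketch below computes one level of a vertical wavelet pass in two loop orders. It is not the authors' implementation: it assumes a simple Haar filter (JPEG 2000 itself uses the 5/3 and 9/7 filters), an even number of rows, and hypothetical function names. The first version finishes one column before starting the next, which in a row-major image touches a different row on every access; the second sweeps two adjacent rows at a time, so consecutive accesses stay within the same rows. Both produce the same coefficients. The memory benefit would only show up in a compiled implementation over contiguous arrays; here the point is just that the reordering is legal.

```python
def vertical_haar_by_column(img):
    """Baseline order: finish one column before moving to the next.

    img is a list of rows (row-major); the number of rows is assumed even.
    """
    rows, cols = len(img), len(img[0])
    half = rows // 2
    out = [[0.0] * cols for _ in range(rows)]
    for c in range(cols):
        for k in range(half):
            a, b = img[2 * k][c], img[2 * k + 1][c]
            out[k][c] = (a + b) / 2.0          # low-pass (average)
            out[half + k][c] = (a - b) / 2.0   # high-pass (difference)
    return out


def vertical_haar_by_row_pair(img):
    """Reordered: process two adjacent rows at a time, left to right."""
    rows, cols = len(img), len(img[0])
    half = rows // 2
    out = [[0.0] * cols for _ in range(rows)]
    for k in range(half):
        top, bottom = img[2 * k], img[2 * k + 1]
        low, high = out[k], out[half + k]
        for c in range(cols):
            low[c] = (top[c] + bottom[c]) / 2.0
            high[c] = (top[c] - bottom[c]) / 2.0
    return out


if __name__ == "__main__":
    image = [[1, 2], [3, 4], [5, 6], [7, 8]]
    print(vertical_haar_by_column(image) == vertical_haar_by_row_pair(image))
```

In both versions the low-pass rows are written to the top half of the output and the high-pass rows to the bottom half, which is one common sub-band layout; other layouts are possible.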