Intra-File-Level Parallelism on XML Parsing: Framework and Performance Evaluation

XML parsing is one of critical operations influencing the performance of XML-based applications. To accelerate its performance on multi-core processor, this paper proposes a novel Intra-File-Level Parallelism (IFLP) framework. Given an XML document, an IFLP parser can split it into small fragments with a little overhead, and simultaneously parse these fragments with several independent processing elements. Finally, the parsing results of the fragments will be combined together. We emulate an IFLP parser based on the well-known Expat parser. Through some experiments and theoretical analysis, we show that in most cases, the IFLP parser can effectively reduce the parsing latency, especially for large documents (i.e. >100KB), the latency can be decreased close to 1/N (N is the number of processing elements).

By: Zhi Yong Liang; Yang Xu; Ya Bin Dang

Published in: HPCNCS 2008Worthington, OH, ISRST, p.67-74 in 2008

Please obtain a copy of this paper from your local library. IBM cannot distribute this paper externally.

Questions about this service can be mailed to reports@us.ibm.com .