Sciweavers

XSYM
2009
Springer

A Data Parallel Algorithm for XML DOM Parsing

Abstract. The Extensible Markup Language (XML) has become the de facto standard for information representation and interchange on the Internet. XML parsing is a core operation performed on an XML document for it to be accessed and manipulated. This operation is known to cause performance bottlenecks in applications and systems that process large volumes of XML data. We believe that parallelism is a natural way to boost performance. Leveraging multicore processors can offer a cost-effective solution, because future multicore processors will support hundreds of cores and will offer a high degree of parallelism in hardware. We propose a data parallel algorithm called ParDOM for XML DOM parsing that builds an in-memory tree structure for an XML document. ParDOM has two phases. In the first phase, an XML document is partitioned into chunks and parsed in parallel. In the second phase, partial DOM node tree structures created during the first phase are linked together (in parallel) to buil...
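The two phases described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' ParDOM implementation: every name here (`Node`, `split_chunks`, `parse_chunk`, `link`, `par_parse`) is hypothetical, the input is assumed to be simplified XML with no attributes, comments, CDATA, or declaration, no well-formedness checks are made, and text inside an element is simply concatenated. The linking phase here is sequential, whereas the abstract states that ParDOM links in parallel, and CPython threads only illustrate the structure rather than deliver a real speedup.

```python
import re
from concurrent.futures import ThreadPoolExecutor

# Simplified tokens: <tag>, </tag>, <tag/>, or a run of text (assumption: no attributes).
TOKEN = re.compile(r"<(/?)([A-Za-z_][\w.-]*)\s*(/?)>|([^<]+)")
CLOSE = object()  # marks a closing tag whose opening tag lies in an earlier chunk


class Node:
    def __init__(self, tag):
        self.tag, self.text, self.children = tag, "", []


def split_chunks(xml, n):
    """Cut xml into roughly n pieces, moving each cut forward to the next '<'."""
    size = max(1, len(xml) // n)
    cuts = [0]
    while cuts[-1] < len(xml):
        pos = xml.find("<", cuts[-1] + size)
        cuts.append(len(xml) if pos == -1 else pos)
    return [xml[a:b] for a, b in zip(cuts, cuts[1:])]


def parse_chunk(chunk):
    """Phase 1: parse one chunk alone into a partial tree.

    Returns (items, stack): items are the chunk's top-level nodes, stray text,
    and CLOSE markers; stack holds the nodes still open at the end of the chunk.
    """
    items, stack = [], []
    for closing, tag, self_closing, text in TOKEN.findall(chunk):
        if not tag:  # text token
            if text.strip():
                if stack:
                    stack[-1].text += text.strip()
                else:
                    items.append(text.strip())
        elif closing:
            if stack:
                stack.pop()
            else:
                items.append(CLOSE)
        else:
            node = Node(tag)
            (stack[-1].children if stack else items).append(node)
            if not self_closing:
                stack.append(node)
    return items, stack


def link(partials):
    """Phase 2 (sequential here): stitch the partial trees in document order."""
    root, stack = None, []
    for items, open_nodes in partials:
        for item in items:
            if item is CLOSE:
                if stack:
                    stack.pop()
            elif isinstance(item, Node):
                if stack:
                    stack[-1].children.append(item)
                else:
                    root = item
            elif stack:  # text belonging to an element opened in an earlier chunk
                stack[-1].text += item
        stack.extend(open_nodes)
    return root


def par_parse(xml, workers=4):
    chunks = split_chunks(xml, workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(parse_chunk, chunks))
    return link(partials)


def to_tuple(node):
    """Nested-tuple view of a tree, convenient for comparing results."""
    return (node.tag, node.text, [to_tuple(c) for c in node.children])
```

Calling `par_parse("<a><b>hi</b><c><d/>x</c></a>", workers=4)` splits the input at tag boundaries, so some chunks begin with closing tags whose openers were seen elsewhere; the `CLOSE` markers and the per-chunk open stack carry exactly the information the linking step needs to attach each partial tree to the right parent.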
Added 25 May 2010
Updated 25 May 2010
Type Conference
Year 2009
Where XSYM
Authors Bhavik Shah, Praveen Rao, Bongki Moon, Mohan Rajagopalan