
Scalable XML query processing using parallel pushdown transducers

File: VLDB_2013.pdf
Description: Accepted version
Size: 565.78 kB
Format: Adobe PDF
Title: Scalable XML query processing using parallel pushdown transducers
Authors: Ogden, P
Thomas, DBJ
Pietzuch, PR
Item Type: Journal Article
Abstract: In online social networking, network monitoring and financial applications, there is a need to query high rate streams of XML data, but methods for executing individual XPath queries on streaming XML data have not kept pace with multicore CPUs. For data-parallel processing, a single XML stream is typically split into well-formed fragments, which are then processed independently. Such an approach, however, introduces a sequential bottleneck and suffers from low cache locality, limiting its scalability across CPU cores. We describe a data-parallel approach for the processing of streaming XPath queries based on pushdown transducers. Our approach permits XML data to be split into arbitrarily sized chunks, with each chunk processed by a parallel automaton instance. Since chunks may be malformed, our automata consider all possible starting states for XML elements and build mappings from starting to finishing states. These mappings can be constructed independently for each chunk by different CPU cores. For streaming queries from the XPathMark benchmark, we show a processing throughput of 2.5 GB/s, with near linear scaling up to 64 CPU cores. © 2013 VLDB Endowment.
Issue Date: 1-Sep-2013
Date of Acceptance: 1-Sep-2013
URI: http://hdl.handle.net/10044/1/24043
DOI: http://doi.org/10.14778/2556549.2556558
ISSN: 2150-8097
Publisher: Very Large Database Endowment
Start Page: 1738
End Page: 1749
Journal / Book Title: Proceedings of the VLDB Endowment
Volume: 6
Issue: 14
Copyright Statement: © 2013 VLDB Endowment, 2150-8097/13/14.
Appears in Collections: Faculty of Engineering
Computing
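
Illustrative sketch: The abstract's central idea, building a mapping from every possible starting state to a finishing state for each chunk, can be illustrated with a much simpler machine than the paper's pushdown transducers. The Python sketch below is not the authors' implementation: it uses a hypothetical three-state finite automaton (no stack, no XPath matching) that merely counts closed tags, and all names in it are assumptions made for illustration. It shows why a chunk cut at an arbitrary position can still be processed without knowing what preceded it.

```python
# Hypothetical lexical states; the paper's automata are pushdown transducers,
# which this simplified sketch does not model.
TEXT, TAG, QUOTED = "text", "tag", "quoted"
STATES = (TEXT, TAG, QUOTED)


def step(state, ch):
    """Return (next_state, tags_closed) after reading one character."""
    if state == TEXT:
        return (TAG, 0) if ch == "<" else (TEXT, 0)
    if state == TAG:
        if ch == '"':
            return (QUOTED, 0)
        if ch == ">":
            return (TEXT, 1)
        return (TAG, 0)
    # QUOTED: a '>' inside an attribute value does not close the tag.
    return (TAG, 0) if ch == '"' else (QUOTED, 0)


def run(state, chunk):
    """Run the automaton over a chunk from a known starting state."""
    count = 0
    for ch in chunk:
        state, closed = step(state, ch)
        count += closed
    return state, count


def chunk_mapping(chunk):
    """Map every possible starting state to (finishing state, tags closed).

    The starting state is unknown when a chunk begins mid-document, so all
    candidates are evaluated. Each call depends only on its own chunk.
    """
    return {s: run(s, chunk) for s in STATES}


def split(data, size):
    """Cut the input into fixed-size chunks, ignoring XML structure."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def combine(mappings, start=TEXT):
    """Chain the per-chunk mappings in document order from the true start state."""
    state, total = start, 0
    for mapping in mappings:
        state, count = mapping[state]
        total += count
    return state, total


doc = '<a href="x>y"><b>text</b></a>'
mappings = [chunk_mapping(c) for c in split(doc, 4)]  # independent per chunk
result = combine(mappings)
```

Here the calls to `chunk_mapping` share no state, so they could be dispatched to separate cores; only the short `combine` pass is sequential. The cost of this design is that each chunk is scanned once per candidate starting state.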