Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Electrical and Electronic Engineering
  4. Electrical and Electronic Engineering PhD theses
  5. Associative transducers for the parallel processing of streaming data
 
  • Details
Associative transducers for the parallel processing of streaming data
File(s)
Ogden-P-2017-PhD-Thesis.pdf (1.73 MB)
Thesis
Author(s)
Ogden, Peter
Type
Thesis or dissertation
Abstract
Crowd-sourcing and the rise of the internet of things are causing a massive increase in the rate of streaming data that needs to be processed. At the same time CPU clock-speeds are stagnating so parallel algorithms are needed to process the high rate of data with low query response times. Automata and transducers are natural models for querying unbounded streams but are inherently sequential, processing each item of data in the stream in order. To continue to use automata and transducers for stream processing on increasing data rates, a new approach is needed which offers scaling to large numbers of cores.

This thesis introduces a new computational model of associative transducers which transforms the execution of a transducer into an associative operator, associativity which is then used to provide highly scalable data-parallelism. Associative transducers are backed by formal model that provides the theoretical basis for executing transducers in an associative manner and three applications demonstrate their use for processing textual, XML and geospatial data. Each use for associative transducers is individually evaluated against comparable systems, showing almost universal scaling to 64~cores and performance comparable to large MapReduce clusters or commercial database engines but with no need to load the data prior to querying. For geospatial queries in particular, a system based on associative transducers performs some queries three times faster than a comparable system running on a MapReduce cluster while using a third the number of cores.
Version
Open Access
Date Issued
2017-01
Date Awarded
2017-08
URI
http://hdl.handle.net/10044/1/50190
DOI
https://doi.org/10.25560/50190
Advisor
Thomas, David
Sponsor
BAE SYSTEMS (Firm) ; Engineering and Physical Sciences Research Council
Publisher Department
Electrical and Electronic Engineering
Publisher Institution
Imperial College London
Qualification Level
Doctoral
Qualification Name
Doctor of Philosophy (PhD)
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback