STATS - A Point Access Method for Multidimensional Clusters
File(s)stats_point_access_cropped(1).pdf (368.28 KB)
Accepted version
Author(s)
Evagorou, Giannis
Heinis, Thomas
Type
Conference Paper
Abstract
The ubiquity of high-dimensional data in machine learning and data mining applications makes its efficient indexing and retrieval from main memory crucial. Frequently, these machine learning algorithms need to query specific characteristics of single multidimensional points. For example, given a clustered dataset, the cluster membership (CM) query retrieves the cluster to which an object belongs. To efficiently answer this type of query we have developed STATS, a novel main-memory index which scales to answer CM queries on increasingly big datasets. Current indexing methods are oblivious to the structure of clusters in the data, and we thus, develop STATS around the key insight that exploiting the cluster information when indexing and preserving it in the index will accelerate look up. We show experimentally that STATS outperforms known methods in regards to retrieval time and scales well with dataset size for any number of dimensions.
Editor(s)
Benslimane, D
Damiani, E
Grosky, WI
Hameurlain, A
Sheth, AP
Wagner, RR
Date Issued
2017
Date Acceptance
2017-08-01
Citation
Database and Expert Systems Applications - 28th International Conference, DEXA 2017, Lyon, France, August 28-31, 2017, Proceedings, Part I, 2017, pp.352-361
ISBN
978-3-319-64467-7
ISSN
0302-9743
Publisher
Springer Verlag
Start Page
352
End Page
361
Journal / Book Title
Database and Expert Systems Applications - 28th International Conference, DEXA 2017, Lyon, France, August 28-31, 2017, Proceedings, Part I
Copyright Statement
© 2017 Springer International Publishing AG.
Sponsor
Engineering & Physical Science Research Council (E
European Research Office
Identifier
https://doi.org/10.1007/978-3-319-64468-4
Grant Number
EP/N023242/1
720270
Source
DEXA 2017: Database and Expert Systems Applications
Subjects
08 Information And Computing Sciences
Artificial Intelligence & Image Processing
Publication Status
Published
Start Date
2017-08-28
Finish Date
2017-08-31
Coverage Spatial
Lyon, France
Date Publish Online
2017-08-01