Graph-based data clustering via multiscale community detection
File(s)s41109-019-0248-7.pdf (1.54 MB)
Published version
Author(s)
Liu, Zijing
Barahona, Mauricio
Type
Journal Article
Abstract
We present a graph-theoretical approach to data clustering, which combines the creation of a graph from the data with Markov Stability, a multiscale community detection framework. We show how the multiscale capabilities of the method allow the estimation of the number of clusters, as well as alleviating the sensitivity to the parameters in graph construction. We use both synthetic and benchmark real datasets to compare and evaluate several graph construction methods and clustering algorithms, and show that multiscale graph-based clustering achieves improved performance compared to popular clustering methods without the need to set externally the number of clusters.
Date Issued
2020-01-08
Date Acceptance
2019-12-16
Citation
Applied Network Science, 2020, 5 (3), pp.1-20
ISSN
2364-8228
Publisher
SpringerOpen
Start Page
1
End Page
20
Journal / Book Title
Applied Network Science
Volume
5
Issue
3
Copyright Statement
© The Author(s). 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0
International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and
reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the
Creative Commons license, and indicate if changes were made.
International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and
reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the
Creative Commons license, and indicate if changes were made.
Sponsor
Engineering & Physical Science Research Council (EPSRC)
Identifier
http://arxiv.org/abs/1909.04491v1
Grant Number
EP/N014529/1
Subjects
cs.IR
cs.IR
cs.LG
physics.data-an
stat.ML
Notes
16 pages, 5 figures
Publication Status
Published
Date Publish Online
2020-01-08