Open Access Methodology

Clustering datasets by complex networks analysis

Giuliano Armano and Marco Alberto Javarone*

Author Affiliations

DIEE ‐ Department of Electrical and Electronic Engineering, University of Cagliari, Piazza d’Armi, Cagliari ‐ 09123, Italy

Complex Adaptive Systems Modeling 2013, 1:5  doi:10.1186/2194-3206-1-5

Published: 13 March 2013

Abstract

This paper proposes a method, based on complex networks analysis, for clustering multidimensional datasets. The method maps the elements of the dataset at hand to a weighted network according to the similarity among them. Network weights are computed by transforming the Euclidean distances between data points according to a Gaussian model. This model depends on a parameter that controls the shape of the Gaussian function. Running the Gaussian transformation with different values of the parameter makes it possible to perform multiresolution analysis, which provides information about the number of clusters expected to be optimal or suboptimal.

Solutions obtained by running the proposed method on simple synthetic datasets made it possible to identify a recurrent pattern, which was also found in more complex datasets, both synthetic and real.
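As an illustration of the weighting step described in the abstract, the following minimal Python sketch maps pairwise Euclidean distances to network weights with a Gaussian function. The abstract does not give the exact form of the model, so the expression exp(-d^2 / (2 sigma^2)), the function name gaussian_weights, the parameter name sigma, the removal of self-loops, and the toy dataset are all assumptions made for this example, not the authors' implementation.

```python
import numpy as np


def gaussian_weights(points, sigma):
    """Return a weight matrix built from pairwise Euclidean distances.

    Assumed form (not taken from the paper): w_ij = exp(-d_ij^2 / (2 * sigma^2)).
    """
    points = np.asarray(points, dtype=float)
    # Pairwise differences via broadcasting: shape (n, n, dims).
    diff = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    weights = np.exp(-dist ** 2 / (2.0 * sigma ** 2))
    # Assumption: the network has no self-loops.
    np.fill_diagonal(weights, 0.0)
    return weights


# Hypothetical toy dataset: two tight groups of two points each.
data = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])

# Varying sigma changes the resolution at which the groups appear connected.
for sigma in (0.1, 1.0, 10.0):
    w = gaussian_weights(data, sigma)
    print(f"sigma={sigma}: within-group={w[0, 1]:.3g}, between-group={w[0, 2]:.3g}")
```

With a small sigma only very close points receive a non-negligible weight, while a large sigma also connects distant points. Scanning the parameter in this way is one plausible reading of the multiresolution analysis mentioned above; the community detection step that would follow is not shown here.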

Keywords:
Clustering; Community detection; Complex networks; Multiresolution analysis