Data Partitioning—Empirical Approach

Associated organisational units

Computing and Communications

Text available via DOI:

https://doi.org/10.1007/978-3-030-02384-3_7
Final published version

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Chapter (peer-reviewed) › peer-review

Published

Publication date	2019
Host publication	Empirical Approach to Machine Learning
Editors	Plamen Angelov, Xiaowei Gu
Publisher	Springer-Verlag
Pages	175-198
Number of pages	24
Volume	800
ISBN (print)	9783030023836
Original language	English

Publication series

Name	Studies in Computational Intelligence
Publisher	Springer
Volume	800
ISSN (Print)	1860-949X

Abstract

In this chapter, a new empirical approach, named autonomous data partitioning, is proposed to partition the data autonomously by creating a Voronoi tessellation around the objectively identified prototypes to form data clouds, which transform the large amount of raw data into a much smaller (manageable) number of more representative aggregations with semantic meaning. The proposed empirical algorithm has two forms/types, namely, the offline version and the evolving version. The offline version is based on the ranks of the observations in terms of their multimodal typicality values and local ensemble properties. The evolving version is for streaming data processing and works with the data density. It is able to start “from scratch”, but can create a hybrid with the offline version as well. Moreover, an algorithm is proposed to guarantee the local optimality of the autonomous data partitioning approach allowing the proposed approach to end up with a locally optimal structure of data clouds represented by their focal points/prototypes, which is then ready to be used for analysis, building a multi-model classifier, predictor, controller or for fault isolation. © 2019, Springer Nature Switzerland AG.
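
The idea of forming data clouds by a Voronoi tessellation around prototypes can be illustrated with a minimal sketch. It is not the chapter's algorithm: the prototypes are assumed to be given already (the typicality-based and density-based identification steps are not reproduced), Euclidean distance is assumed, and the refinement loop is a generic Lloyd-style iteration used here only to show what converging to a locally stable set of focal points can look like. The function names `form_data_clouds` and `refine_prototypes` are hypothetical.

```python
from math import dist


def form_data_clouds(points, prototypes):
    """Assign each point to its nearest prototype (its Voronoi cell)."""
    clouds = [[] for _ in prototypes]
    for p in points:
        nearest = min(range(len(prototypes)), key=lambda i: dist(p, prototypes[i]))
        clouds[nearest].append(p)
    return clouds


def refine_prototypes(points, prototypes, max_iter=100):
    """Lloyd-style refinement (an assumption, not the chapter's procedure):
    alternate nearest-prototype assignment and mean update until stable."""
    prototypes = [tuple(p) for p in prototypes]
    for _ in range(max_iter):
        clouds = form_data_clouds(points, prototypes)
        # Move each prototype to the mean of its cloud; keep it if the cloud is empty.
        updated = [
            tuple(sum(coord) / len(cloud) for coord in zip(*cloud)) if cloud else proto
            for cloud, proto in zip(clouds, prototypes)
        ]
        if updated == prototypes:
            break
        prototypes = updated
    return prototypes, form_data_clouds(points, prototypes)


# Toy data: two well-separated groups and one initial prototype in each.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (9.0, 9.0), (10.0, 9.0), (9.0, 10.0)]
prototypes, clouds = refine_prototypes(points, [(0.0, 0.0), (10.0, 9.0)])
```

In this toy run each prototype ends up at the mean of its own group and each cloud holds three points. The six raw observations are thus summarised by two focal points, which is the kind of reduction the abstract describes.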
