Rights statement: This is the author’s version of a work that was accepted for publication in Knowledge-Based Systems. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Knowledge-Based Systems, 218, 2021 DOI: 10.1016/j.knosys.2021.106870
Accepted author manuscript, 710 KB, PDF document
Available under license: CC BY-NC-ND
Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Self-organizing fuzzy inference ensemble system for big streaming data classification
AU - Gu, X.
AU - Angelov, P.
AU - Zhao, Z.
N1 - This is the author’s version of a work that was accepted for publication in Knowledge-Based Systems. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Knowledge-Based Systems, 218, 2021 DOI: 10.1016/j.knosys.2021.106870
PY - 2021/4/22
Y1 - 2021/4/22
N2 - An evolving intelligent system (EIS) is able to self-update its system structure and meta-parameters from streaming data. However, since the majority of EISs are implemented on a single-model architecture, their performances on large-scale, complex data streams are often limited. To address this deficiency, a novel self-organizing fuzzy inference ensemble framework is proposed in this paper. As the base learner of the proposed ensemble system, the self-organizing fuzzy inference system is capable of self-learning a highly transparent predictive model from streaming data on a chunk-by-chunk basis through a human-interpretable process. Very importantly, the base learner can continuously self-adjust its decision boundaries based on the inter-class and intra-class distances between prototypes identified from successive data chunks for higher classification precision. Thanks to its parallel distributed computing architecture, the proposed ensemble framework can achieve great classification precision while maintain high computational efficiency on large-scale problems. Numerical examples based on popular benchmark big data problems demonstrate the superior performance of the proposed approach over the state-of-the-art alternatives in terms of both classification accuracy and computational efficiency.
AB - An evolving intelligent system (EIS) is able to self-update its system structure and meta-parameters from streaming data. However, since the majority of EISs are implemented on a single-model architecture, their performances on large-scale, complex data streams are often limited. To address this deficiency, a novel self-organizing fuzzy inference ensemble framework is proposed in this paper. As the base learner of the proposed ensemble system, the self-organizing fuzzy inference system is capable of self-learning a highly transparent predictive model from streaming data on a chunk-by-chunk basis through a human-interpretable process. Very importantly, the base learner can continuously self-adjust its decision boundaries based on the inter-class and intra-class distances between prototypes identified from successive data chunks for higher classification precision. Thanks to its parallel distributed computing architecture, the proposed ensemble framework can achieve great classification precision while maintain high computational efficiency on large-scale problems. Numerical examples based on popular benchmark big data problems demonstrate the superior performance of the proposed approach over the state-of-the-art alternatives in terms of both classification accuracy and computational efficiency.
KW - Ensemble system
KW - Evolving intelligent system
KW - Large-scale data stream
KW - Prototypes
KW - Benchmarking
KW - Computational efficiency
KW - Data streams
KW - Efficiency
KW - Fuzzy inference
KW - Intelligent systems
KW - Predictive analytics
KW - Classification accuracy
KW - Classification precision
KW - Decision boundary
KW - Evolving intelligent systems
KW - Large-scale problem
KW - Parallel/distributed computing
KW - Predictive modeling
KW - Self-organizing fuzzy
KW - Classification (of information)
U2 - 10.1016/j.knosys.2021.106870
DO - 10.1016/j.knosys.2021.106870
M3 - Journal article
VL - 218
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
SN - 0950-7051
M1 - 106870
ER -