Research output: Contribution to Journal/Magazine › Journal article › peer-review
TY - JOUR
T1 - Outside the box
T2 - an alternative data analytics framework
AU - Angelov, Plamen
PY - 2014/4
Y1 - 2014/4
N2 - In this paper, an alternative framework for data analytics is proposed, based on the spatially-aware concepts of eccentricity and typicality, which represent density and proximity in the data space. This approach is statistical, but differs from traditional probability theory, which is frequentist in nature. It also differs from belief- and possibility-based approaches as well as from deterministic first-principles approaches, although it can be seen as deterministic in the sense that it provides exactly the same result for the same data. It further differs from subjective expert-based approaches such as fuzzy sets. It can be used to detect anomalies and faults and to form clusters, classes, predictive models, and controllers. The main motivation for introducing the new typicality- and eccentricity-based data analytics (TEDA) is the fact that real processes of interest for data analytics, such as climate, economic and financial, electro-mechanical, biological, social and psychological processes, are often complex, uncertain and poorly known, but not purely random. Unlike purely random processes, such as throwing dice, tossing coins, drawing coloured balls from bowls and other games, real-life processes of interest violate the main assumptions that traditional probability theory requires. At the same time they are seldom deterministic (more precisely, they always have an uncertainty/noise component which is non-deterministic), and creating expert- and belief-based possibilistic models is cumbersome and subjective. Despite this, different groups of researchers and practitioners favour and use one of the above approaches, with probability theory being (perhaps) the most widely used.
The proposed new framework, TEDA, is a systematic methodology which does not require prior assumptions and can be used for the development of a range of methods for anomaly and fault detection, image processing, clustering, classification, prediction, control, filtering, regression, etc. In this paper, due to space limitations, only a few illustrative examples are provided, aiming at a proof of concept.
AB - In this paper, an alternative framework for data analytics is proposed, based on the spatially-aware concepts of eccentricity and typicality, which represent density and proximity in the data space. This approach is statistical, but differs from traditional probability theory, which is frequentist in nature. It also differs from belief- and possibility-based approaches as well as from deterministic first-principles approaches, although it can be seen as deterministic in the sense that it provides exactly the same result for the same data. It further differs from subjective expert-based approaches such as fuzzy sets. It can be used to detect anomalies and faults and to form clusters, classes, predictive models, and controllers. The main motivation for introducing the new typicality- and eccentricity-based data analytics (TEDA) is the fact that real processes of interest for data analytics, such as climate, economic and financial, electro-mechanical, biological, social and psychological processes, are often complex, uncertain and poorly known, but not purely random. Unlike purely random processes, such as throwing dice, tossing coins, drawing coloured balls from bowls and other games, real-life processes of interest violate the main assumptions that traditional probability theory requires. At the same time they are seldom deterministic (more precisely, they always have an uncertainty/noise component which is non-deterministic), and creating expert- and belief-based possibilistic models is cumbersome and subjective. Despite this, different groups of researchers and practitioners favour and use one of the above approaches, with probability theory being (perhaps) the most widely used.
The proposed new framework, TEDA, is a systematic methodology which does not require prior assumptions and can be used for the development of a range of methods for anomaly and fault detection, image processing, clustering, classification, prediction, control, filtering, regression, etc. In this paper, due to space limitations, only a few illustrative examples are provided, aiming at a proof of concept.
KW - non-traditional statistical learning
KW - anomaly detection
KW - data density
KW - proximity measures
KW - RDE
KW - data analytics
KW - data-driven approaches
KW - machine learning
U2 - 10.14313/JAMRIS_2-2014/16
DO - 10.14313/JAMRIS_2-2014/16
M3 - Journal article
VL - 8
SP - 29
EP - 35
JO - Journal of Automation, Mobile Robotics and Intelligent Systems
JF - Journal of Automation, Mobile Robotics and Intelligent Systems
SN - 1897-8649
IS - 2
ER -
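The abstract's central quantities can be illustrated concretely. Below is a minimal sketch of eccentricity and typicality as commonly defined in the TEDA literature: each point's eccentricity is its cumulative distance to all other points, normalized so that eccentricities sum to 2, and typicality is its complement. This uses Euclidean distance and batch computation for clarity; the exact recursive/streaming formulation in the paper may differ, so treat this as an assumption-laden illustration, not the paper's implementation.

```python
import numpy as np

def eccentricity(X):
    """Eccentricity of each row of X: cumulative proximity to all
    points, normalized so the eccentricities sum to 2."""
    # pairwise Euclidean distance matrix (k x k)
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    pi = d.sum(axis=1)           # cumulative proximity of each point
    return 2.0 * pi / pi.sum()   # sums to 2 by construction

def typicality(X):
    """Typicality is the complement of eccentricity."""
    return 1.0 - eccentricity(X)

# Three nearby points and one distant point: the distant point gets
# the highest eccentricity (lowest typicality), flagging it as atypical.
X = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0]])
xi = eccentricity(X)
tau = typicality(X)
```

In this example `xi[3]` is the largest entry, which is how an eccentricity threshold can serve as the anomaly/fault detector the abstract mentions, without any prior distributional assumptions.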