Rights statement: The final publication is available at Springer via http://dx.doi.org/10.1007/s00521-021-06137-w
Accepted author manuscript, 760 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Final published version
Licence: CC BY: Creative Commons Attribution 4.0 International License
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Detecting and Learning from Unknown by Extremely Weak Supervision
T2 - eXploratory Classifier (xClass)
AU - Angelov, Plamen
AU - Almeida Soares, Eduardo
N1 - The final publication is available at Springer via http://dx.doi.org/10.1007/s00521-021-06137-w
PY - 2021/11/30
Y1 - 2021/11/30
N2 - In this paper, we break with the traditional approach to classification, which is regarded as a form of supervised learning. We offer a method and algorithm, which make possible fully autonomous (unsupervised) detection of new classes, and learning following a very parsimonious training priming (few labeled data samples only). Moreover, new unknown classes may appear at a later stage and the proposed xClass method and algorithm are able to successfully discover this and learn from the data autonomously. Furthermore, the features (inputs to the classifier) are automatically sub-selected by the algorithm based on the accumulated data density per feature per class. In addition, the automatically generated model is easy to interpret and is locally generative and based on prototypes which define the modes of the data distribution. As a result, a highly efficient, lean, human-understandable, autonomously self-learning model (which only needs an extremely parsimonious priming) emerges from the data. To validate our proposal, we approbated it on four challenging problems, including imbalanced Faces-1999 data base, Caltech-101 dataset, vehicles dataset, and iRoads dataset, which is a dataset of images of autonomous driving scenarios. Not only we achieved higher precision (in one of the problems outperforming by 25% all other methods), but, more significantly, we only used a single class beforehand, while other methods used all the available classes and we generated interpretable models with smaller number of features used, through extremely weak and weak supervision. We demonstrated the ability to detect and learn new classes for both images and numerical examples.
AB - In this paper, we break with the traditional approach to classification, which is regarded as a form of supervised learning. We offer a method and algorithm, which make possible fully autonomous (unsupervised) detection of new classes, and learning following a very parsimonious training priming (few labeled data samples only). Moreover, new unknown classes may appear at a later stage and the proposed xClass method and algorithm are able to successfully discover this and learn from the data autonomously. Furthermore, the features (inputs to the classifier) are automatically sub-selected by the algorithm based on the accumulated data density per feature per class. In addition, the automatically generated model is easy to interpret and is locally generative and based on prototypes which define the modes of the data distribution. As a result, a highly efficient, lean, human-understandable, autonomously self-learning model (which only needs an extremely parsimonious priming) emerges from the data. To validate our proposal, we approbated it on four challenging problems, including imbalanced Faces-1999 data base, Caltech-101 dataset, vehicles dataset, and iRoads dataset, which is a dataset of images of autonomous driving scenarios. Not only we achieved higher precision (in one of the problems outperforming by 25% all other methods), but, more significantly, we only used a single class beforehand, while other methods used all the available classes and we generated interpretable models with smaller number of features used, through extremely weak and weak supervision. We demonstrated the ability to detect and learn new classes for both images and numerical examples.
KW - Extremely weak supervision
KW - Novelty detection
KW - Interpretability
KW - Autonomously self-learning model
U2 - 10.1007/s00521-021-06137-w
DO - 10.1007/s00521-021-06137-w
M3 - Journal article
VL - 33
SP - 15145
EP - 15157
JO - Neural Computing and Applications
JF - Neural Computing and Applications
SN - 0941-0643
IS - 22
ER -