Adversarial Attack Detection via Fuzzy Predictions

Home > Research > Publications & Outputs > Adversarial Attack Detection via Fuzzy Predictions

Associated organisational units

Electronic data

TFS_final
Accepted author manuscript, 2.97 MB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

More...

<mark>Journal publication date</mark>	31/12/2024
<mark>Journal</mark>	IEEE Transactions on Fuzzy Systems
Issue number	12
Volume	32
Number of pages	10
Pages (from-to)	7015-7024
Publication Status	Published
Early online date	3/10/24
<mark>Original language</mark>	English

Abstract

Image processing using neural networks act as a tool to speed up predictions for users, specifically on large-scale image samples. To guarantee the clean data for training accuracy, various deep learning-based adversarial attack detection techniques have been proposed. These crisp set-based detection methods directly determine whether an image is clean or attacked, while, calculating the loss is non-differentiable and hinders training through normal back-propagation. Motivated by the recent success in fuzzy systems, in this work, we present an attack detection method to further improve detection performance, which is suitable for any pre-trained neural network classifier. Subsequently, the fuzzification network is used to obtain feature maps to produce fuzzy sets of difference degree between clean and attacked images. The fuzzy rules control the intelligence that determines the detection boundaries. Different from previous fuzzy systems, we propose a fuzzy mean-intelligence mechanism with new support and confidence functions to improve fuzzy rule's quality. In the defuzzification layer, the fuzzy prediction from the intelligence is mapped back into the crisp model predictions for images. The loss between the prediction and label controls the rules to train the fuzzy detector. We show that the fuzzy rule-based network learns rich feature information than binary outputs and offer to obtain an overall performance gain. Experiment results show that compared to various benchmark fuzzy systems and adversarial attack detection methods, our fuzzy detector achieves better detection performance over a wide range of images.

Research

Associated organisational units

Electronic data

Adversarial Attack Detection via Fuzzy Predictions

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us