Locally private estimation of conditional probability distribution for random forest in multimedia applications

Computing and Communications

Text available via DOI:

https://doi.org/10.1016/j.ins.2023.119111
Final published version

Keywords

Attribute perturbation, Distribution estimation, Multimedia applications, Privacy protection, Random decision tree

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Locally private estimation of conditional probability distribution for random forest in multimedia applications. / Wu, Xiaotong; Bilal, Muhammad; Xu, Xiaolong et al.
In: Information Sciences, Vol. 642, 119111, 30.09.2023.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Bibtex

@article{ff6d9b13bab548b1858a7c7776b018d0,

title = "Locally private estimation of conditional probability distribution for random forest in multimedia applications",

abstract = "The application of artificial intelligence models to raw multimedia data is susceptible to various data inference attacks, posing a significant risk in terms of sensitive input information leakage. Most of the existing studies on privacy-preserving multimedia applications based on artificial intelligence, focus on a single intelligent model and thus have various limitations. In this paper, we attempt to directly perturb the core component of many multimedia intelligent models in Bayesian networks and deep learning. That is, we apply conditional probability distribution estimation to guarantee the privacy of the models. At first, we present the formal problem formulation of private conditional probability distribution estimation and apply it to random forest for task classification in multimedia applications. Then, we design a simple perturbation approach called NAIVEPRIVDISTEST, to add noise to all the elements of probability estimation in random forest. Next, we present an improved approach called FASTLRG, that utilizes the taxonomy tree to discretize the continuous attributes, thereby combining the attribute features to improve the prediction accuracy of random forest. Finally, we perform extensive experiments to evaluate the performance of random forest based on the proposed estimation algorithms. The experimental results indicate that the proposed models have better performance compared with existing private decision trees.",

keywords = "Attribute perturbation, Distribution estimation, Multimedia applications, Privacy protection, Random decision tree",

author = "Xiaotong Wu and Muhammad Bilal and Xiaolong Xu and Houbing Song",

year = "2023",

month = sep,

day = "30",

doi = "10.1016/j.ins.2023.119111",

language = "English",

volume = "642",

journal = "Information Sciences",

issn = "0020-0255",

publisher = "Elsevier Inc.",

}

RIS

TY - JOUR

T1 - Locally private estimation of conditional probability distribution for random forest in multimedia applications

AU - Wu, Xiaotong

AU - Bilal, Muhammad

AU - Xu, Xiaolong

AU - Song, Houbing

PY - 2023/9/30

Y1 - 2023/9/30

N2 - The application of artificial intelligence models to raw multimedia data is susceptible to various data inference attacks, posing a significant risk in terms of sensitive input information leakage. Most of the existing studies on privacy-preserving multimedia applications based on artificial intelligence, focus on a single intelligent model and thus have various limitations. In this paper, we attempt to directly perturb the core component of many multimedia intelligent models in Bayesian networks and deep learning. That is, we apply conditional probability distribution estimation to guarantee the privacy of the models. At first, we present the formal problem formulation of private conditional probability distribution estimation and apply it to random forest for task classification in multimedia applications. Then, we design a simple perturbation approach called NAIVEPRIVDISTEST, to add noise to all the elements of probability estimation in random forest. Next, we present an improved approach called FASTLRG, that utilizes the taxonomy tree to discretize the continuous attributes, thereby combining the attribute features to improve the prediction accuracy of random forest. Finally, we perform extensive experiments to evaluate the performance of random forest based on the proposed estimation algorithms. The experimental results indicate that the proposed models have better performance compared with existing private decision trees.

AB - The application of artificial intelligence models to raw multimedia data is susceptible to various data inference attacks, posing a significant risk in terms of sensitive input information leakage. Most of the existing studies on privacy-preserving multimedia applications based on artificial intelligence, focus on a single intelligent model and thus have various limitations. In this paper, we attempt to directly perturb the core component of many multimedia intelligent models in Bayesian networks and deep learning. That is, we apply conditional probability distribution estimation to guarantee the privacy of the models. At first, we present the formal problem formulation of private conditional probability distribution estimation and apply it to random forest for task classification in multimedia applications. Then, we design a simple perturbation approach called NAIVEPRIVDISTEST, to add noise to all the elements of probability estimation in random forest. Next, we present an improved approach called FASTLRG, that utilizes the taxonomy tree to discretize the continuous attributes, thereby combining the attribute features to improve the prediction accuracy of random forest. Finally, we perform extensive experiments to evaluate the performance of random forest based on the proposed estimation algorithms. The experimental results indicate that the proposed models have better performance compared with existing private decision trees.

KW - Attribute perturbation

KW - Distribution estimation

KW - Multimedia applications

KW - Privacy protection

KW - Random decision tree

U2 - 10.1016/j.ins.2023.119111

DO - 10.1016/j.ins.2023.119111

M3 - Journal article

AN - SCOPUS:85159297048

VL - 642

JO - Information Sciences

JF - Information Sciences

SN - 0020-0255

M1 - 119111

ER -

Research

Links

Text available via DOI:

Keywords