Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
TY - JOUR
T1 - Locally private estimation of conditional probability distribution for random forest in multimedia applications
AU - Wu, Xiaotong
AU - Bilal, Muhammad
AU - Xu, Xiaolong
AU - Song, Houbing
PY - 2023/9/30
Y1 - 2023/9/30
N2 - The application of artificial intelligence models to raw multimedia data is susceptible to various data inference attacks, posing a significant risk of leaking sensitive input information. Most existing studies on privacy-preserving multimedia applications based on artificial intelligence focus on a single intelligent model and thus have various limitations. In this paper, we attempt to directly perturb the core component of many multimedia intelligent models in Bayesian networks and deep learning. That is, we apply conditional probability distribution estimation to guarantee the privacy of the models. First, we present the formal problem formulation of private conditional probability distribution estimation and apply it to random forest for task classification in multimedia applications. Then, we design a simple perturbation approach called NAIVEPRIVDISTEST that adds noise to all the elements of probability estimation in random forest. Next, we present an improved approach called FASTLRG that utilizes the taxonomy tree to discretize the continuous attributes, thereby combining the attribute features to improve the prediction accuracy of random forest. Finally, we perform extensive experiments to evaluate the performance of random forest based on the proposed estimation algorithms. The experimental results indicate that the proposed models perform better than existing private decision trees.
AB - The application of artificial intelligence models to raw multimedia data is susceptible to various data inference attacks, posing a significant risk of leaking sensitive input information. Most existing studies on privacy-preserving multimedia applications based on artificial intelligence focus on a single intelligent model and thus have various limitations. In this paper, we attempt to directly perturb the core component of many multimedia intelligent models in Bayesian networks and deep learning. That is, we apply conditional probability distribution estimation to guarantee the privacy of the models. First, we present the formal problem formulation of private conditional probability distribution estimation and apply it to random forest for task classification in multimedia applications. Then, we design a simple perturbation approach called NAIVEPRIVDISTEST that adds noise to all the elements of probability estimation in random forest. Next, we present an improved approach called FASTLRG that utilizes the taxonomy tree to discretize the continuous attributes, thereby combining the attribute features to improve the prediction accuracy of random forest. Finally, we perform extensive experiments to evaluate the performance of random forest based on the proposed estimation algorithms. The experimental results indicate that the proposed models perform better than existing private decision trees.
KW - Attribute perturbation
KW - Distribution estimation
KW - Multimedia applications
KW - Privacy protection
KW - Random decision tree
U2 - 10.1016/j.ins.2023.119111
DO - 10.1016/j.ins.2023.119111
M3 - Journal article
AN - SCOPUS:85159297048
VL - 642
JO - Information Sciences
JF - Information Sciences
SN - 0020-0255
M1 - 119111
ER -