Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
TY - GEN
T1 - NodeGuard: A Highly Efficient Two-Party Computation Framework for Training Large-Scale Gradient Boosting Decision Tree
AU - Dai, Tianxiang
AU - Jiang, Yufan
AU - Li, Yong
AU - Mei, Fei
PY - 2024/7/4
Y1 - 2024/7/4
N2 - The Gradient Boosting Decision Tree (GBDT) is a well-known machine learning algorithm that achieves high performance and outstanding interpretability in real-world scenarios such as fraud detection, online marketing, and risk management. Moreover, two data owners can jointly train a GBDT model without disclosing their private datasets by executing secure Multi-Party Computation (MPC) protocols. In this work, we propose NodeGuard, a highly efficient two-party computation (2PC) framework for large-scale GBDT training and inference. NodeGuard guarantees that no sensitive intermediate results are leaked during training or inference. Its efficiency advantage is achieved by a novel keyed bucket aggregation protocol, which globally optimizes the communication and computation complexity of training. Additionally, we introduce a probabilistic approximate division protocol with a re-scaling optimization for the case where the divisor is publicly known. Finally, we compare NodeGuard to state-of-the-art frameworks and show that it is extremely efficient, improving privacy-preserving GBDT training performance by a factor of 5.0 to 131 in LAN settings and 2.7 to 457 in WAN settings.
AB - The Gradient Boosting Decision Tree (GBDT) is a well-known machine learning algorithm that achieves high performance and outstanding interpretability in real-world scenarios such as fraud detection, online marketing, and risk management. Moreover, two data owners can jointly train a GBDT model without disclosing their private datasets by executing secure Multi-Party Computation (MPC) protocols. In this work, we propose NodeGuard, a highly efficient two-party computation (2PC) framework for large-scale GBDT training and inference. NodeGuard guarantees that no sensitive intermediate results are leaked during training or inference. Its efficiency advantage is achieved by a novel keyed bucket aggregation protocol, which globally optimizes the communication and computation complexity of training. Additionally, we introduce a probabilistic approximate division protocol with a re-scaling optimization for the case where the divisor is publicly known. Finally, we compare NodeGuard to state-of-the-art frameworks and show that it is extremely efficient, improving privacy-preserving GBDT training performance by a factor of 5.0 to 131 in LAN settings and 2.7 to 457 in WAN settings.
U2 - 10.1109/spw63631.2024.00015
DO - 10.1109/spw63631.2024.00015
M3 - Conference contribution/Paper
BT - 2024 IEEE Security and Privacy Workshops (SPW)
PB - IEEE
ER -
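
Note: the abstract mentions a probabilistic approximate division protocol with a re-scaling optimization for publicly known divisors, but the record does not spell it out. As background only, the minimal Python sketch below illustrates the classic SecureML-style local truncation over additive secret shares, a well-known instance of this idea for a public power-of-two divisor. It is an illustrative assumption, not NodeGuard's actual protocol (which, per the abstract, handles division more generally), and every name in it is invented for the example.

import secrets

ELL = 64
MOD = 1 << ELL  # shares live in the ring Z_{2^64}, a common 2PC choice (assumption)

def share(x):
    """Split x into two additive shares modulo 2^64."""
    x0 = secrets.randbelow(MOD)
    return x0, (x - x0) % MOD

def reconstruct(x0, x1):
    """Recombine two additive shares."""
    return (x0 + x1) % MOD

def truncate_share(xi, f, party):
    """Locally divide one share by the public divisor 2^f (no communication).
    Party 0 computes floor(x0 / 2^f); party 1 treats its share as a negative
    offset and computes 2^ELL - floor((2^ELL - x1) / 2^f). Reconstruction
    then yields floor(x / 2^f) + e with e in {0, 1}, except with probability
    about x / 2^ELL, when the random share wraps around the ring."""
    if party == 0:
        return xi >> f
    return (MOD - ((MOD - xi) >> f)) % MOD

# Demo: divide the secret value 1000 by the public divisor 2^4 = 16.
x0, x1 = share(1000)
y = reconstruct(truncate_share(x0, 4, 0), truncate_share(x1, 4, 1))
assert y in (62, 63)  # floor(1000 / 16) = 62, up to a +1 rounding error
print(y)

The design point this sketch shows is why a public divisor is cheap: each party re-scales its own share locally, with zero communication, accepting a one-unit rounding error and a negligible wraparound failure probability, which is what makes the division "probabilistic" and "approximate".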