Home > Research > Publications & Outputs > Task-based Parallel Computation of the Density ...

Electronic data

  • siam

    Rights statement: c 2017 Society for Industrial and Applied Mathematics

    Final published version, 628 KB, PDF document

    Available under license: None

Links

Text available via DOI:

View graph of relations

Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning. / Ghale, Purnima; Kroonblawd, Matthew P.; Mniszewski, Sue et al.
In: SIAM Journal on Scientific Computing, Vol. 39, No. 6, 2017, p. C466-C480.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Ghale, P, Kroonblawd, MP, Mniszewski, S, Negre, CFA, Pavel, R, Pino, S, Sardeshmukh, V, Shi, G & Hahn, G 2017, 'Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning', SIAM Journal on Scientific Computing, vol. 39, no. 6, pp. C466-C480. https://doi.org/10.1137/16M109404X

APA

Ghale, P., Kroonblawd, M. P., Mniszewski, S., Negre, C. F. A., Pavel, R., Pino, S., Sardeshmukh, V., Shi, G., & Hahn, G. (2017). Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning. SIAM Journal on Scientific Computing, 39(6), C466-C480. https://doi.org/10.1137/16M109404X

Vancouver

Ghale P, Kroonblawd MP, Mniszewski S, Negre CFA, Pavel R, Pino S et al. Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning. SIAM Journal on Scientific Computing. 2017;39(6):C466-C480. Epub 2017 Dec 14. doi: 10.1137/16M109404X

Author

Ghale, Purnima ; Kroonblawd, Matthew P. ; Mniszewski, Sue et al. / Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning. In: SIAM Journal on Scientific Computing. 2017 ; Vol. 39, No. 6. pp. C466-C480.

Bibtex

@article{604015213a54416186214f7b7cce1f27,
title = "Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning",
abstract = "Quantum-based molecular dynamics (QMD) is a highly accurate and transferable method for material science simulations. However, the time scales and system sizes accessible to QMD are typically limited to picoseconds and a few hundred atoms. These constraints arise due to expensive self-consistent ground-state electronic structure calculations that can often scale cubically with the number of atoms. Linearly scaling methods depend on computing the density matrix P from the Hamiltonian matrix H by exploiting the sparsity in both matrices. The second-order spectral projection (SP2) algorithm is an O(N) algorithm that computes P with a sequence of 40-50 matrix-matrix multiplications. In this paper, we present task-based implementations of a recently developed data-parallel graph-based approach to the SP2 algorithm, G-SP2. We represent the density matrix P as an undirected graph and use graph partitioning techniques to divide the computation into smaller independent tasks. The partitions thus obtained are generally not of equal size and give rise to undesirable load imbalances in standard MPI-based implementations. This load-balancing challenge can be mitigated by dynamically scheduling parallel computations at runtime using task-based programming models. We develop task-based implementations of the data-parallel G-SP2 algorithm using both Intel's Concurrent Collections (CnC) as well as the Charm++ programming model and evaluate these implementations for future use. Scaling and performance results of our implementations are investigated for representative segments of QMD simulations for solvated protein systems containing more than 10,000 atoms.",
keywords = "quantum molecular dynamics, asynchronous parallel programming, Charm plus, CnC, graph partitioning, density matrix, ELECTRONIC-STRUCTURE THEORY, PROCESSING UNITS, ALGORITHMS, METALS, DECAY",
author = "Purnima Ghale and Kroonblawd, {Matthew P.} and Sue Mniszewski and Negre, {Christian F. A.} and Robert Pavel and Sergio Pino and Vivek Sardeshmukh and Guangjie Shi and Georg Hahn",
year = "2017",
doi = "10.1137/16M109404X",
language = "English",
volume = "39",
pages = "C466--C480",
journal = "SIAM Journal on Scientific Computing",
issn = "1064-8275",
publisher = "SIAM PUBLICATIONS",
number = "6",

}

RIS

TY - JOUR

T1 - Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning

AU - Ghale, Purnima

AU - Kroonblawd, Matthew P.

AU - Mniszewski, Sue

AU - Negre, Christian F. A.

AU - Pavel, Robert

AU - Pino, Sergio

AU - Sardeshmukh, Vivek

AU - Shi, Guangjie

AU - Hahn, Georg

PY - 2017

Y1 - 2017

N2 - Quantum-based molecular dynamics (QMD) is a highly accurate and transferable method for material science simulations. However, the time scales and system sizes accessible to QMD are typically limited to picoseconds and a few hundred atoms. These constraints arise due to expensive self-consistent ground-state electronic structure calculations that can often scale cubically with the number of atoms. Linearly scaling methods depend on computing the density matrix P from the Hamiltonian matrix H by exploiting the sparsity in both matrices. The second-order spectral projection (SP2) algorithm is an O(N) algorithm that computes P with a sequence of 40-50 matrix-matrix multiplications. In this paper, we present task-based implementations of a recently developed data-parallel graph-based approach to the SP2 algorithm, G-SP2. We represent the density matrix P as an undirected graph and use graph partitioning techniques to divide the computation into smaller independent tasks. The partitions thus obtained are generally not of equal size and give rise to undesirable load imbalances in standard MPI-based implementations. This load-balancing challenge can be mitigated by dynamically scheduling parallel computations at runtime using task-based programming models. We develop task-based implementations of the data-parallel G-SP2 algorithm using both Intel's Concurrent Collections (CnC) as well as the Charm++ programming model and evaluate these implementations for future use. Scaling and performance results of our implementations are investigated for representative segments of QMD simulations for solvated protein systems containing more than 10,000 atoms.

AB - Quantum-based molecular dynamics (QMD) is a highly accurate and transferable method for material science simulations. However, the time scales and system sizes accessible to QMD are typically limited to picoseconds and a few hundred atoms. These constraints arise due to expensive self-consistent ground-state electronic structure calculations that can often scale cubically with the number of atoms. Linearly scaling methods depend on computing the density matrix P from the Hamiltonian matrix H by exploiting the sparsity in both matrices. The second-order spectral projection (SP2) algorithm is an O(N) algorithm that computes P with a sequence of 40-50 matrix-matrix multiplications. In this paper, we present task-based implementations of a recently developed data-parallel graph-based approach to the SP2 algorithm, G-SP2. We represent the density matrix P as an undirected graph and use graph partitioning techniques to divide the computation into smaller independent tasks. The partitions thus obtained are generally not of equal size and give rise to undesirable load imbalances in standard MPI-based implementations. This load-balancing challenge can be mitigated by dynamically scheduling parallel computations at runtime using task-based programming models. We develop task-based implementations of the data-parallel G-SP2 algorithm using both Intel's Concurrent Collections (CnC) as well as the Charm++ programming model and evaluate these implementations for future use. Scaling and performance results of our implementations are investigated for representative segments of QMD simulations for solvated protein systems containing more than 10,000 atoms.

KW - quantum molecular dynamics

KW - asynchronous parallel programming

KW - Charm plus

KW - CnC

KW - graph partitioning

KW - density matrix

KW - ELECTRONIC-STRUCTURE THEORY

KW - PROCESSING UNITS

KW - ALGORITHMS

KW - METALS

KW - DECAY

U2 - 10.1137/16M109404X

DO - 10.1137/16M109404X

M3 - Journal article

VL - 39

SP - C466-C480

JO - SIAM Journal on Scientific Computing

JF - SIAM Journal on Scientific Computing

SN - 1064-8275

IS - 6

ER -