Simulation Framework for Asynchronous Iterative Methods

  • Evan Christopher Coleman (1) Naval Surface Warfare Center, Dahlgren Division; (2) Old Dominion University
  • Erik Jensen Old Dominion University
  • Masha Sosonkina Old Dominion University
Keywords: Asynchronous Iterative Methods, Fault Tolerance, Asynchronous Simulation, Shared Memory, Intel Xeon Phi

Abstract

As high-performance computing (HPC) platforms progress towards exascale, computational methods must be revamped to successfully leverage them. In particular, (1) asynchronous methods become of great importance because synchronization becomes prohibitively expensive and (2) resilience of computations must be achieved, e.g., using checkpointing selectively which may otherwise become prohibitively expensive due to the sheer scale of the computing environment. In this work, a simulation framework is proposed and tested to examine the potential benefit of asynchronous iteration for various HPC accelerator architectures (which typically admit different granularities of computations). Additionally, an example of a case study using the simulation framework is presented to examine the efficacy of different checkpointing schemes for asynchronous relaxation methods.

Published
2018-06-28
How to Cite
Coleman, E., Jensen, E., & Sosonkina, M. (2018). Simulation Framework for Asynchronous Iterative Methods. Journal of Simulation Engineering, 1. Retrieved from https://jsime.org/index.php/jsimeng/article/view/6
Section
Articles