A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI

Title: A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI
Publication Type: Conference Proceedings
Year of Publication: 2012
Authors: Bland, W., P. Du, A. Bouteiller, T. Herault, G. Bosilca, and J. Dongarra
Editor: Kaklamanis, C., T. Papatheodorou, and P. Spirakis
Conference Name: 18th International European Conference on Parallel and Distributed Computing (Euro-Par 2012) (Best Paper Award)
Date Published: 08-2012
Publisher: Springer-Verlag
Conference Location: Rhodes, Greece