Improving checkpointing intervals by considering individual job failure probabilities

Bibliographic details
Title: Improving checkpointing intervals by considering individual job failure probabilities
Authors: Frank, Alvaro; Baumgartner, Manuel; Salkhordeh, Reza; Brinkmann, Andre
Source: 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 299-309, May 2021
Relation: 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
Database: IEEE Xplore Digital Library
Description
ISBN: 9781665440660
ISSN: 1530-2075
DOI: 10.1109/IPDPS49936.2021.00038