Article type: Review article
Authors
Department of Electrical and Computer Engineering, Faculty of Engineering, Kharazmi University, Tehran, Iran
Abstract
Fault tolerance is now an essential requirement in many kinds of systems. Checkpointing, which saves safe points from which a system can recover after a fault occurs, can increase system reliability and dependability. The main drawback of checkpointing is its overhead: this overhead arises from executing the checkpointing itself and degrades system performance. Numerous approaches have therefore been introduced to reduce this overhead and thereby improve performance. This paper studies and reviews a wide range of checkpointing methods and organizes them into distinct groups according to the type of checkpointing execution and the system level at which checkpointing is performed, such as coordinated checkpointing, system-level checkpointing, application-level checkpointing, and distributed-system checkpointing. Finally, the paper provides a detailed summary in a comprehensive graph, together with a conclusion for each of these groups.
Keywords