Back close

Fault Tolerance and Recovery for Grid Application Reliabilityusing Checkpointing Mechanism

Publication Type : Journal Article

Publisher : International Journal of Computer Applications

Source : International Journal of Computer Applications. 26, No.5, 32-37, July2011, Indexed in Citeseer, Google Scholar, ISSN: 0975-8887, Impact factor: 0.814, DOI : 10.5120/3098-4252

Url : https://www.researchgate.net/publication/252983695_Fault_Tolerance_and_Recovery_for_Grid_Application_Reliability_using_Check_Pointing_Mechanism

Campus : Chennai

School : Department of Computer Science and Engineering, School of Computing

Year : 2011

Abstract : The check pointing mechanism and rollback recovery is a well-known method to achieve fault tolerance in grid computing systems. If any resource or process is tending to be faulty in run time that will be detected by check pointing mechanism through the Task Dependency Graph (TDG) and their respective worst case execution time and deadline parameters are used to decide the schedulability. The common approach is to use rollback-dependent graph or check point graph. The scheduling of concurrent tasks can be done using the proposed Concurrent Task Scheduling Algorithm (CTSA) algorithm to recover from the faulty states using replication or rollback techniques. The earlier fault detection methods are not scalable with the diversity of user applications and the frequency of faults varies dynamically making the faults hard to detect and recover. The check pointing and replication mechanisms have been used in high performance grid computing where the synchronization between communicating processes is needed to enhance the efficiency of check pointing mechanism. The performance improvements over the faulty conditions can be obtained with or without data and process replication. The experimental results show that the CTSA can lead to significant performance gain for a variety of scenarios.

Cite this Research Publication : S.Baghavathi Priya, Dr.T.Ravichandran, “Fault Tolerance and Recovery for Grid Application Reliabilityusing Checkpointing Mechanism”, International Journal of Computer Applications. 26, No.5, 32-37, July 2011, Indexed in Citeseer, Google Scholar, ISSN: 0975-8887, Impact factor: 0.814, DOI : 10.5120/3098-4252.

Admissions Apply Now