Title: Fault tolerance in grid computing by resource clustering
Authors: Miloud Khaldi; Mohammed Rebbah; Boudjelal Meftah; Mohammed Debakla
Addresses: Computer Science Department, Faculty of Exact Sciences, University Mustapha Stambouli, Mascara, Algeria ' Computer Science Department, Faculty of Exact Sciences, University Mustapha Stambouli, Mascara, Algeria ' Computer Science Department, Faculty of Exact Sciences, University Mustapha Stambouli, Mascara, Algeria ' Computer Science Department, Faculty of Exact Sciences, University Mustapha Stambouli, Mascara, Algeria
Abstract: Computational grids have the potential for solving large-scale scientific problems using heterogeneous and geographically distributed resources. At this scale, the characteristics of dynamicity, resource heterogeneity and scalability have made fault tolerance more complex. In this paper, we propose FT-GRC a fault tolerance model that seeks to find the best substitute for the failed node by the clustering of the grid resources. This model is based on dynamic coloured graphs without replication of computer resources. The proposed fault tolerance mechanism uses scoring function to determine the appropriate substitute for each failed node by calculating the performance level of each node, and later exploits clustering to determine optimally the choice of substitute. Experimental results show the efficiency of the scoring method and the gain obtained by looking for the substitutes in the same cluster and then by the research for the nearest substitutes.
Keywords: grid computing; dynamic coloured graph; fault tolerance; clustering; scoring.
DOI: 10.1504/IJITST.2020.104577
International Journal of Internet Technology and Secured Transactions, 2020 Vol.10 No.1/2, pp.120 - 142
Received: 17 Feb 2018
Accepted: 16 May 2018
Published online: 21 Jan 2020 *