A reliability-aware scheduling algorithm for parallel task executing on cloud computing system Online publication date: Mon, 24-Jan-2022
by Jie Cao; Zhifeng Zhang; Bo Wang; Xiao Cui; Jinchao Xu
International Journal of Intelligent Systems Technologies and Applications (IJISTA), Vol. 20, No. 3, 2021
Abstract: As cloud computing is established on the massive cheap server clusters, it causes compute nodes' software and hardware to go wrong. Different computing nodes and communications links have different failure rate. For the parallel task scheduling problem that cloud users have requirements for deadlines and executing reliability, we put forward to generate all possible execution schemes of a parallel task on a cloud computing system. All the execution schemes are constructed into an execution scheme graph (ESG), in which a path from the start point to end point corresponds to an execution scheme of a parallel task. Based on ESG, we propose the maximum reliability execution scheme solving algorithm MRES that searches the execution schemes which have maximum reliability cost while meeting the parallel task's deadline requirement. The experimental results show that MRES algorithm can effectively improve the executing success rate.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Intelligent Systems Technologies and Applications (IJISTA):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com