Optimal Choice of Checkpointing Interval for High Availability
2005 (English)In: Pacific Rim Dependable Computing Conference,2005, Los Alamitos, CA, USA: IEEE Computer Society , 2005, 159- p.Conference paper (Refereed)
Supporting high availability by checkpointing and switching to a backup upon failure of a primary has a cost. Trade-off studies help system architects to decide whether higher availability at the cost of higher response time is to strive for. The decision will lead to configuring a fault-tolerant server for best performance.
This paper provides a mathematical model employing queuing theory that helps to compute the optimal checkpointing interval for a primary-backup replicated server. The optimization criterion is system availability. The model guides towards the checkpointing interval that is short enough to give low failover time, but long enough to utilize most of the system resources for servicing client requests. The novelty of the work is the detailed modelling of service times, wait times for earlier calls in the queue, and priority of checkpointing calls over client calls within the queues. Studies on the model in Mathematica and validation of a modelling assumption through simulations are included.
Place, publisher, year, edition, pages
Los Alamitos, CA, USA: IEEE Computer Society , 2005. 159- p.
Dependebility, availability, checkpointing, queueing theory
IdentifiersURN: urn:nbn:se:liu:diva-30990Local ID: 16674OAI: oai:DiVA.org:liu-30990DiVA: diva2:251813