Application recovery in parallel programming environment. (English) Zbl 1015.68618

Kranzlm├╝ller, Dieter (ed.) et al., Recent advances in parallel virtual machine and message passing interface. 9th European PVM/MPI users’ group meeting, Linz, Austria, September 29-October 2, 2002. Proceedings. Berlin: Springer. Lect. Notes Comput. Sci. 2474, 234-242 (2002).
Summary: In this paper, fault-tolerant feature of TOPAS parallel programming environment for distributed systems is presented. TOPAS automatically analyzes data dependence among tasks and synchronizes data, which reduces the time needed for parallel program developments. TOPAS also provides supports for scheduling, load balancing and fault tolerance. The main topics of this paper is to present the solution for transparent recovery of asynchronous distributed computation on clusters of workstations without hardware spare when a fault occurs on a node. Experiments show simplicity and efficiency of parallel programming in TOPAS environment with fault-tolerant integration, which provides graceful performance degradation and quick reconfiguration time for application recovery.
For the entire collection see [Zbl 1011.68745].


68U99 Computing methodologies and applications
68N19 Other programming paradigms (object-oriented, sequential, concurrent, automatic, etc.)
68M14 Distributed systems


Full Text: Link