ABSTRACT

Computing Tools ..................................................... 84 4.4.2 Image Grid ............................................................................ 84 4.4.3 Computational Fluid Dynamics Grid ............................... 85 4.4.4 Course Online Grid ............................................................. 86 4.4.5 Massive Data Processing Grid ........................................... 88

4.5 Grid Dependable Research ........................................................... 88 4.5.1 Grid Monitor ......................................................................... 89

4.5.1.1 Architecture ............................................................. 89 4.5.1.2 Features ..................................................................... 93

4.5.2 Fault Tolerant Grid Platform .............................................. 93 4.5.2.1 Fault-Tolerant Grid Topology ................................ 94 4.5.2.2 Fault Tolerance of Information Server ................. 95

4.5.2.3 Fault Tolerance of the System-Level Component ............................................................. 96

4.5.2.4 Task Fault Tolerance .............................................. 96 4.5.3 Grid Fault Detection .......................................................... 101

4.5.3.1 Architecture ........................................................... 101 4.5.3.2 Adaptive Model ..................................................... 102

4.5.4 Adaptive Application Fault Tolerance ............................. 105 4.5.4.1 Overview of Failure Handling .............................. 106 4.5.4.2 Application-Level Fault-Tolerance

Techniques ............................................................. 107 4.5.4.3 Model of Policy Making ......................................... 108

4.6 Conclusion ..................................................................................... 113 Acknowledgments .............................................................................. 113 References ............................................................................................ 113

In recent years, grid computing has become very popular for its potential of aggregating high-performance computational and large-scale storage resources that distribute over the Internet. According to [1,2], grid computing is “resource sharing and coordinated problem solving in dynamic, multi-institutional virtual organizations.” The purpose of grid computing is to eliminate the resource islands in the application level, and to make computing and services ubiquitous.