BlueGene/L Failure Analysis and Prediction Models
Filtering Failure Logs for a BlueGene/L Prototype
Critical event prediction for proactive management in large-scale computer clusters
International Conference on Dependable Systems and Networks (DSN’06)
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD ’03
2005 International Conference on Dependable Systems and Networks (DSN’05)
M. Gupta
Yanyong Zhang
Yinglung Liang
R. Vilalta
S. Ma
J. E. Moreira
Exploring event correlation for failure prediction in coalitions of clusters