Filtering Failure Logs for a BlueGene/L Prototype
Critical event prediction for proactive management in large-scale computer clusters
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD ’03
2005 International Conference on Dependable Systems and Networks (DSN’05)
A. Sivasubramaniam
R. Vilalta
S. Ma
J. E. Moreira
I. Rish
A. J. Oliner
Exploring event correlation for failure prediction in coalitions of clusters