On the Reliability of the IBM MVS/XA Operating System
IEEE Transactions on Software Engineering
S. Mourad
Exploring event correlation for failure prediction in coalitions of clusters