Measurement-Based Analysis of Error Latency
- 1 May 1987
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Computers
- Vol. C-36 (5), 529-537
- https://doi.org/10.1109/TC.1987.1676937
Abstract
This paper demonstrates a practical methodology for the study of error latency under a real workload. The method is illustrated with sampled data on the physical memory activity, gathered by hardware instrumentation on a VAX 11/780 during the normal workload cycle of the installation. These data are used to simulate fault occurrence and to reconstruct the error discovery process in the system. The technique provides a means to study the system under different workloads and for multiple days. An approach to determine the percentage of undiscovered errors is also developed and a verification of the entire methodology is performed. This study finds that the mean error latency, in the memory containing the operating system, varies by a factor of 10 to 1 (in hours) between the low and high workloads. It is found that of all errors occurring within a day, 70 percent are detected in the same day, 82 percent within the following day, and 91 percent within the third day. The increase in failure rate due to latency is not so much a function of remaining errors but is dependent on whether or not there is a latent error.This publication has 6 references indexed in Scilit:
- DEPENDABLE COMPUTING AND FAULT TOLERANCE : CONCEPTS AND TERMINOLOGYPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- WORKLOAD, PERFORMANCE, AND RELlABlLlTY OF DIGITAL COMPUTlNG SYSTEMSPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Measurement and modeling of computer reliability as affected by system activityACM Transactions on Computer Systems, 1986
- A Measurement-Based Model for Workload Dependence of CPU ErrorsIEEE Transactions on Computers, 1986
- New results in fault latency modellingPublished by American Institute of Aeronautics and Astronautics (AIAA) ,1983
- A Statistical Failure/Load Relationship: Results of a Multicomputer StudyIEEE Transactions on Computers, 1982