Performance of checksums and CRCs over real data

1 October 1995

journal article
Published by Association for Computing Machinery (ACM) in ACM SIGCOMM Computer Communication Review

Vol. 25 (4), 68-76
https://doi.org/10.1145/217391.217413

Abstract

Checksum and CRC algorithms have historically been studied under the assumption that the data fed to the algorithms was entirely random. This paper examines the behavior of checksums and CRCs over real data from various UNIX® file systems. We show that, when given real data in small to modest pieces (e.g., 48 bytes), all the checksum algorithms have skewed distributions. In one dramatic case, 0.01% of the check values appeared nearly 19% of the time. These results have implications for CRCs and checksums when applied to real data. They also cause a spectacular failure rate for both the TCP and Fletcher's checksums when trying to detect certain types of packet splices.
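
As an illustration of the kind of measurement the abstract describes, the following is a minimal sketch, not the paper's actual method or code. It assumes a 16-bit ones'-complement Internet checksum in the style used by TCP and an 8-bit-sum variant of Fletcher's checksum (modulo 255), and it counts how often each check value occurs over consecutive 48-byte pieces of a byte string. The names `CELL_SIZE`, `internet_checksum`, `fletcher16`, and `check_value_histogram` are hypothetical and chosen only for this example.

```python
from collections import Counter

CELL_SIZE = 48  # bytes per piece; assumption taken from the abstract's example size


def internet_checksum(data: bytes) -> int:
    """16-bit ones'-complement checksum in the style used by TCP."""
    if len(data) % 2:
        data += b"\x00"  # pad odd-length input with a zero byte
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]  # add big-endian 16-bit words
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)  # fold carries back in
    return ~total & 0xFFFF


def fletcher16(data: bytes) -> int:
    """Fletcher's checksum with two 8-bit running sums, modulo 255."""
    a = b = 0
    for byte in data:
        a = (a + byte) % 255
        b = (b + a) % 255
    return (b << 8) | a


def check_value_histogram(data: bytes, checksum) -> Counter:
    """Count how often each check value occurs over consecutive full cells."""
    counts = Counter()
    for start in range(0, len(data) - CELL_SIZE + 1, CELL_SIZE):
        counts[checksum(data[start:start + CELL_SIZE])] += 1
    return counts
```

Calling `check_value_histogram(data, internet_checksum).most_common(10)` on the contents of a real file would list the ten most frequent check values. Over uniformly random data each value should appear about equally often, so a few values dominating the list would be the kind of skew the abstract reports.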
