An Empirical Study of the Usage of Checksums for Web Downloads

We analyzed current practices for using checksums with web downloads. We built and annotated a dataset of 277 web pages containing checksums. This webpage allows visitors to download the dataset and to submit new URLs to extend it.

Cite

Bernard, G., Coudert, R., Chapuis, B., and Huguenin, K. (2023). An Empirical Study of the Usage of Checksums for Web Downloads. In The Web Conference (WWW) (p. 12). [doi]

Statistics about the collected checksums

Figures from the paper updated with additional submitted checksums

Most frequently used techniques, ordered by strength. Based on observations.

Distribution of file sizes (in KB); log-scale x-axis. Based on observations.

Distribution of file types. Based on observations.

Distribution of outcomes for checksum verification. Based on observations.