There is a newer version of the record available.

Published December 1, 2024 | Version v1
Dataset Open

Results and repro tools for Towards Cross-Build Differential Testing paper

  • 1. ROR icon Victoria University of Wellington
  • 2. ROR icon University of Auckland
  • 3. Oracle Labs

Description

Recent concerns about software supply chain security  have led to the emergence of different binaries built from the same source code. This will sometimes result in binaries that are not identical and therefore have  different cryptographic hashes. The question arises whether those binaries are still  equivalent, i.e., whether they have the same behaviour. We explore whether differential testing can be used to provide evidence for non-equivalence.

We study this for 3,541 pairs of binaries built for the same Maven artifact version, distributed on Maven Central, Google Assured Open Source Software and/or Oracle Build-From-Source. We use EvoSuite to generate tests for the baseline binary from Maven Central, run these tests against this baseline binary and any available alternately built binaries, and compare the results for consistency. We argue that any differences may indicate variations in program behaviour and could, therefore, be used to detect compromised binaries or failures at runtime.

This dataset contains the final results used to produce Figure 1 in different_test_outcomes.tsv, as well as tools to reproduce these results from several starting points. See README.md for instructions.

Citation: Jens Dietrich, Tim White, Valerio Terragni and Behnaz Hassanshahi. Towards Cross-Build Differential Testing. 18th IEEE International Conference on Software Testing, Verification and Validation (ICST) 2025.

Files

cross-build-diff-test.zip

Files (273.0 MB)

Name Size Download all
md5:12ebee23bc459b0b80e0c1cbd6bfdadd
272.9 MB Preview Download
md5:75c390b637155ed966ebe33d001e70ec
6.7 kB Download
md5:bc6c01caa8181ec71dd01c56783b50c0
5.0 kB Preview Download