Dataset Open Access

D1D-TRNS4 Truly Random Number Seeds with Verified Statistical Quality

Harris V Georgiou (MSc,PhD)

========================================================

       Dataset: D1D-TRNS4

    Truly Random Number Seeds with Verified
        Statistical Quality (Jul.2016)
 
            Release Notes

    Copyright (c) 2016 by Harris V. Georgiou

========================================================
   Release:     Jul 31, 2016

   - Version:  1.1a
   - Format:   .wav/.bin
========================================================


This file contains important information about the current version of the dataset package. Downloading and using this material hints that you accept the EULA/Terms-of-Use (please read carefully).

We welcome your comments and suggestions.

_______________________________________________
WHAT'S IN THIS PACKAGE?

-  Overview
-  Available file formats
-  Files and Datasets
-  License Agreement

_______________________________________________
OVERVIEW

The generation of truly random number streams is a task of fundamental importance in cryptography and computer simulations. The term "truly" refers to the use of a natural phenomenon that is inherently random, e.g. the decay of a radio-active element. Instead, a more practical approach is to use pseudo-random number generators (PRNG) that closely resembles such random process (output), as long as they have a very long period and their initial seed is inherently random. Additionally, some cryptographically strong algorithms, such as hashing/digest or encryption functions can also be used, since they act by design as "entropy diffusers".

This package contains four sets of data that combine (a) truly random sources as seed and (b) cryptography for additional randomness in the output. Specifically, low-quality (noisy) sound recordings of rainfall and RF static are used as input for multiple PGP/GPG encryption steps with random keys. The result is a set of random binary data of very high statistical quality (sizes 690KB-1.27MB), which can be used as-is for one-time pads or as seeds for high-quality PRNG implementations.

The source data is low-quality noisy waveforms: two rainfall samples with (approximately) mean=1.3KHz and stdev=600Hz in both cases; and two RF static samples with (approximately) mean=1.6KHz and stdev=800Hz. These can also be used as-is, but for quality PRNG seeds these raw data have to be properly pre-processed (at least whitened).

The final data are evaluated using the ENT command-line tool and the appropriate statistics in Excel/LibreOffice format. In essence, ENT compares the bit-level statistics of the binary (PGP/GPG) files to the theoretically optimal values if the data are perfectly random.

_______________________________________________
AVAILABLE FILE FORMATS

The datasets are available in the following formats (included):

*.wav    : source sound files (4bit/mono/8kHz/32kbps)
*.bin   : binary files ready for use (PGP/GPG output)

 

Files (17.5 MB)
Name Size
D1D-TRNS4.7z md5:6d04af3ece9bb6d201c9dcfd6f76f422 17.5 MB Download
D1D-TRNS4.7z.asc md5:7c9387647624e0db16a4d772aa7454ec 851 Bytes Download
EULA-TermsOfUse.txt md5:f5be4d66f1d3d7c5ccdebe0c8602caee 2.9 kB Download
README.txt md5:e19fa997bf9525575c0efd084aabd8bb 4.6 kB Download

Share

Cite as