Published June 22, 2021 | Version v2
Dataset Open

Nanopore Translocation Signal

  • 1. Instituto de Ciencias Humanas, Sociales y Ambientales, CONICET Mendoza Technological Scientific Center, Mendoza M5500, Argentina
  • 2. Division of Solid-State Electronics, Department of Electrical Engineering, Uppsala University, SE-751 03 Uppsala, Sweden
  • 3. Department of Electrical and Computer Engineering, MS: EC33 ECSN Suite 4.7 The University of Texas at Dallas 800 W. Campbell Rd. Richardson, TX 75080, USA

Description

This dataset contains a set of nanopore translocation current traces. It is divided in two parts.

Part I: This part contains artificially generated traces with different levels of background noise (SNR = 4, 2, 1, 0.5, and 0.25)

For each noise level, three parameters are varied in data generation:

a. Twenty different concentrations of nanoparticles as the analytes (Cnp):

0.013, 0.016, 0.020, 0.025, 0.032, 0.040, 0.050, 0.063, 0.080, 0.1, 0.13, 0.16, 0.20, 0.25, 0.32, 0.40, 0.50, 0.63, 0.80, and 1, with the unit of nano-molar, [nM].

b. Fifteen different diameters of the nanoparticles (Dnp):

3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, and 17, with the unit of nanometer, [nm].

c. Five different translocation durations (Duration):

0.5, 1.0, 1.5, 3.0, and 5.0, with the unit of millisecond, [ms].

In total we have 20*15*5=1500 current traces for each SNR.

There are three datasets: training, validation and test. Traces in training datasets are of 20 seconds, traces in validation and test datasets are 10 seconds long. For SNR = 4, training, validation and test datasets are provided. For other SNRs, only test traces are provided.

Pert II: This part contains real experimental datasets, the translocation of Lambda DNA and Streptavidin with 6 current traces each at different bias voltages. Each Lambda DNA trace has 71 seconds, while each Streptavidin trace has 126 seconds. Two truncated pyramid shape nanopores were used in our experiments, one with a side length of 7.5 nm and another 16 nm, both in a 55 nm-thick silicon layer, for DNA and protein streptavidin translocation, respectively. The DNA and streptavidin were dispersed in 500 mM KCl electrolyte with a concentration of 78 pM and 84 nM, respectively.


 

Details of artificially generated data and experimental data can be found in our paper:

Dario Dematties, Chenyu Wen, Mauricio David Pérez, Dian Zhou, Shi-Li Zhang. Deep learning of nanopore sensing signals using a bi-path network. arXiv:2105.03660.

Trained and Validated Models

In this data set we also include all the trained and best validated models evaluated in our paper.

Nanopore Translocation Detector Trained and Validated Models 

In this data set we also include all the trained and best validated models evaluated in our newer paper: A Generalized Transformer-Based Pulse Detection Algorithm

Notes

The Artificial data set is compressed and split into "n" .zip files. Once you downloaded such files you can use the following sequence of commands for joining and unzipping them: zip -F Artificial_Data.zip --out single-archive.zip unzip single-archive.zip

Files

Artificial_Data.zip

Files (29.3 GB)

Name Size Download all
md5:62723ab2935f6cda022eb954858c4f10
1.1 GB Download
md5:55da7f3c910c6e37efead54b0ad78756
1.1 GB Download
md5:940443f8d72ca9661357edd0fcdcd2dd
1.1 GB Download
md5:ec4525d187d5a59f510e89d488ab7606
1.1 GB Download
md5:8d4f32b15de5b5aba1b9ed6e60460cb0
1.1 GB Download
md5:18830fcd8e628cecaf9197f462db22a4
1.1 GB Download
md5:d39f4f3eba1b554db316d63684540b80
1.1 GB Download
md5:6ebd3728a3fd39e8a5781c387de10b9f
1.1 GB Download
md5:0f6e35852312e89237b5e0f48b0a2c24
1.1 GB Download
md5:f90aed39606e6fdf0acd1a0082d74197
1.1 GB Download
md5:685ccee5f689742fa8a7190795d2229f
1.1 GB Download
md5:ea960fb3245bc9935f454c08d9c0b7e9
1.1 GB Download
md5:e836e465729f12070353072848d17a7a
1.1 GB Download
md5:f1718365208609397b29605b7ce09e4c
1.1 GB Download
md5:f3a20e083efe9b2d77c67daf38687bdc
1.1 GB Download
md5:ee4a0376a129c6499c7a17a8b7ff3815
1.1 GB Download
md5:82e65a7fd1d2fde6d3e5b924898275d0
1.1 GB Download
md5:d24c0b70c646e5e23a212ad6c570af78
1.1 GB Download
md5:20acfa7b5fb483868f082c4533cd6ede
23.8 MB Preview Download
md5:7790530c2f2894b3e2cc17d66695be3c
1.1 GB Download
md5:62a58540dd3ed3548828b2b17770118d
1.1 GB Download
md5:9b0887bb71fb362fa323466be4ea73ff
1.1 GB Download
md5:436da4c0ab3b2a2c311e3518daa31a0a
1.1 GB Download
md5:2b243556192e42a183650fc7db85c787
1.1 GB Download
md5:a5a45f21f94881cb4b42b7d372a9f1e7
957.9 MB Preview Download
md5:8d118a5cc19c0974ffde9ed104326dc1
49.8 MB Preview Download
md5:2f20b6752885c020b7ca644da894be3e
3.5 GB Preview Download