{
  "DOI": "10.5281/zenodo.3384092",
  "abstract": "A Benchmark Dataset for Low-Dose CT Reconstruction Methods.\n\n\nThe following Data Descriptor article provides full documentation:\n\n\nLeuschner, J., Schmidt, M., Baguer, D.O.\u00a0et al.\u00a0LoDoPaB-CT, a benchmark dataset for low-dose computed tomography reconstruction.\u00a0Sci Data\u00a08,\u00a0109 (2021). https://www.nature.com/articles/s41597-021-00893-z\n\n\n\u00a0\n\n\nThe python library DIV\u03b1\u2113 (github.com/jleuschn/dival)\u00a0can be used to download and access the dataset.\n\n\nReconstructions from the\u00a0LIDC/IDRI dataset\u00a0are used as a basis for this dataset.\n\n\n\u00a0\n\n\nThe ZIP files included in the LoDoPaB dataset contain multiple HDF5\u00a0files. Each HDF5 file contains one\u00a0HDF5 dataset named \"data\", that provides a number of\u00a0samples (128 except for the last file in each ZIP file). For example, the n-th training sample pair\u00a0is stored in the files \"observation_train_%03d.hdf5\" and \"ground_truth_train_%03d.hdf5\"\u00a0where \"%03d\" is floor(n / 128), at row (n mod 128)\u00a0of \"data\".\n\n\nNote: each last ground truth file (i.e.\u00a0ground_truth_train_279.hdf5,\u00a0ground_truth_validation_027.hdf5 and\u00a0ground_truth_test_027.hdf5) still contains a HDF5 dataset of\u00a0shape (128, 362, 362), although it contains\u00a0less than 128 valid samples. Thus, the number of valid samples needs to\u00a0be determined from the total samples numbers in the part (i.e. \"train\": 35820, \"validation\": 3522, \"test\": 3553), or from the corresponding observation\u00a0file, for\u00a0which the first dimension of the HDF5 dataset matches\u00a0the\u00a0number of valid samples in the file.\n\n\nThe randomized patient IDs of the\u00a0samples are provided as\u00a0CSV files. The patient IDs of the train, validation and test parts are integers in the range of 0\u2013631, 632\u2013691 and 692\u2013751, respectively. The ID of each sample is stored in a single row.\n\n\nAcknowledgements\n\n\nJohannes Leuschner, Maximilian Schmidt and Daniel Otero Baguer acknowledge the support by the Deutsche\nForschungsgemeinschaft (DFG) within the framework of GRK 2224/1 \u201c\u03c03: Parameter Identification \u2013 Analysis,\nAlgorithms, Applications\u201d. We thank Simon Arridge, Ozan \u00d6ktem, Carola-Bibiane Sch\u00f6nlieb and Christian\nEtmann for the fruitful discussion about the procedure, and Felix Lucka and Jonas Adler for their ideas and\nhelpful feedback on the simulation setup. The authors acknowledge the National Cancer Institute and the Foundation for the National Institutes of Health, and their critical role in the creation of the free publicly available LIDC/IDRI Database used in this study.",
  "author": [
    {
      "family": "Leuschner",
      "given": "Johannes"
    },
    {
      "family": "Schmidt",
      "given": "Maximilian"
    },
    {
      "family": "Otero Baguer",
      "given": "Daniel"
    }
  ],
  "id": "3384092",
  "issued": {
    "date-parts": [
      [
        "2019",
        "10",
        "04"
      ]
    ]
  },
  "language": "eng",
  "publisher": "Zenodo",
  "title": "LoDoPaB-CT Dataset",
  "type": "dataset",
  "version": "1.0.0"
}