There is a newer version of the record available.

Published November 20, 2018 | Version 1
Dataset Open

PESO: Prostate Epithelium Segmentation on H&E-stained prostatectomy whole slide images

Description

A large set of whole-slide images (WSIs) of prostatectomy specimens with various grades of prostate cancer (PCa). More information can be found in the corresponding paper: https://doi.org/10.1038/s41598-018-37257-4

The WSIs in this dataset can be viewed using the open-source software ASAP or OpenSlide.

Due to the large size of the complete dataset, the data has been split into multiple archives.

The data from the training set:

  • peso_training_masks.zip: Training masks (N=62) that have been used to train the main network of our paper. These masks are generated by a trained U-Net on the corresponding IHC slides.
  • peso_training_masks_corrected.zip: A subset of the color deconvolution masks (N=25) on which manual annotations have been made. Within these regions, stain and other artifacts have been removed.
  • peso_training_colordeconvolution.zip: Mask files (N=62) containing the P63&CK8/18 channel of the color deconvolution operation. These masks mark all regions that are stained by either P63 or CK8/18 in the IHC version of the slides.
  • peso_training_wsi_{1-6}.zip: Zip files containing the whole slide images of the training set (N=62). Each archive contains 10 slides, except the last, which contains 12. These images are exported at a pixel resolution of 0.48 µm/pixel.
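
As a minimal sketch of the archive layout described above, the following Python snippet expands the peso_training_wsi_{1-6}.zip pattern and checks the slide counts. It assumes that {1-6} denotes the suffixes _1 to _6; the helper name expand_archives is hypothetical and not part of the dataset.

```python
def expand_archives(prefix, count):
    """Return prefix_1.zip .. prefix_<count>.zip (the naming scheme is an assumption)."""
    return [f"{prefix}_{i}.zip" for i in range(1, count + 1)]


training_archives = expand_archives("peso_training_wsi", 6)

# Per the description: 10 slides per archive, except the last, which holds 12.
slides_per_archive = [10] * (len(training_archives) - 1) + [12]
total_slides = sum(slides_per_archive)

for name, n_slides in zip(training_archives, slides_per_archive):
    print(f"{name}: {n_slides} slides")
print("total:", total_slides)
```

The total should agree with the N=62 stated for the training set, which is a quick check that no archive was missed when downloading.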

The data from the test set:

  • peso_testset_regions.zip: Collection of annotation XML files with outlines of the test regions. These can be used to view the test regions in more detail using ASAP.
  • peso_testset_png.zip: Export of the test set regions in PNG format (2500x2500 pixels per region).
  • peso_testset_png_padded.zip: Export of the test regions in PNG format, padded with a 500-pixel-wide border (3500x3500 pixels per region). Useful for segmenting pixels at the border of the regions.
  • peso_testset_mapping.csv: A CSV file mapping files from the test set (numbered 1-160) to regions in the XML files. The CSV file also contains the label (benign or cancer) for each region.
  • peso_testset_wsi_{1-4}.zip: Zip files containing the whole slide images of the test set (N=40). Each archive contains 10 slides of the test set. These images are exported at a pixel resolution of 0.48 µm/pixel.
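
After segmenting a padded region, the border can be discarded to recover the unpadded 2500x2500 area. The following is a minimal sketch, assuming the 500-pixel border is applied equally on all four sides (consistent with 3500 = 2500 + 2 x 500). The function names are hypothetical, and the mask is a plain list of rows so that no imaging library is required.

```python
def inner_box(padded_size=3500, border=500):
    """Return (left, upper, right, lower) of the unpadded area in a square padded region."""
    if padded_size <= 2 * border:
        raise ValueError("border leaves no interior")
    return (border, border, padded_size - border, padded_size - border)


def crop_border(rows, border):
    """Drop `border` pixels from every side of a square 2D mask given as a list of rows."""
    left, upper, right, lower = inner_box(len(rows), border)
    return [row[left:right] for row in rows[upper:lower]]


print(inner_box())  # (500, 500, 3000, 3000)
```

The box uses the (left, upper, right, lower) order, so it could also be passed to an image library's crop routine that follows the same convention.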

This study was financed by a grant from the Dutch Cancer Society (KWF), grant number KUN 2015-7970.

If you use this dataset, please cite both the dataset itself and the corresponding paper: https://doi.org/10.1038/s41598-018-37257-4

Files (138.1 GB)

Checksum Size
md5:c267372ed522e1d15b8b550f3f3a8454 3.0 kB
md5:c212aa96a7f79eb177ce1e4bfedb1ac1 205.2 MB
md5:0c054a0ad3ae341988fa8c79591e2fe0 400.1 MB
md5:9ec87b07fec0f1df7a17cf2de86aff99 17.1 kB
md5:88342979339ed915272a50a4a3140a6d 13.1 GB
md5:e74424f99b8f60baa455c9291ceaa994 13.0 GB
md5:bd661a495ff00aae6009c4dd7e003f08 14.2 GB
md5:088d6b472d3b6305af8a31ff77e3cb15 14.1 GB
md5:8a6ec1f175d4479bec3c32b5ee82955a 1.4 GB
md5:e3db7901434a0b2d51dfa57f4e59ebbe 1.7 GB
md5:8e2c86fcecfafe09c9d48a60b42441b5 587.3 MB
md5:8e4e53d7ba855fc2f318dce94b05fe31 14.4 GB
md5:bfcae8b444c12c0ecbb717dc37334020 13.9 GB
md5:f7d484acec429c3a9d9e685969edd82b 12.1 GB
md5:75a350b450193b48e9bed3a3484f639d 13.2 GB
md5:bc94373db95c5e2ceefe08c45a8e54db 11.0 GB
md5:4fa8cdd4b748d67b6c39a982be7b627a 15.0 GB
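
Because the archives are large, it may be worth checking each download against the md5: values listed above. The following is a minimal sketch using only the Python standard library; the function names are hypothetical, and which checksum belongs to which file has to be taken from the record itself, since the listing above does not pair them.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a file against a checksum written as 'md5:<hex>' or as bare hex."""
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, matches_listing("peso_testset_mapping.csv", "md5:...") would return True only if the local copy is intact, where the elided value is the checksum shown for that file on the record page.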

Additional details

Related works

Is documented by
10.1038/s41598-018-37257-4 (DOI)