Weakly Supervised Learning for Industrial Optical Inspection
- 1. Robert Bosch Corporate Research department, Schwieberdingen, Germany
Description
Abstract
In the following, we present a synthetic benchmark corpus for detect detection on statistically textured surfaces.We hope that it facilitates to further develop and benchmark classification algorithms for applications of industrial optical inspection. All data is publicly available and can be downloaded from this page.
Competition at DAGM 2007 symposium
The DAGM (Deutsche Arbeitsgemeinschaft für Mustererkennung e.V., German chapter of the IAPR (International Association for Pattern Recognition)) and the GNSS (German Chapter of the European Neural Network Society) offered an open competition on Weakly Supervised Learning for Industrial Optical Inspection held as part of the DAGM symposium in 2007.
The competition was inspired by the fact that automated optical inspection allows to reduce the cost of industrial quality control significantly. The competitors had to design a classification algorithm which:
- detects miscellaneous defects on various statistically textured backgrounds.
- learns to discern defects automatically from a weakly labelled training data.
- works on data whose exact characteristics are unknown at development time.
- adapts all parameters automatically and does not require any human intervention.
- has a moderate running time (in this competition 24 hours for training and 12 hours for the test phase).
- takes into account asymmetric costs for false positive and false negative decisions (1:20 was used for the competition).
Data description
Preview Image: https://zenodo.org/api/iiif/record:12750201:examples_small.jpg/full/!800,800/0/default.jpg
The data is artificially generated, but similar to real world problems. The first six out of ten datasets, denoted as development datasets, are supposed to be used for algorithm development. The remaining four datasets, which are referred to as competition datasets, can be used to evaluate the performance. Researchers should consider not using or analyzing the competition datasets before the development is completed as a code of honour.
In the following we provide some details about the datasets:
- Each development (competition) dataset consists of 1000 (2000) 'non-defective' and of 150 (300) 'defective' images saved in grayscale 8-bit PNG format.
- Each dataset is generated by a different texture model and defect model.
- 'Non-defective' images show the background texture without defects, 'defective' images have exactly one labelled defect on the background texture.
- All datasets has been randomly split into a training and testing sub-dataset of equal size.
- Weak labels are provided as ellipses roughly indicating the defective area. Technically, defective images are augmented with a separate grayscale 8-bit image in the PNG format located in a folder 'Label'. The values 0 and 255 denote background and defective area, respectively.
All meta-data is subsumed in a separate ASCII textfile called 'Labels.txt' which is located in the 'Label' folder. The structure is as follows:
1 \n
[id of item no. 1] \t [0 if non-defective, 1 if defective] \t [filename of raw image no. 1] \t 0 \t [filename of label image no. 1 if defective, 0 otherwise] \n
...
[id of item no. N] \t [0 if non-defective, 1 if defective] \t [filename of raw image no. N] \t 0 \t [filename of label image no. N if defective, 0 otherwise] \n
Notes
Files
examples_small.jpg
Files
(2.9 GB)
Name | Size | Download all |
---|---|---|
md5:9ed7f0480aea9bd0a83b429a17e3e750
|
267.9 MB | Preview Download |
md5:2743ebc79cb91551b51599a67f0e7528
|
410.4 MB | Preview Download |
md5:024f1f6c6d11d72ee12f19dcc98aa3d8
|
228.0 MB | Preview Download |
md5:57a337e93e38c71e01d15770e252f46d
|
202.9 MB | Preview Download |
md5:d6852ae3667db59e6a4d586d98633540
|
196.9 MB | Preview Download |
md5:463a932d87a5c0e02ee10cdb3265c318
|
215.0 MB | Preview Download |
md5:40ba70f232fe1cb891a46c1b77dc4940
|
260.8 MB | Preview Download |
md5:67571381c8aa5f82d5180e6e8838f735
|
380.4 MB | Preview Download |
md5:d02a892a8b6180893c8ea018848606cb
|
392.6 MB | Preview Download |
md5:083d8da49d46614202a332f9e9dd1bca
|
392.4 MB | Preview Download |
md5:acb79662c2c0e2d73724b9685543e2ee
|
465.7 kB | Preview Download |
Additional details
References
- Matthias Wieler, Tobias Hahn, Fred. A. Hamprecht. (2007) Weakly supervised learning for industrial optical inspection. [Dataset].
- https://hci.iwr.uni-heidelberg.de/content/weakly-supervised-learning-industrial-optical-inspection
- https://conferences.mpi-inf.mpg.de/dagm/2007/prizes.html#industry