There is a newer version of this record available.

Dataset Open Access

Sticky Pi -- Machine Learning Data, Configuration and Models

Quentin Geissmann


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Quentin Geissmann</dc:creator>
  <dc:date>2021-04-12</dc:date>
  <dc:description>Dataset for the Machine Learning section of the Sticky Pi project (https://doc.sticky-pi.com/)

Contains the dataset for the three algorithms described in the publication: Universal Insect Detector, Siamese Insect Matcher and Insect Tuboid Classifier.

Universal Insect Detector:

`universal_insect_detector/` contains training/validation data, configuration files to train the model, and the model as trained and used for publication.


	`data/` – A set of svg images that contain the embedded jpg raw image, and a set of non-intersecting polygon around the labelled insects
	`output/`
	
		`model_final.pth` – the model as trained for the publication
	
	
	`config/`
	
		`config.yaml` – The configuration file defining the hyperparameters to train the model as well as the taxonomic labels
		`config.yaml `– The configuration file defining the hyperparameters to train the model
		`mask_rcnn_R_101_C4_3x.yaml` – the base configuration file from which config is derived
	
	


 

Siamese Insect Matcher

`siamese_insect_matcher/` contains training/validation data, configuration files to train the model, and the model as trained and used for publication.


	`data/` – a set of svg images that contain two embedded jpg raw images vertically stacked corresponding to two frames in a series. Each predicted insect is labelled as a polygon. Insects that are labelled as the same instance, between the two frames, are grouped (i.e. SVG group). The filename of each image is `&lt;device&gt;.&lt;datetime_frame_1&gt;.&lt;datetime_frame_2&gt;.svg`
	`output/`
	
		`model_final.pth` – the model as trained for the publication
	
	
	`config/`
	
		`config.yaml` – The configuration file defining the hyperparameters to train the model as well as the taxonomic labels
		`config.yaml` – The configuration file defining the hyperparameters to train the model
	
	


 

Insect Tuboid Classifier:

`insect_tuboid_classifier/` contains images of insect tuboid, a database file describing their taxonomy, a configuration file to train the model, and the model as trained and used for publication.


	`data/`
	
		`database.db`: a sqlite file with a single table `ANNOTATIONS`. The table maps a unique identifier of each tuboid (tuboid_id) to a set of manually annotated taxonomic variables.
		A directory tree of the form: `&lt;series_id&gt;/&lt;tuboid_id&gt;/`. Each terminal directory contains:
		
			
			
				`tuboid.jpg` – a jpeg image made of 224 x 224 tiles representing all the shots in a tuboid, left to right, top to bottom – might be padded with empty images
				`metadata.txt` – a csv text file with columns:
				
					
					
						parrent_image_id – &lt;device&gt;.&lt;UTC_datetime&gt;
						X – the X coordinates of the object centroid
						Y – the Y coordinates of the object centroid
					
					
				
				
				scale – The scaling factor applied between the original and image and the 224 x 224 tile (&gt;1 =&gt; image was enlarged)
				`context.jpg` – a representation of the first whole image of a series, with a box around the first tuboid shot (this is for debugging/labelling purposes)
			
			
		
		
	
	
	`output/`
	
		`model_final.pth` – the model as trained for the publication
	
	
	config/
	
		`config.yaml` – The configuration file defining the hyperparameters to train the model as well as the taxonomic labels
	
	
</dc:description>
  <dc:identifier>https://zenodo.org/record/4680119</dc:identifier>
  <dc:identifier>10.5281/zenodo.4680119</dc:identifier>
  <dc:identifier>oai:zenodo.org:4680119</dc:identifier>
  <dc:language>eng</dc:language>
  <dc:relation>doi:10.5281/zenodo.4680118</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:subject>instect traps</dc:subject>
  <dc:subject>behavioral ecology</dc:subject>
  <dc:title>Sticky Pi -- Machine Learning Data, Configuration and Models</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>dataset</dc:type>
</oai_dc:dc>
210
505
views
downloads
All versions This version
Views 210128
Downloads 505276
Data volume 1.1 TB435.8 GB
Unique views 187118
Unique downloads 16238

Share

Cite as