This volunteers_trainedObservers_sequenceClassification readme.txt file was generated on 2021-03-17 by Lucie Thel



GENERAL INFORMATION



1. Title of Dataset: volunteers_trainedObservers_sequenceClassification


2. Author Information

	A. Principal Investigator Contact Information

		Name: Lucie Thel

		Institution: Laboratoire de Biometrie et Biologie Evolutive, UMR CNRS 5558

		Address: UCB Lyon 1 - Bat. Gregor Mendel, 43 bd du 11 novembre 1918, 69622 VILLEURBANNE cedex, France

		Email: lucie.mc.thel@gmail.com


	B. Associate or Co-investigator Contact Information

		Name: Sarah Huebner

		Institution: College of Biological Sciences, University of Minnesota

		Address: Saint Paul, MN 55108, U.S.A.

		Email: huebn090@umn.edu


3. Date of data collection: 2010-07-25 - 2013-04-07


4. Geographic location of data collection: Serengeti National Park, Tanzania


5. Information about funding sources that supported the collection of the data :
This work was supported by NSF grant DEB-1020479 to CP for maintenance of the long-term Lion Project, the University of Minnesota Supercomputing Institute, private donations raised during the Serengeti Live and Save Snapshot Serengeti crowd-funding campaigns, and by grants to AS from Explorer’s Club, UMN Thesis Research Grants, UMN Office of International Programs, American Society of Mammalogists, and Minnesota Zoo Ulysses S. Seal Conservation fund. Snapshot Serengeti website development was funded by awards to the Zooniverse from the Alfred P. Sloan Foundation. 





SHARING/ACCESS INFORMATION



1. Licenses/restrictions placed on the data: Creative Commons Attribution 4.0 International License



2. Links to publications that cite or use the data: https://www.nature.com/articles/sdata201526


3. Links to other publicly accessible locations of the data: None


4. Links/relationships to ancillary data sets: http://dx.doi.org/10.5061/dryad.5pt92


5. Was data derived from another source? yes

	A. If yes, list source(s): http://dx.doi.org/10.5061/dryad.5pt92 (Swanson, A. et al. Dryad http://dx.doi.org/10.5061/dryad.5pt92 (2015))


6. Recommended citation for this dataset: 





DATA & FILE OVERVIEW



1. File List: volunteers_trainedObservers_sequenceClassification
This dataset contains the classifications made by the volunteers of Snapshot Serengeti program and three trained observers of sequences of 1 to 3 pictures, taken between 2010 and 2013 in Serengeti National Park (Tanzania). It contains information about the species present in the pictures (topi, kongoni or Grant’s gazelle), and the age class of the individuals of the pictures (juveniles of less than 1 month, between 1 and 6 months, between 6 and 12 months, of more than 12 months; adults; individuals of undetermined age). This dataset is used in the analyses of the paper “Can citizen science analysis of camera trap data be used to study reproduction? Lessons from Snapshot Serengeti program” published in Wildlife Biology (2021).


2. Relationship between files, if important: none


3. Additional related data collected that was not included in the current data package: none


4. Are there multiple versions of the dataset? no





METHODOLOGICAL INFORMATION



1. Description of methods used for collection/generation of data: 
See details in: Swanson A., Kosmala M., Lintott C., Simpson R., Smith A. & Packer C. (2015). Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna. Scientific Data, Nature Publishing Group


2. Methods for processing the data: 
Each sequence of pictures was first classified by citizen scientists on the Zooniverse platform: the species and the presence or absence of young was determined. Then, pictures were classified again by three trained observers (on the basis of morphological criteria to identify individual age class): the species was reassessed, the number of individuals of 6 different age classes (juveniles of less than 1 month, between 1 and 6 months, between 6 and 12 months, of more than 12 months; adults; individuals of undetermined age) was determined.
See details in: Thel L., Chamaille-Jammes S., Keurinck L., Catala M., Packer C., Huebner S., Bonenfant C. (2021). Can citizen science analysis of camera trap data be used to study reproduction? Lessons from Snapshot Serengeti program. Wildlife Biology


3. Instrument- or software-specific information needed to interpret the data: none


4. Standards and calibration information, if appropriate: none


5. Environmental/experimental conditions: 
See details in: Swanson A., Kosmala M., Lintott C., Simpson R., Smith A. & Packer C. (2015). Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna. Scientific Data, Nature Publishing Group
And in: Thel L., Chamaille-Jammes S., Keurinck L., Catala M., Packer C., Huebner S., Bonenfant C. (2021). Can citizen science analysis of camera trap data be used to study reproduction? Lessons from Snapshot Serengeti program. Wildlife Biology


6. Describe any quality-assurance procedures performed on the data: 
See details in: Swanson A., Kosmala M., Lintott C., Simpson R., Smith A. & Packer C. (2015). Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna. Scientific Data, Nature Publishing Group
And in: Thel L., Chamaille-Jammes S., Keurinck L., Catala M., Packer C., Huebner S., Bonenfant C. (2021). Can citizen science analysis of camera trap data be used to study reproduction? Lessons from Snapshot Serengeti program. Wildlife Biology


7. People involved with sample collection, processing, analysis and/or submission: Thel Lucie, Chamaille-Jammes Simon, Keurinck Lea, Catala Maxime, Packer Craig, Huebner Sarah, Bonenfant Christophe





DATA-SPECIFIC INFORMATION FOR: volunteers_trainedObservers_sequenceClassification



1. Number of variables: 17


2. Number of cases/rows: 7078


3. Variable List: 
capture_event_id: A unique identifier for each capture event and resultant image set
image_names: One to three URL suffix(es) to be appended to ‘https://snapshotserengeti.s3.msi.umn.edu/’ to yield the full URL of the image(s) associated to the capture event id
date_time: The date-time stamp is reported in yyyy-mm-dd hh:mm:ss. Time zone is UTC + 3:00. Note that daylight savings time is not observed in Tanzania
site_id: The alpha-numeric site code
loc_x: UTM X coordinates of the site (datum Arc1960, zone 36S)
loc_y: UTM Y coordinates of the site (datum Arc1960, zone 36S)
species: Species of animal in the capture event assessed by the trained observers
user_name: Identity of the trained observer
inf_one: Number of individuals less than one month old present in the capture event
one_to_six: Number of individuals one to six months old present in the capture event
six_to_twelve: Number of individuals six to twelve months old present in the capture event
sup_twelve: Number of individuals more than twelve months old present in the capture event
adult: Number of adult individuals present in the capture event
unknown: Number of individuals of unknown age present in the capture event
proportion_young_volunteers: The proportion of volunteers who selected ‘young present’ for this capture event for this species
nb_indiv_trainedobservers: Number of individuals present in the capture event for this species, estimated by the trained observer. Note this value can vary according to the trained observer
nb_indiv_volunteers: Median number of individuals, estimated by the volunteers as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11–50 or 51+


4. Missing data codes: none


5. Specialized formats or other abbreviations used: none


