10.5281/zenodo.1478972
https://zenodo.org/records/1478972
oai:zenodo.org:1478972
Higgins, P. A.
P. A.
Higgins
Pérez-Suárez, David
David
Pérez-Suárez
0000-0003-0784-6909
University College London
Murray, Sophie A.
Sophie A.
Murray
0000-0002-9378-5315
Trinity College Dublin
Maloney, Shane A.
Shane A.
Maloney
0000-0002-4715-1805
Trinity College Dublin
Gallagher, Peter T.
Peter T.
Gallagher
0000-0001-9745-0400
Dublin Institute for Advanced Studies
Bloomfield, Shaun
Shaun
Bloomfield
0000-0002-4183-9895
Northumbria University
McCloskey, Aoife E.
Aoife E.
McCloskey
0000-0002-4830-9352
Trinity College Dublin
Roche, Joseph
Joseph
Roche
0000-0002-9981-1502
Trinity College Dublin
O'Callaghan, David
David
O'Callaghan
O'Neill, Arlene
Arlene
O'Neill
0000-0002-1745-6739
Trinity College Dublin
Ryan, Dan F.
Dan F.
Ryan
NASA Goddard Space Flight Center
Leka, K.D.
K.D.
Leka
0000-0003-0026-931X
NorthWest Research Associates
Sunspotter - 14 years of SoHO/MDI
Zenodo
2018
active regions
sunspots
solar physics
citizen science
zooniverse
2018-11-07
eng
10.5281/zenodo.1478971
1
Creative Commons Attribution Share Alike 4.0 International
This is the second dataset based on automated detections produced by SMART [1], covering data from May 1996 to August 2010, with classifications made by volunteers through the Zooniverse Sunspotter project.
Volunteers were asked to choose the more complex active region of a pair, with pairs drawn at random from the least-classified images. Unlike the first round [2], all active regions were scaled to the same size in an attempt to remove the bias produced by differences in size.
The dataset is composed of five files:
lookup_timesfits.csv: lists the filenames and the date of the data acquisition.
lookup_properties.csv: lists the properties of the active region observed in each frame to be classified. Some of these properties are as measured by SMART [1].
classifications.csv: lists each classification made by the volunteers.
rankings.csv: lists the final ranking on complexity.
users_links.csv: cross-references user identifiers between dataset 2 (this one) and dataset 1 [2].
The score provided in the rankings file follows the Elo rating system. However, a new score based on a different ranking mechanism can be computed from the data available in the classifications file.
Although user information has been removed, each classification keeps an index that distinguishes classifications made by different users.
Software to ingest the tables into a SQLite database and to obtain some preliminary results is available on GitHub.
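For readers who only want a quick look at the tables, the following is a rough sketch of loading one CSV file into SQLite with the Python standard library. It is not the GitHub software mentioned above. It assumes each file has a single header row and uses the given delimiter throughout; the actual delimiter and any comment lines in the files are not described in this record, so the delimiter argument may need adjusting. The function name ingest_csv is hypothetical.

```python
import csv
import sqlite3
from pathlib import Path


def ingest_csv(conn, csv_path, delimiter=","):
    """Load one CSV file into a table named after the file (without extension).

    Assumes the first row is the header. Rows whose length differs from the
    header are skipped. All values are stored without a declared column type.
    """
    table = Path(csv_path).stem
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle, delimiter=delimiter)
        header = [name.strip() for name in next(reader)]
        rows = [row for row in reader if len(row) == len(header)]

    # Quote identifiers so unusual column names do not break the SQL.
    columns = ", ".join('"{}"'.format(name.replace('"', '""')) for name in header)
    placeholders = ", ".join("?" for _ in header)

    conn.execute('DROP TABLE IF EXISTS "{}"'.format(table))
    conn.execute('CREATE TABLE "{}" ({})'.format(table, columns))
    conn.executemany(
        'INSERT INTO "{}" VALUES ({})'.format(table, placeholders), rows
    )
    conn.commit()
    return len(rows)
```

Calling ingest_csv once per file, for example on rankings.csv and classifications.csv with a connection from sqlite3.connect("sunspotter.db"), would give one table per file; the database filename is only an example.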
[1]: DOI: 10.1016/j.asr.2010.06.024
[2]: DOI: 10.5281/zenodo.1478966