Published October 25, 2023 | Version 1.0.1
Dataset Open

CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

Description

The CloudTracks dataset consists of 1,780 MODIS satellite images hand-labeled for the presence of more than 12,000 ship tracks. More information about how the dataset was constructed may be found at github.com/stanfordmlgroup/CloudTracks. The file structure of the dataset is as follows:

CloudTracks/
    full/
       images/
          (sample image name) mod2002121.1920D.png
       jsons/
          (sample json name) mod2002121.1920D.json

The naming convention is as follows:
mod2002121.1920D: the first 3 letters specify which of the sensors on the two MODIS satellites captured the image, mod for Terra and myd for Aqua. This is followed by a 4 digit year (2002) and a 3 digit day of the year (121). The following 4 digits specify the time of day (1920; 24 hour format in the UTC timezone), followed by D or N for Day or Night.

The 1,780 MODIS Terra and Aqua images were collected between 2002 and 2021 inclusive over various stratocumulus cloud regions (such as the East Pacific and East Atlantic) where ship tracks have commonly been observed. Each image has dimension 1354 x 2030 and a spatial resolution of 1km. Of the 36 bands collected by the instruments, we selected channels 1, 20, and 32 to capture useful physical properties of cloud formations.

The labels are found in the corresponding JSON files for each image. The following keys in the json are particularly important:

imagePath: the filename of the image.
shapes: the list of annotations corresponding to the image, where each element of the list is a dictionary corresponding to a single instance annotation. The dictionary has a key with value "shiptrack" or "uncertain" which is the label of the annotation and the corresponding value is a linestrip detailing the ship track path.

Further pre-processing details may be found at the GitHub link above. If you have any questions about the dataset, contact us at:
mahmedch@stanford.edulynakim@stanford.edujirvin16@cs.stanford.edu

Files

CloudTracks.zip

Files (20.7 GB)

Name Size Download all
md5:0e862bf2ac903a5a42bbaf26c7413263
20.7 GB Preview Download