Published February 6, 2025 | Version v1.0
Dataset Open

GM-SEUS: A harmonized dataset of ground-mounted solar energy in the US with enhanced metadata

Description

Ground-Mounted Solar Energy in the United States (GM-SEUS)

Abstract

Solar energy generating systems are critical components of our expanding energy infrastructure, yet available datasets remain incomplete or not publicly available, particularly at the sub-array level. Combining the best open-access datasets in the US with image analysis on freely available remotely-sensed imagery, we present the Ground-Mounted Solar Energy in the United States (GM-SEUS) dataset, a harmonized, open-access geospatial and temporal repository of solar energy arrays and panel-rows. GM-SEUS v1.0 includes over 15,000 commercial- and utility-scale ground-mounted solar photovoltaic and concentrating solar energy arrays (186 GW) covering 2,950 km² and includes 2.92 million unique solar panel-rows (466 km²). We use these newly compiled and delineated solar arrays and panel-rows to harmonize existing datasets and to independently estimate value-added attributes, including installation year, azimuth, mount technology, panel-row area and dimensions, inter-row spacing, ground cover ratio, tilt, and installed capacity. By estimating and harmonizing these attributes of the distributed US solar energy landscape, GM-SEUS supports diverse applications in renewable energy modeling, ecosystem service assessment, and infrastructure planning.

Technical info

This is the data repository for creating and maintaining the Ground-Mounted Solar Energy in the United States (GM-SEUS) spatiotemporal dataset of solar arrays and panel-rows, which uses machine learning and object-based image analysis to enhance existing data sources. The contents of this repository are described briefly here; the attached data README provides more detailed descriptions. The source GitHub repository for generating this dataset is available at https://github.com/stidjaco/GMSEUS. The related paper was published in Scientific Data.

This is the initial release of GM-SEUS (version 1.0). All input datasets and solar panel-row delineation results are up-to-date through December 11th, 2024. 

Primary Repository Contents Include: 

GMSEUS_Arrays_Final: Final array dataset with over 15,000 array boundaries drawn from existing datasets and enhanced by applying a buffer-dissolve-erode technique to GM-SEUS panel-rows; contains all array-level attributes (ESRI:102003). Provided as GeoPackage, shapefile, and comma-separated values.

GMSEUS_Panels_Final: Final panel-row dataset with 2.92 million panel-row boundaries, combining those from existing datasets with newly delineated GM-SEUS panel-rows; contains all panel-row-level attributes (ESRI:102003). Provided as GeoPackage, shapefile, and comma-separated values.

GMSEUS_NAIP_Arrays: All array boundaries created by applying the buffer-dissolve-erode method to newly delineated (NAIP) GM-SEUS panel-rows (ESRI:102003). Provided as GeoPackage, shapefile, and comma-separated values.

GMSEUS_NAIP_Panels: All newly delineated panel-row boundaries (ESRI:102003). Provided as GeoPackage, shapefile, and comma-separated values.

GMSEUS_NAIP_PanelsNoQAQC: All newly delineated panel-rows from NAIP imagery, without any quality control applied (ESRI:102003). Provided as GeoPackage, shapefile, and comma-separated values.

NAIPtrainRF: Training dataset of 12,000 NAIP training points (2,000 per class) containing class values, spectral index values, the year of the NAIP imagery accessed, and point coordinates (WGS84). Provided as comma-separated values.

NAIPclassifyRF: Random forest classifier trees and weights as output by the Google Earth Engine classifier. Provided as comma-separated values.

LabeledImages: Directory containing image and mask subdirectories with ~17,500 input and target images for deep learning pattern recognition applications. Provided as GeoTIFF.
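The NAIPtrainRF description implies six classes (12,000 points at 2,000 per class), so a quick sanity check after download is to count rows per class. The following is a minimal sketch using only the Python standard library. The column name `class` and the sample rows are assumptions made for illustration; check the data README for the actual header names.

```python
import csv
import io
from collections import Counter


def count_classes(csv_file, class_column="class"):
    """Count rows per class label in an open CSV file object.

    `class_column` is an assumed header name, not taken from the dataset.
    """
    reader = csv.DictReader(csv_file)
    return Counter(row[class_column] for row in reader)


# Tiny in-memory stand-in for the real training file (values are made up).
sample = io.StringIO(
    "class,ndvi,year\n"
    "0,0.10,2020\n"
    "1,0.55,2021\n"
    "0,0.12,2020\n"
)
counts = count_classes(sample)
print(counts)
```

With the real file, the same function could be called as `count_classes(open("NAIPtrainRF.csv", newline=""))`, where the file name and extension are assumed; a balanced result would show 2,000 rows for each class.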

Disclaimer: 

This dataset provides a broad characterization of solar array design practices. Any characterization of solar array design and management derived from remote sensing imagery should be considered with extreme scrutiny given the limitations of such approaches. While our work fills a critical data gap and compiles and enhances existing high-fidelity datasets, the design practices reported here are thus subject to uncertainty and should not be used to represent actual conditions at individual sites. No warranty is expressed or implied regarding accuracy, completeness or fitness for a specific purpose. We publish this dataset in open access, for the broader science community, policy makers, and stakeholders in addressing questions about the existing renewable energy landscape and do not consent to this data being used to target, identify, or make claims about individual arrays, properties, or entities. Any such use case is strictly prohibited. 

GM-SEUS is released under CC-BY 4.0. However, components derived from third-party datasets retain the original license of those inputs. Some upstream datasets used in boundary generation contain non-commercial (NC) licensing terms. As a result, users intending to reuse GM-SEUS for commercial purposes must ensure compliance with the licensing conditions of those upstream sources. GM-SEUS does not incorporate metadata or attribute information from non-commercial datasets. However, certain geometry or inferred boundaries may constitute derivative works of those sources. To support transparency, GM-SEUS retains the original spatial data source in the Source attribute column, and full upstream licensing information is provided in the accompanying sourceDataLicenses.csv file.
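One way to act on the licensing note is to cross-reference the Source column against sourceDataLicenses.csv and set aside rows whose upstream source carries non-commercial terms. The sketch below assumes that the license file has `Source` and `License` columns and that NC licenses can be recognized from the license string; neither is confirmed here. The source names and array IDs are hypothetical placeholders.

```python
import csv
import io
import re

# Assumed heuristic: a license string mentions "NC" or "non-commercial".
NC_PATTERN = re.compile(r"\bNC\b|non-?commercial", re.IGNORECASE)


def noncommercial_sources(license_file, source_col="Source", license_col="License"):
    """Return the set of source names whose license text looks non-commercial."""
    reader = csv.DictReader(license_file)
    return {row[source_col] for row in reader if NC_PATTERN.search(row[license_col])}


def split_by_license(array_file, nc_sources, source_col="Source"):
    """Split array rows into (permissive, restricted) lists by upstream source."""
    permissive, restricted = [], []
    for row in csv.DictReader(array_file):
        (restricted if row[source_col] in nc_sources else permissive).append(row)
    return permissive, restricted


# In-memory stand-ins; "SourceA"/"SourceB" and the IDs are placeholders.
licenses = io.StringIO("Source,License\nSourceA,CC-BY 4.0\nSourceB,CC BY-NC 4.0\n")
arrays = io.StringIO("arrayID,Source\n1,SourceA\n2,SourceB\n3,SourceA\n")

nc = noncommercial_sources(licenses)
permissive, restricted = split_by_license(arrays, nc)
print(sorted(nc), len(permissive), len(restricted))
```

Rows in `restricted` are the ones whose geometry would need to be checked against the upstream terms before commercial reuse.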

Files

GMSEUS_v1_0.zip

Files (8.6 GB)

Name Size
md5:ad405eda1feac84e92ccf28282bed646
3.5 GB
md5:30dc9b1bc0a96f6981a8b60dd4b3e8ff
5.1 GB
md5:65e2897e1c569f7c5220099ec9b8ffab
15.8 kB
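Because the archives are several gigabytes, it can be worth confirming a download against its listed MD5 checksum before unpacking. The following is a minimal sketch using Python's standard `hashlib`; the helper names are illustrative, and the demonstration hashes a small temporary file rather than the real archives.

```python
import hashlib
import os
import tempfile


def md5_of_file(path, chunk_size=1024 * 1024):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, listed_checksum):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


# Demonstration on a temporary stand-in file, not a real archive.
payload = b"stand-in for a downloaded archive"
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
    demo_path = tmp.name

demo_checksum = "md5:" + hashlib.md5(payload).hexdigest()
ok = matches_checksum(demo_path, demo_checksum)
bad = matches_checksum(demo_path, "md5:" + "0" * 32)
os.remove(demo_path)
print(ok, bad)
```

For a real download, pass the local file path together with the checksum shown next to that file on the record page.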

Additional details

Funding

United States Department of Agriculture
INFEWS/T1: Developing Pathways Toward Sustainable Irrigation across the United States Using Process-based Systems Models (SIRUS) 2018-67003-27406
Michigan State University
Climate Change Research Support Program: Building a Foundation for SCALE (Sustainable Communities, Agriculture, Landscapes, and Energy)

Dates

Collected
2024-12-11
All source datasets and imagery are up to date as of this date.

Software

Repository URL
https://github.com/stidjaco/GMSEUS
Programming language
Python, JavaScript
Development Status
Active