Published July 27, 2022 | Version 1
Dataset Open

Yield Prediction Through Integration of Genetic, Environment, and Management Data Through Deep Learning: Cleaned Data

Authors/Creators

  • 1. USDA-ARS

Description

The included files and script allow reconstruction of the data directory and cleaned data used in "Yield Prediction Through Integration of Genetic, Environment, and Management Data Through Deep Learning" (https://doi.org/10.1101/2022.07.29.502051). The code is available at 10.5281/zenodo.7401113.

| Filename | Description |
| --- | --- |
| interim.tar.gz | Contains the site grouping dictionary |
| processed.tar.gz | Processed data |
| raw.tar.gz | Input data |
| SetupInstructions.sh | Bash script that prepares the folders and unpacks the data as expected by the code in 10.5281/zenodo.7401113 |
| SetupInstructions.txt | Instructions for unzipping the data |
| Train_Test_Split_Reference_Phenotypes.csv | Reference spreadsheet for exploring the training and test set groupings |
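
The authoritative setup steps are those in SetupInstructions.sh and SetupInstructions.txt, whose contents are not reproduced here. As a rough sketch only, assuming the three archives have been downloaded into one directory, they could be unpacked from Python as follows. The `data` target folder and the function name `extract_archives` are hypothetical and may not match the layout the analysis code expects.

```python
import tarfile
from pathlib import Path

# Archive names as listed in the file descriptions of this record.
ARCHIVES = ["interim.tar.gz", "processed.tar.gz", "raw.tar.gz"]


def extract_archives(source_dir=".", target_dir="data"):
    """Extract each archive found in source_dir into target_dir.

    target_dir is an assumed location; the directory layout expected by
    the analysis code is defined in SetupInstructions.sh, not here.
    Returns the names of the archives that were extracted.
    """
    source = Path(source_dir)
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    extracted = []
    for name in ARCHIVES:
        archive = source / name
        if not archive.is_file():
            # Skip missing archives instead of failing, so partial
            # downloads can still be unpacked.
            print(f"skipping {name}: not found in {source}")
            continue
        with tarfile.open(archive, "r:gz") as tar:
            tar.extractall(path=target)
        extracted.append(name)
    return extracted
```

Calling `extract_archives(".", "data")` would unpack whichever of the three archives are present. The two large archives total roughly 13.7 GB compressed, so enough free disk space should be available before extraction.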

This work was supported through funding from the USDA Agricultural Research Service, ARS project number 5070-21000-041-000-D. Raw data provided by the [Genomes to Field Initiative](https://www.genomes2fields.org/) and the [Daymet database](https://daymet.ornl.gov/).

Files

Files (13.7 GB)

| Checksum | Size |
| --- | --- |
| md5:d9ef13d83f02b0df5960be76199b98af | 511 Bytes |
| md5:25f58023cee6e3c61d4ff0b74707d59e | 7.3 GB |
| md5:e90f14fc6be137aedb6b9f50e0ff05b8 | 6.4 GB |
| md5:2f53d6b6f24b2a322004417dd554ed30 | 144 Bytes |
| md5:d1812032428de700c18947a52f7b6df9 | 683 Bytes |
| md5:47c5f407b691d7e1083ccf041dde1c68 | 12.4 MB |
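
The MD5 checksums listed above can be used to confirm that a download completed intact. The following is a minimal sketch, assuming the file is already on local disk. The function names `md5_of_file` and `matches_checksum` are illustrative. Because the listing here does not show which file each checksum belongs to, that pairing should be taken from the record page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that multi-gigabyte archives do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 against an expected value.

    `expected` may carry the 'md5:' prefix used in the listing.
    Requires Python 3.9+ for str.removeprefix.
    """
    expected_hex = expected.strip().lower().removeprefix("md5:")
    return md5_of_file(path) == expected_hex
```

A call such as `matches_checksum(path_to_download, "md5:...")` returns `True` only when the computed digest equals the listed one. A `False` result usually indicates a truncated or corrupted download.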