Project title: 
Code accompanying the manuscript "Quality Assurance framework for the design, collection and curation of Standard Data Products from a distributed ecology network"

Project overview:
The files provided here generate a Standard Data Product from a raw data file for the Plantpopnet project. The data, code and outputs are detailed below.

DATA
The code reads two input files (1 and 2) and writes one output file (3).
1. The raw demographic data file is:
fullPPN_datasetY0_2020-11-18_.csv
2. The coordinates for the sites are contained in: 
Coordinates_Oct2020_site_level.csv
3. The Standard Data Product output from this code is: 
PLANTPOPNET_Y0_V1.3_2020-11-18_.csv
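
As a minimal sketch, the two input files might be loaded in R as shown here. The helper read_input() is hypothetical (it is not part of the project code), and the sketch assumes that both files sit in the working directory and are comma-separated with a header row.

```r
# Hypothetical helper (not part of the project code): read a CSV file,
# stopping with a clear message if the file is missing.
read_input <- function(path) {
  if (!file.exists(path)) {
    stop("Input file not found: ", path)
  }
  read.csv(path, stringsAsFactors = FALSE)
}

# File names as listed above; assumed to be in the working directory.
raw_file   <- "fullPPN_datasetY0_2020-11-18_.csv"
coord_file <- "Coordinates_Oct2020_site_level.csv"

if (file.exists(raw_file) && file.exists(coord_file)) {
  raw_data <- read_input(raw_file)
  coords   <- read_input(coord_file)
  # Inspect the dimensions before running the cleaning code.
  print(dim(raw_data))
  print(dim(coords))
} else {
  message("Place both input files in the working directory before running.")
}
```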

DATA CLEANING CODE
An R Markdown file contains the primary annotated data cleaning code used to produce the Standard Data Product (SDP):
PPN_Compiled_Cleaner_anon.Rmd
A knitted HTML version of the R Markdown file is:
PPN_Compiled_Cleaner_2025-08-15.html
The R Markdown file calls several R scripts (.R files) that each complete a specific task:
Global1_remove_na_rows_func.R
site_data_cleaner.R
var1_site_code.R
var2-5_coordinate_locations.R
var6_7_transect_plot_numbers.R
var8_number_seedling.R
var9_PlantID.R
var10_11_xy_coords.R
var12_suspected_clone_binary.R
var13_survival.R
var14_no_rosettes.R
var15_rosette_id.R
var16-21_outlier_detection.R
var22_InflorPhenology.R
var23_disease_yes_no.R
var25_herbivory_yes_no.R
var28_29_c_year_and_s_year.R
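
Before knitting, it can help to confirm that every script listed above is present. The sketch below is one hedged way to do that. find_missing() is a hypothetical helper, and the sketch assumes that the .Rmd file and the scripts sit together in the working directory. Knitting uses rmarkdown::render(), which requires the rmarkdown package.

```r
# Scripts called by the R Markdown file, as listed above.
script_files <- c(
  "Global1_remove_na_rows_func.R",
  "site_data_cleaner.R",
  "var1_site_code.R",
  "var2-5_coordinate_locations.R",
  "var6_7_transect_plot_numbers.R",
  "var8_number_seedling.R",
  "var9_PlantID.R",
  "var10_11_xy_coords.R",
  "var12_suspected_clone_binary.R",
  "var13_survival.R",
  "var14_no_rosettes.R",
  "var15_rosette_id.R",
  "var16-21_outlier_detection.R",
  "var22_InflorPhenology.R",
  "var23_disease_yes_no.R",
  "var25_herbivory_yes_no.R",
  "var28_29_c_year_and_s_year.R"
)

# Hypothetical helper: return the files that are absent from `dir`.
find_missing <- function(files, dir = ".") {
  files[!file.exists(file.path(dir, files))]
}

rmd_file <- "PPN_Compiled_Cleaner_anon.Rmd"
missing_files <- find_missing(c(rmd_file, script_files))

if (length(missing_files) > 0) {
  message("Missing files: ", paste(missing_files, collapse = ", "))
} else if (requireNamespace("rmarkdown", quietly = TRUE)) {
  # Knit the R Markdown file; output is written beside the .Rmd by default.
  rmarkdown::render(rmd_file)
} else {
  message("Install the rmarkdown package to knit ", rmd_file)
}
```

Checking for missing files first means a knit does not fail partway through because one script is absent.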

OUTPUTS
The primary output is the Year 0 Standard Data Product: 
PLANTPOPNET_Y0_V1.3_2020-11-18_.csv
Other data files are produced by running the data cleaning scripts and are used to check aspects of the data cleaning process:
mydata_ros_check.csv 
mydata_surv_check.csv
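
One hedged way to take a first look at these check files is sketched below. summarise_check() is a hypothetical helper. The sketch assumes that the files are comma-separated and sit in the working directory, and it makes no assumption about their column names.

```r
# Hypothetical helper: report the size of a check file and return its contents,
# or return NULL (with a message) if the file has not been produced yet.
summarise_check <- function(path) {
  if (!file.exists(path)) {
    message("Not found (run the cleaning code first): ", path)
    return(NULL)
  }
  d <- read.csv(path, stringsAsFactors = FALSE)
  cat(basename(path), ":", nrow(d), "rows,", ncol(d), "columns\n")
  d
}

# File names as listed above; assumed to be in the working directory.
check_files <- c("mydata_ros_check.csv", "mydata_surv_check.csv")
checks <- lapply(check_files, summarise_check)
names(checks) <- check_files
```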
