Published September 19, 2020 | Version v1
Dataset Open

Precipitation and Temperature Data for the Sydney Catchment Area, Australia

  • 1. University of Technology Sydney

Description

This dataset contains monthly precipitation time series for six sites (Blackheath, Braidwood, Darkes Forest, Goulburn, Lithgow and Moss Vale) in the Sydney Catchment Area (SCA), and monthly mean maximum and mean minimum temperature time series for three of those sites (Goulburn, Lithgow and Moss Vale). The data was used in the study "Attribution and Prediction of Precipitation and Temperature Trends within the Sydney Catchment Using Machine Learning". It originates from the Australian Bureau of Meteorology Climate Data Online (http://www.bom.gov.au/climate/data/index.shtml); missing values (8% of the data) have been filled using a moving average centred on the year for which the data is missing.
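The gap-filling step described above could look roughly like the sketch below. It is an illustration only: the description does not state the window width or whether the average runs over the same calendar month in neighbouring years, so both the `half_width` parameter and that interpretation are assumptions, and `fill_centred_average` is a hypothetical helper, not code from the study.

```python
from statistics import mean


def fill_centred_average(values, half_width=2):
    """Fill missing monthly values from neighbouring years.

    values: dict mapping (year, month) -> float, or None where missing.
    half_width: number of years on each side of the gap (assumed value;
                the dataset description does not give the window size).
    """
    filled = dict(values)
    for (year, month), value in values.items():
        if value is not None:
            continue
        # Same calendar month in the years around the gap (assumption).
        neighbours = [
            values.get((y, month))
            for y in range(year - half_width, year + half_width + 1)
            if y != year
        ]
        observed = [x for x in neighbours if x is not None]
        if observed:  # leave the gap if no neighbour is available
            filled[(year, month)] = mean(observed)
    return filled


# Made-up March totals with 2002 missing.
march = {(2000, 3): 10.0, (2001, 3): 20.0, (2002, 3): None,
         (2003, 3): 30.0, (2004, 3): 40.0}
print(fill_centred_average(march)[(2002, 3)])
```

Only originally observed values feed the average, so one filled gap never influences another.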

Below is the abstract for the paper:

 

Droughts in southeastern Australia can profoundly affect the water supply to Sydney, Australia's largest city. Increasing population, a warming climate, land surface changes, and expanded agricultural use increase water demand and reduce catchment runoff. Studying Sydney's water supply is necessary to manage water resources and lower the risk of severe water shortages. This study aims at understanding Sydney water supply by analysing precipitation and temperature trends across the catchment. A decreasing trend in annual precipitation was found across the Sydney catchment area. Annual precipitation also is significantly less variable, due to fewer years above the 80th percentile. These trends result from significant reductions in precipitation during spring and autumn, especially over the last 20 years. Wavelet analysis is applied to assess how the influence of climate drivers has changed over time. Attribute selection was carried out using linear regression and machine learning techniques including random forests and support vector regression. Drivers of annual precipitation included Niño3.4, SAM, DMI and measures of global warming such as the Tasman Sea sea surface temperature anomalies. The support vector regression model with a polynomial kernel achieved correlations of 0.921 and a skill score compared to climatology of 0.721. The linear regression model also performed well with a correlation of 0.815 and skill score of 0.567, highlighting the importance of considering both linear and non-linear methods when developing statistical models. Models were also developed on autumn and winter precipitation but performed worse than annual precipitation on prediction. For example, the best performing model on autumn precipitation, which accounts for approximately one quarter of annual precipitation, achieved an RMSE of 418.036 mm2 on the testing data while annual precipitation achieved an RMSE of 613.704 mm2. However, the seasonal models provided valuable insight into whether the season would be wet or dry compared to the climatology.
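The abstract reports RMSE and a skill score relative to climatology without defining them here. As a rough guide for readers reproducing such metrics from these series, the sketch below uses the standard RMSE and one common MSE-based skill score, 1 - MSE(model) / MSE(climatology). Whether the paper uses exactly this form is an assumption, and the function names are hypothetical.

```python
from math import sqrt


def rmse(observed, predicted):
    """Root mean squared error between two equal-length sequences."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return sqrt(sum(squared) / len(squared))


def skill_score_vs_climatology(observed, predicted, climatology):
    """MSE-based skill score against a constant climatological forecast.

    climatology: a single reference value, e.g. the long-term mean
                 (assumed form; the paper's definition may differ).
    Returns 1.0 for a perfect model and 0.0 for a model that is no
    better than always predicting the climatology.
    """
    n = len(observed)
    mse_model = sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n
    mse_clim = sum((o - climatology) ** 2 for o in observed) / n
    return 1.0 - mse_model / mse_clim
```

Under this definition a negative score means the model does worse than the climatological mean.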

Files

Blackheath.txt

Files (51.7 kB)

Checksum                                 Size
md5:dafd3ded06e1dff0b9c7aa3c899c0c9f     4.6 kB
md5:027a6a75f67f0b6d73fc55158f604599     4.4 kB
md5:6494cee488ad9a3d3653d19bc7c0237f     4.7 kB
md5:24ebd1d1e436c08d6ed8f7551895d642     4.4 kB
md5:923672220d235192b9986ab6e8521e53     4.4 kB
md5:57d8fd7ab75a812f06219f61d8273723     3.9 kB
md5:261d408156f7a70582eeb29971ff4fb1     4.5 kB
md5:6105f6abc7abd3c74023fcb99df68613     4.4 kB
md5:7948b497089d300d66e9ee41fb179575     3.8 kB
md5:3e25f94ca1cbc4a349e07f23243789db     4.5 kB
md5:23be309b89f2b6580e11fb9582318664     4.4 kB
md5:c5422241118405de5143737ec91f1ea6     3.8 kB
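
A downloaded file can be checked against the MD5 values listed above. The sketch below uses only Python's standard `hashlib`; the helper names `md5_of_file` and `matches_checksum` are hypothetical, and because this listing does not show which checksum belongs to which file, that pairing has to be taken from the record page.

```python
import hashlib


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum, with or without an 'md5:' prefix."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex
```

For example, `matches_checksum("Blackheath.txt", "md5:...")` returns `True` when the local copy is intact, with the checksum filled in from the entry for that file.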