Published December 4, 2024 | Version 1.0
Report Open

Renewable Energy Communities Digital Twin (RECDT)

Authors/Creators

  • 1. fabbricadigitale S.r.l.

Description

This project was conceived after the project “Planning Renewable Energy Communities 
(REC): the perspective of Municipalities and Small and Medium Enterprises 
(SME)” (PLANREC), that was as well part of the ERIGrid 2.0 Lab Access programme. In this 
previous project we developed a tool for the forecast of energy performance and metrics of 
Renewable Energy Communities.  
The goal of this new project is to investigate whether it is possible to obtain similar results by 
relying exclusively on socio-economic data of the REC, excluding historical consumption data 
of individual households from the necessary prediction data. In most proposed approaches, the 
short or long-term history of energy consumption is used to predict future energy load de
mands, but the historical load data needed for this kind of model is not always available. With 
our work we try to overcome the possible lack of this data and enable accurate REC energy 
forecasts even when the historical data are not available. 
Even though the proposed model does not require historical load data to make its predictions, 
it still needs historical load data for the training phase.  We searched for public sources that 
included both electricity load and socioeconomic data for the training and testing of the model. 
We found many datasets, but the most complete for our purposes was the IDEAL dataset 
(IDEAL). We created a database in pgAdmin by reworking the IDEAL dataset for our purposes, 
in which we included the weather data and the location of the households. 
It is important to notice that, since we do not have any historical data, our task is a regression 
problem, not a time-series forecasting problem. We tested some regression models: - 
RandomForestRegressor; - - 
ExtraTreesRegressor; 
XGBoost Regressor. 
From the grid searches that we conducted on these models, we observed that the best per
forming model was the ExtraTreesRegressor. An important thing to remember is that even 
though in the best case the ExtraTrees score was slightly better than the RandomForest and 
XGBoost best score, the results of the three models were very similar.  
What really made the difference in the model performance was the introduction of synthetic 
feature related to the load of the households: - 
Mean_daily_min_load: average of the daily minimum consumption for each household; - - 
Change_point_down: indicates whether the household is in stand-by (only basic con
sumption) or if there are other consumption that can be attributed to the presence of 
people; 
Change_point_up: indicates the presence of consumption peak. 
We validated the performance of the model with R-squared score and the final result was a 
score of 0.82. We also tested the model with 24-hour forecasts for some specific households, 
achieving a prediction of the consumption patterns with an acceptable degree of accuracy.

Files

ERIGrid2-Report-RECDT.pdf

Files (898.6 kB)

Name Size Download all
md5:45eefdf216610e85580c40edafca2b3f
898.6 kB Preview Download

Additional details

Funding

European Commission
ERIGrid 2.0 - European Research Infrastructure supporting Smart Grid and Smart Energy Systems Research, Technology Development, Validation and Roll Out – Second Edition 870620