*** Dataset_for_paper_Smarter open government data for Society 5.0: are your OGD smart enough?_2021 ***

Authors: Anastasija Nikiforova
University of Latvia, Faculty of Computing
Corresponding author: Anastasija Nikiforova
Contact Information: anastasija.nikiforova@lu.lv

***General Introduction***

This dataset contains data collected during the study "Smarter open government data for Society 5.0: are your open data smart enough", conducted by Anastasija Nikiforova (University of Latvia). It is being made public both to act as supplementary data for the paper "Smarter open government data for Society 5.0: are your open data smart enough" and so that other researchers can use these data in their own work.

The data in this dataset were collected by inspecting 60 countries and their OGD portals (51 OGD portals in total, in May 2021) to find out whether they meet the trends of Society 5.0 and Industry 4.0, as obtained by conducting an analysis of relevant OGD portals.

***Purpose of the survey***

The purpose of this survey was to gain insight into the state of the OGD portals and their data in relation to Society 5.0 and Industry 4.0 trends.

***Test procedure***

Each portal was studied starting with a search for a data set of interest, i.e. “real-time”, “sensor” and “covid-19”, followed by a list of additional questions. These questions were formulated on the basis of a combination of (1) crucial open (government) data-related aspects, including open data principles, success factors, recent studies on the topic, the PSI Directive [29] etc., (2) trends and features of Society 5.0 and Industry 4.0, and (3) elements of the Technology Acceptance Model (TAM) and the Unified Theory of Acceptance and Use Model (UTAUT).

The method used corresponds to a typical, everyday task on open data portals, sometimes called a “usability test”: keywords related to a research question are used to filter data sets, i.e. “real-time”, “real time”, “sensor”, “covid”, “covid-19”, “corona”, “coronavirus”, “virus”. In most cases, the keywords “real-time”, “sensor” and “covid” were sufficient.

For less user-friendly portals, the examination of the respective aspects was adapted to the particular case, based on the specifics of the portal or data set, by checking:

1. are open data related to the topic in question ({sensor; real-time; Covid-19}) published, i.e. available?
2. are these data available in a machine-readable format?
3. are these data current, i.e. regularly updated? The criterion for currency depends on the nature of the data: Covid-19 data on the number of cases per day are expected to be updated daily, whereas daily updates would not be sufficient for real-time data, as the name suggests.
4. is an API provided for these data? This is most important for real-time and sensor data.
5. have they been published in a timely manner? This was verified mainly for Covid-19-related data. Timeliness is assessed by comparing the date of the first case identified in a given country with the date of the first release of open data on this topic.
6. what is the total number of available data sets?
7. does the open government data portal provide use-cases / showcases?
8. does the open government data portal provide an opportunity to gain insight into the popularity of the data, i.e. does the portal provide statistics of this nature, such as the number of views, downloads, reuses, rating etc.?
9. is there an opportunity to provide feedback, a comment, a suggestion or a complaint?
10. (9a) is the artifact, i.e. feedback, comment, suggestion or complaint, visible to other users?

***Description of the data in this data set: possible answers (type of question, pre-defined options (if any))***

1. country
2. country category in the human development index, where VHHD = very high human development, HHD = high human development, MHD = medium human development, LHD = low human development
3. rank in the human development index
4. OGD portal link
5. total number of data sets available on the portal
6. statistics (number of views, downloads, reuses, rating etc.) - Boolean, i.e. 1 = available, 0 = not available
7. statistics on popularity, if the portal provides statistics of this nature, i.e. number of views, downloads, reuses, rating etc. - type (string)
8. opportunity to provide feedback, a comment, a suggestion or a complaint - Boolean, i.e. 1 = available, 0 = not available
9. feedback, comment, suggestion or complaint and their visibility to other users - Boolean, i.e. 1 = available, 0 = not available
10. showcases - Boolean, i.e. 1 = available, 0 = not available
11. total number of use-cases on the portal (if any AND if these data are provided) - number
12. sensor data - Boolean, i.e. 1 = available, 0 = not available
13. (sensor data) total number of data sets on the portal (if any AND if these data are provided) - number
14. (sensor data) up-to-date / updated frequently - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
15. (sensor data) machine-readable format - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
16. (sensor data) API available - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
17. (sensor data) total number of use-cases - number
18. real-time data - Boolean, i.e. 1 = available, 0 = not available
19. (real-time) total number of data sets on the portal (if any AND if these data are provided) - number
20. (real-time) up-to-date / updated frequently - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
21. (real-time) machine-readable format - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
22. (real-time) API available - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
23. (real-time) total number of use-cases - number
24. Covid-19 data - Boolean, i.e. 1 = available, 0 = not available
25. (Covid-19 data) total number of data sets on the portal (if any AND if these data are provided) - number
26. first case of Covid-19 identified in the country - date (retrieved from https://en.wikipedia.org/wiki/COVID-19_pandemic_by_country_and_territory)
27. first mention on the OGD portal, i.e. the first data set in which the relevant keyword was found - date
28. Covid-19-related data set release - date
29. (Covid-19 data) up-to-date / updated frequently - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
30. (Covid-19 data) machine-readable format - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
31. (Covid-19 data) API available - {-1, 0, 1}, where 1 = yes, 0 = not always, -1 = no
32. (Covid-19 data) total number of use-cases - number

* The values 0/1 or 1/0 may sometimes appear. They indicate a mixed case in which the result tends to be closer to one of the two numbers. For example, 0/1 should be interpreted as follows: the vast majority of data sets show results that would be assessed with 0, but there are still some data sets that would be assessed with 1.

Sheets #2-4 provide data on the topic ranked by the results.

***Format of the file***

.xls

***Licenses or restrictions***

CC-BY
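
For readers who process the sheets programmatically, the value conventions described above can be decoded roughly as follows. This is a minimal Python sketch and not part of the dataset: the names `Score`, `parse_score` and `days_to_first_release` are hypothetical, the dates in the usage lines are invented for illustration, and reading 1/0 as the mirror image of 0/1 (the first number describes the majority of data sets) is an assumption inferred from the 0/1 example in the note above.

```python
from datetime import date
from typing import NamedTuple, Optional

ALLOWED = (-1, 0, 1)  # value range used by the {-1, 0, 1} fields


class Score(NamedTuple):
    dominant: int            # value describing the majority of data sets
    minority: Optional[int]  # value describing the remaining data sets, if any


def parse_score(cell) -> Optional[Score]:
    """Parse a cell such as 1, 1.0, "-1", "0/1" or "1/0".

    Returns None for empty cells. Assumption: in a mixed value "a/b",
    "a" is the dominant assessment and "b" the minority one.
    """
    if cell is None:
        return None
    text = str(cell).strip()
    if not text:
        return None
    parts = text.split("/")
    if len(parts) > 2:
        raise ValueError(f"unexpected score format: {text!r}")
    # Spreadsheet readers often return numbers as floats (1.0), so go via float.
    values = [int(float(p)) for p in parts]
    for v in values:
        if v not in ALLOWED:
            raise ValueError(f"score out of range: {text!r}")
    return Score(values[0], values[1] if len(values) == 2 else None)


def days_to_first_release(first_case: date, first_release: date) -> int:
    """Timeliness as days between the first Covid-19 case and the first data release."""
    return (first_release - first_case).days


# Illustrative usage with made-up values (not taken from the dataset).
mixed = parse_score("0/1")
delay = days_to_first_release(date(2020, 3, 2), date(2020, 3, 12))
print(mixed, delay)
```

Keeping the dominant and minority values separate, rather than collapsing a mixed value into a single number, avoids inventing an averaging rule that the dataset description does not define.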