There is a newer version of the record available.

Published January 20, 2023 | Version v1
Dataset Open

A dataset from a survey investigating disciplinary differences in data citation

  • 1. Université de Montréal
  • 2. University of Ottawa
  • 3. ZBW Leibniz Information Center for Economics

Description

GENERAL INFORMATION

Title of Dataset:  A dataset from a survey investigating disciplinary differences in data citation

Date of data collection: January to March 2022

Collection instrument: SurveyMonkey

Funding: Alfred P. Sloan Foundation


SHARING/ACCESS INFORMATION

Licenses/restrictions placed on the data:  These data are available under a CC BY 4.0 license 

Links to publications that cite or use the data: 

Gregory, K., Ninkov, A., Ripp, C., Peters, I., & Haustein, S. (2022). Surveying practices of data citation and reuse across disciplines. Proceedings of the 26th International Conference on Science and Technology Indicators. International Conference on Science and Technology Indicators, Granada, Spain. https://doi.org/10.5281/ZENODO.6951437

Gregory, K., Ninkov, A., Ripp, C., Roblin, E., Peters, I., & Haustein, S. (2023). Tracing data:
A survey investigating disciplinary differences in data citation.
Zenodo. https://doi.org/10.5281/zenodo.7555266


DATA & FILE OVERVIEW

File List

  • Filename: MDCDatacitationReuse2021Codebook.pdf
    Codebook
  • Filename: MDCDataCitationReuse2021surveydata.csv
    Dataset format in csv
  • Filename: MDCDataCitationReuse2021surveydata.sav
    Dataset format in SPSS
  • Filename: MDCDataCitationReuseSurvey2021QNR.pdf
    Questionnaire

Additional related data collected that was not included in the current data package: Open ended questions asked to respondents


METHODOLOGICAL INFORMATION

Description of methods used for collection/generation of data: 

The development of the questionnaire (Gregory et al., 2022) was centered around the creation of two main branches of questions for the primary groups of interest in our study: researchers that reuse data (33 questions in total) and researchers that do not reuse data (16 questions in total). The population of interest for this survey consists of researchers from all disciplines and countries, sampled from the corresponding authors of papers indexed in the Web of Science (WoS) between 2016 and 2020. 

Received 3,632 responses, 2,509 of which were completed, representing a completion rate of 68.6%. Incomplete responses were excluded from the dataset. The final total contains 2,492 complete responses and an uncorrected response rate of 1.57%. Controlling for invalid emails, bounced emails and opt-outs (n=5,201) produced a response rate of 1.62%, similar to surveys using comparable recruitment methods (Gregory et al., 2020).

Methods for processing the data: 

Results were downloaded from SurveyMonkey in CSV format and were prepared for analysis using Excel and SPSS by recoding ordinal and multiple choice questions and by removing missing values.

Instrument- or software-specific information needed to interpret the data: 

The dataset is provided in SPSS format, which requires IBM SPSS Statistics. The dataset is also available in a coded format in CSV. The Codebook is required to interpret to values.


DATA-SPECIFIC INFORMATION FOR: MDCDataCitationReuse2021surveydata

Number of variables: 94

Number of cases/rows: 2,492

Missing data codes: 999        Not asked

Refer to MDCDatacitationReuse2021Codebook.pdf for detailed variable information.

Files

MDCDatacitationReuse2021Codebook.pdf

Files (1.9 MB)

Name Size Download all
md5:2e3ec05a44de16b5cd3beeedc6e0308f
522.7 kB Preview Download
md5:a238a3b033fab24304e99ed8b7cb5582
584.6 kB Preview Download
md5:cdce8ba348ee58b6dbffd8aee739eaf7
621.3 kB Download
md5:9e0ffed90b8a2198bceed7c56aa45a4e
167.6 kB Preview Download
md5:2f8a0ddafa80917133f6c4ebaae35075
4.9 kB Preview Download

Additional details

Related works

Is supplement to
Preprint: 10.5281/zenodo.7555266 (DOI)
Conference paper: 10.5281/ZENODO.6951437 (DOI)