This README.txt file was generated on 2023-01-20 by C. Ripp Updated 2023-04-21 -------------------- CHANGE LOG -------------------- Amended codebook: Relabelled variable 13. Added variable Weight2. Version 2 of csv and .sav datasets with weight2. -------------------- GENERAL INFORMATION -------------------- 1. Title of Dataset: A dataset from a survey investigating disciplinary differences in data citation 2. Author Information A. Researcher Contact Information Name: Dr. Anton Boudreau Ninkov Institution: Université de Montréal Email: Université de Montréal B. Data Curation Contact Information Name: Chantal Ripp Institution: University of Ottawa Email: chantal.ripp@uottawa.ca C. Research Contact Information Name: Dr. Kathleen Gregory Institution: University of Vienna Email: kathleen.gregory@univie.ac.at D. Researcher Contact Information Name: Dr. Isabella Peters Institution: Kiel University Email: ipe@informatik.uni-kiel.de E. Principal Investigator Contact Information Name: Dr. Stefanie Haustein Institution: University of Ottawa Email: stefanie.haustein@uottawa.ca 3. Date of data collection: January to March 2022 4. Collection instrument: SurveyMonkey 5. Information about funding sources that supported the collection of the data: Alfred P. Sloan Foundation 6. How to cite: Ninkov, A., Ripp, C., Gregory, K., Peters, I., & Haustein, S. (2023). A dataset from a survey investigating disciplinary differences in data citation (Version v2) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.7853477 --------------------------- SHARING/ACCESS INFORMATION --------------------------- 1. Licenses/restrictions placed on the data: These data are available under a CC BY 4.0 license 2. Links to publications that cite or use the data: Gregory, K., Ninkov, A., Ripp, C., Peters, I., & Haustein, S. (2022). Surveying practices of data citation and reuse across disciplines. Proceedings of the 26th International Conference on Science and Technology Indicators. International Conference on Science and Technology Indicators, Granada, Spain. https://doi.org/10.5281/ZENODO.6951437 Gregory, K., Ninkov, A., Ripp, C., Roblin, E., Peters, I., & Haustein, S. (2023). Tracing data: A survey investigating disciplinary differences in data citation. Zenodo. https://doi.org/10.5281/zenodo.7555266 --------------------- DATA & FILE OVERVIEW --------------------- VERSION 2 DOI: 1. File List A. Filename: MDCDatacitationReuse2021Codebookv2.pdf Short description: Codebook B. Filename: MDCDataCitationReuse2021surveydatav2.csv Short description: Dataset format in csv C. Filename: MDCDataCitationReuse2021surveydatav2.sav Short description: Dataset format in SPSS VERSION 1 DOI: https://doi.org/10.5281/zenodo.7555363 1. File List A. Filename: MDCDatacitationReuse2021Codebook.pdf Short description: Codebook B. Filename: MDCDataCitationReuse2021surveydata.csv Short description: Dataset format in csv C. Filename: MDCDataCitationReuse2021surveydata.sav Short description: Dataset format in SPSS D. Filename: MDCDataCitationReuseSurvey2021QNR.pdf Short description: Questionnaire 2. Additional related data collected that was not included in the current data package: Open ended questions asked to respondents --------------------------- METHODOLOGICAL INFORMATION --------------------------- 1. Description of methods used for collection/generation of data: The development of the questionnaire (Gregory et al., 2022) was centered around the creation of two main branches of questions for the primary groups of interest in our study: researchers that reuse data (33 questions in total) and researchers that do not reuse data (16 questions in total). The population of interest for this survey consists of researchers from all disciplines and countries, sampled from the corresponding authors of papers indexed in the Web of Science (WoS) between 2016 and 2020. Received 3,632 responses, 2,509 of which were completed, representing a completion rate of 68.6%. Incomplete responses were excluded from the dataset. The final total contains 2,492 complete responses and an uncorrected response rate of 1.57%. Controlling for invalid emails, bounced emails and opt-outs (n=5,201) produced a response rate of 1.62%, similar to surveys using comparable recruitment methods (Gregory et al., 2020). 2. Methods for processing the data: Results were downloaded from SurveyMonkey in CSV format and were prepared for analysis using Excel and SPSS by recoding ordinal and multiple choice questions and by removing missing values. 3. Instrument- or software-specific information needed to interpret the data: The dataset is provided in SPSS format, which requires IBM SPSS Statistics. The dataset is also available in a coded format in CSV. The Codebook is required to interpret to values. ----------------------------------------------------------------- DATA-SPECIFIC INFORMATION FOR: MDCDataCitationReuse2021surveydata ----------------------------------------------------------------- 1. Number of variables: 94 2. Number of cases/rows: 2492 3. Missing data codes: 999 Not asked Refer to MDCDatacitationReuse2021Codebook.pdf for detailed variable information.