Published August 25, 2025 | Version v2
Dataset Open

Data for assigning a proxy variable for office worker in open-ended responses on occupation

Contributors

Contact person:

Project leader:

Work package leader:

  • 1. Mälardalen University (MDU)
  • 2. Karolinska Institutet
  • 3. ROR icon Linköping University

Description

These three datasets contain data and R code to assign a proxy variable for office worker, based on responses to an open-ended question (OEQ) about occupation in Swedish surveys. The R code and proxy variable can be applied to any dataset with Swedish OEQ about occupation; the R code is also adaptable for OEQ in any language, provided there is a standard classification of occupations in that language.

The R code can be found in the dataset Assigning_office_worker_proxy.R, and the proxy variable in the dataset SSYK12_modified.xlsx (and SSYK12_modified.csv).

The dataset Occupation_response.xlsx (and Occupation_response.csv) gives an example of what can be extracted from a Swedish questionnaire with an OEQ about occupation. The dataset can be replaced with optional data as long as it includes two variables named “ID” and “Occupation_swe” (i.e., occupation title given by respondent).

Files

Occupation_response.csv

Files (571.6 kB)

Name Size Download all
md5:5fab26ece1155f104f47baffc4ba856f
3.3 kB Download
md5:be4be96e598c009d16106a4a0b7c02b3
25.7 kB Preview Download
md5:b879b0ade32964e30159fe9c5f5c4a26
23.1 kB Download
md5:b9d8b735330e661618abcfcda461fe46
276.3 kB Preview Download
md5:4348bb4c67eedf3f4e4cc7cd9c81d4de
243.1 kB Download

Additional details

Funding

Knowledge Foundation

Dates

Available
2025-02-19
Published for access
Available
2025-08-25
Updated with .csv files

Software

Repository URL
https://github.com/annti71/SOFCO
Programming language
R