Published June 8, 2012 | Version v1
Presentation Open

Data coding and harmonization: How DataCoH and Charmstats are transforming social science data

  • 1. GESIS - Leibniz Institute for the Social Sciences

Description

Comparative social researchers often face the challenge of making key theoretical concepts comparable across nations and/or time. One example is the socio-demographic variable 'education'. To operationalize 'education', researchers must review multiple educational systems across nations and/or changing educational structures within one nation across time. Further, researchers have multiple ways to recode education into a harmonized variable, including (inter alia) the Hoffmeyer-Zlotnik/Warner matrix, the CASMIN education scheme, the International Standard Classification of Education (ISCED), or a harmonized variable provided by the dataset itself. GESIS is developing two electronic resources to assist social researchers. The website DataCoH (Data Coding and Harmonization) will provide a centralized online library of data coding and harmonization for existing variables to increase transparency and variable replication. DataCoH will initially contain socio-demographic variables used across the social sciences and then expand to discipline-specific variables. The software program Charmstats (Coding and Harmonizing Statistics) will provide a structured approach to data harmonization by allowing researchers to: 1) download harmonization protocols; 2) document variable coding and harmonization processes; 3) access variables from existing datasets for harmonization; and 4) create harmonization protocols for publication and citation. This paper explains DataCoH and Charmstats and demonstrates how they work.
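
To make the idea of a documented recode concrete, the following is a minimal Python sketch of what a harmonization protocol could look like as a data structure. It is not the Charmstats or DataCoH interface; the class name `HarmonizationProtocol`, the variable name `educ_national`, and all source and target codes are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class HarmonizationProtocol:
    """A documented mapping from source codes to harmonized target codes.

    Hypothetical structure for illustration; not the Charmstats data model.
    """
    source_variable: str
    target_scheme: str
    mapping: Dict[int, int]
    notes: Dict[int, str] = field(default_factory=dict)

    def recode(self, values: List[int]) -> List[Optional[int]]:
        """Return harmonized codes; unmapped source codes become None."""
        return [self.mapping.get(v) for v in values]


# Made-up source codes collapsed into a made-up three-level target scheme.
protocol = HarmonizationProtocol(
    source_variable="educ_national",
    target_scheme="illustrative 3-level scheme",
    mapping={1: 1, 2: 1, 3: 2, 4: 3},
    notes={2: "Merged with code 1 in this made-up example."},
)

# Code 9 has no documented mapping, so it is returned as None
# rather than being silently assigned to a category.
harmonized = protocol.recode([1, 3, 4, 9])
print(harmonized)  # [1, 2, 3, None]
```

Keeping the mapping and its notes together in one object is one way to make a recode transparent and replicable: the same record that transforms the data also documents each coding decision.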

Files

2012_g1_winters_etal.pdf
