Software Open Access
Diers, Erik; Pacharra, Marlene; Merz, Christian J.; Ernst, Thomas M.; Otto, Tobias
Subject Code Generator
Intended use and advice on use
The Subject Generator helps investigators and subjects to generate SFB 1280 compliant subject codes (see SFB 1280 research data management policy). The subject code represents a pseudonymization and not an anonymization.
Please be aware, that any subject code which is based on indirect identification features, e.g. birth name of one's mother, can be decrypted with more or less effort. In accordance with the Ethics Committee of the German Psychological Society (DGPs), the SFB 1280 recommends only the time-limited and purpose-limited use of the generated subject codes. If long-term storage is required, data should be stored under anonymous codes. Under no circumstances should data be made publicly available using the generated subject codes.
The 11-digit SFB 1280 subject code was developed based on the 8-digit subject code originally recommended by the DGPs Ethics Committee (now DGPs "variant 1"), which generated too many duplicate subject codes in the SFB's large data sets. The 11-digit code adheres to design principles of the original code (i.e., letters of the subject's mother's maiden name are used), but results in less duplicates. It is now included as "variant 2" in the recommendations for personal code words by the DGPs Ethics Committee.
In a large test dataset with 5146 artificially created names for parents of only children and a random number for the birthday the “variant 1” code with 8-digits created 2.14% duplicates whereas the “variant 2” code with 11-digits created only 0.071% duplicates.
Instructions for generating the subject code were originally created by the SFB's INF projects and are available also as a paper pencil version. A reference guide with translations in English is also available here.
About this project
This project was developed to enable a faster and more reliable way to generate the code than the paper-pencil version. The official web version of this code can be found on the SFB1280 website and on the RUB GitLab. The Subject Code Generator is available in English and in German.
About the project structure
This project ships many, maybe on first glance, confusing looking files. Here is a quick rundown on what which files do:
|All versions||This version|
|Data volume||2.6 MB||2.6 MB|