kendavidn/yaounde_serocovpop_shared: Initial release
Creators
- 1. University of Geneva
Description
Release of data and code to reproduce all analyses in the paper "SARS-CoV-2 antibody seroprevalence and associated risk factors in an urban district of Cameroon".
Under the Creative Commons 4.0 attribution license, users of this data are required to cite the associated publication:
Nwosu, K., Fokam, J., Wanda, F. et al. SARS-CoV-2 antibody seroprevalence and associated risk factors in an urban district in Cameroon. Nature Communications 12, 5851 (2021). https://doi.org/10.1038/s41467-021-25946-0
Repository overview
This repository provides all the analysis code for the paper **"SARS-CoV-2 antibody seroprevalence and associated risk factors in an urban district in Cameroon"**.
All tabular, graphical and other analytic outputs in the paper can be reproduced by knitting the `figures_for_paper.Rmd` file in the home directory.
This Rmd file will source analytics scripts in the /scripts folder.
All data referenced in the paper is provided in the /data folder.
All maps used during the study are in the /maps folder.
The study flowchart is in the /plots folder.
The study questionnaire (*study_questionnaire.pdf*) is included in the home directory.
Data description
A minimal, anonymized version of the dataset is provided in this repository.
This is the *yaounde_covid_seroprev_dataset.csv file* in the \\*data* folder.
Each column in the CSV corresponds to a question in the study questionnaire.
The original dataset (exported from KoboCollect) was modified in the following ways:
1\.
Most variables from questionnaire sections 10 to 14, which were not addressed in the seroprevalence paper, are not included.
2\.
The address variable, \`loc_hhld_addr\`, was removed, and replaced with the statement "data removed to protect privacy"
3\.
The date of birth variable, \`dt_birth\` was similarly removed, although the age at survey (in years) remains available.
4\.
Dates of deaths of household members were truncated to the first day of the month of death.
(e.g. 2020-06-30 to 2020-06-01)
5\.
GPS coordinates were de-localized by adding ≤ 0.001 degrees (with the R code \`sample(seq(-0.001, 0.001, by = 0.0001))\`) of random noise to the latitude and longitude, and ≤ 50 metres ( with the R code \`sample( seq(-25, 25, by = 1))\`) of random noise to the altitudes.
Variable names
For naming variables, we use Emily Riederer's controlled vocabulary recommendations.
See here: [<https://emilyriederer.netlify.app/post/column-name-contracts/>]([https://emilyriederer.netlify.app/post/column-name-contracts/){.uri}.](https://emilyriederer.netlify.app/post/column-name-contracts/){.uri}.)
The data dictionary (*data/data_dictionary.xlsx*) contains a spreadsheet which shows which variable name corresponds to which survey question.
To understand what each variable represents, the user is also encouraged to consult the original questionnaire, which is also present in this repository (*study_questionnaire.pdf*).
Package versions and reproducibility
Package versions for this repository are controlled with the `renv` package.
After cloning the repository, the user should run the function `renv::restore()` to install the package versions saves in the *renv.lock* file.
This will ensure that the repository code is fully reproducible and should run without bugs.
Files
kendavidn/yaounde_serocovpop_shared-v1.0.0.zip
Files
(32.1 MB)
Name | Size | Download all |
---|---|---|
md5:4621569df0b3c843485a0955760ed692
|
32.1 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/kendavidn/yaounde_serocovpop_shared/tree/v1.0.0 (URL)