BY-COVID - WP5 - Baseline Use Case: SARS-CoV-2 vaccine effectiveness assessment

Imputation of missing values

Handling missing data


variable nr_not_imp
age_nm 1211
institutionalized_bl 226620
socecon_lvl_cd 167024
Variable Method Nr_missing Pct_missing Number of imputed values Missing_values Comorbidity comorb_incl Immunestatus imm_incl Core MCAR Perc_miss_lt Perc_miss_lt5 Perc_miss_lt15
person_id No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE FALSE FALSE TRUE TRUE TRUE
age_nm Imputation of missing values (not MCAR) 23146 0.007 21935 TRUE FALSE TRUE FALSE TRUE TRUE FALSE TRUE TRUE TRUE
sex_cd No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE TRUE FALSE TRUE TRUE TRUE
socecon_lvl_cd Imputation of missing values (not MCAR) 167376 0.047 352 TRUE FALSE TRUE FALSE TRUE TRUE FALSE FALSE TRUE TRUE
residence_area_cd No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE TRUE FALSE TRUE TRUE TRUE
country_cd No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE FALSE FALSE TRUE TRUE TRUE
foreign_bl No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE TRUE FALSE TRUE TRUE TRUE
exitus_dt Allow missing values and don’t impute missing values (no core variable) 3452476 0.977 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE FALSE
exitus_bl Allow missing values and don’t impute missing values (no core variable) 5788 0.002 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE TRUE TRUE TRUE
essential_worker_bl No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE TRUE FALSE TRUE TRUE TRUE
institutionalized_bl Imputation of missing values (not MCAR, limit causal interpretation) 226620 0.064 0 TRUE FALSE TRUE FALSE TRUE TRUE FALSE FALSE FALSE TRUE
dose_1_brand_cd Allow missing values and don’t impute missing values (no core variable) 437143 0.124 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
dose_1_dt Allow missing values and don’t impute missing values (no core variable) 437143 0.124 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
dose_2_brand_cd Allow missing values and don’t impute missing values (no core variable) 524915 0.149 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
dose_2_dt Allow missing values and don’t impute missing values (no core variable) 524915 0.149 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
dose_3_brand_cd Allow missing values and don’t impute missing values (no core variable) 1349309 0.382 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE FALSE
dose_3_dt Allow missing values and don’t impute missing values (no core variable) 1349309 0.382 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE FALSE
doses_nm No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE FALSE FALSE TRUE TRUE TRUE
fully_vaccinated_dt Allow missing values and don’t impute missing values (no core variable) 474343 0.134 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
fully_vaccinated_bl No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE FALSE FALSE TRUE TRUE TRUE
vaccination_schedule_cd Allow missing values and don’t impute missing values (no core variable) 437143 0.124 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
confirmed_case_dt Allow missing values and don’t impute missing values (no core variable) 2483034 0.703 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE FALSE
confirmed_case_bl No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE FALSE FALSE TRUE TRUE TRUE
previous_infection_dt Allow missing values and don’t impute missing values (no core variable) 3093838 0.875 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE FALSE
previous_infection_bl No missing values 0 0.000 0 FALSE FALSE TRUE FALSE TRUE FALSE FALSE TRUE TRUE TRUE
test_type_cd Allow missing values and don’t impute missing values (no core variable) 2483034 0.703 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE FALSE
variant_cd Allow missing values and don’t impute missing values (no core variable) 3534537 1.000 0 TRUE FALSE TRUE FALSE TRUE FALSE FALSE FALSE FALSE FALSE
diabetes_bl Allow missing values and don’t impute missing values (comorbidity) 226620 0.064 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
obesity_bl Allow missing values and don’t impute missing values (comorbidity) 226620 0.064 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
heart_failure_bl Allow missing values and don’t impute missing values (comorbidity) 226620 0.064 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
copd_bl Allow missing values and don’t impute missing values (comorbidity) 226620 0.064 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
solid_tumor_without_metastasis_bl Allow missing values and don’t impute missing values (comorbidity) 226620 0.064 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
chronic_kidney_disease_bl Allow missing values and don’t impute missing values (comorbidity) 226620 0.064 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
sickle_cell_disease_bl Allow missing values and don’t impute missing values (comorbidity) 3534537 1.000 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE FALSE
hypertension_bl Allow missing values and don’t impute missing values (comorbidity) 226620 0.064 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
chronic_liver_disease_bl Allow missing values and don’t impute missing values (comorbidity) 226620 0.064 0 TRUE TRUE TRUE FALSE TRUE FALSE FALSE FALSE FALSE TRUE
blood_cancer_bl Allow missing values and don’t impute missing values (immune status) 226620 0.064 0 TRUE FALSE TRUE TRUE TRUE FALSE FALSE FALSE FALSE TRUE
transplanted_bl Allow missing values and don’t impute missing values (immune status) 226620 0.064 0 TRUE FALSE TRUE TRUE TRUE FALSE FALSE FALSE FALSE TRUE
hiv_infection_bl Allow missing values and don’t impute missing values (immune status) 226620 0.064 0 TRUE FALSE TRUE TRUE TRUE FALSE FALSE FALSE FALSE TRUE
primary_immunodeficiency_bl Allow missing values and don’t impute missing values (immune status) 3534537 1.000 0 TRUE FALSE TRUE TRUE TRUE FALSE FALSE FALSE FALSE FALSE
immunosuppression_bl Allow missing values and don’t impute missing values (immune status) 226620 0.064 0 TRUE FALSE TRUE TRUE TRUE FALSE FALSE FALSE FALSE TRUE
pregnancy_bl Exclude core variable as matching variable (more than 15% missing values) 3534537 1.000 0 TRUE FALSE TRUE FALSE TRUE TRUE FALSE FALSE FALSE FALSE

Listwise deletion

flag_listwise_del==TRUE flag_listwise_del==FALSE
304375 3230162