Published March 19, 2020 | Version v1
Preprint Open

On the convergence of epidemiology, biostatistics, and data science

  • 1. Drexel University Dornsife School of Public Health

Description

Epidemiology, biostatistics, and data science are broad disciplines that incorporate a variety of substantive areas. Common amongst them is a focus on quantitative approaches for solving intricate problems. When the substantive area is health and healthcare, the overlap is further cemented. Researchers in these disciplines are fluent in statistics, data management and analysis, and health and medicine, to name but a few competencies. Yet there are important and perhaps mutually exclusive attributes of these fields that warrant a tighter integration. For example, epidemiologists receive substantial training in the science of study design, measurement, and the art of causal inference. Biostatisticians are well versed in the theory and application of methodological techniques, as well as the design and conduct of public health research. Data scientists receive equivalently rigorous training in computational and visualization approaches for high-dimensional data. Compared to data scientists, epidemiologists and biostatisticians may have less expertise in computer science and informatics, while data scientists may benefit from a working knowledge of study design and causal inference. Collaboration and cross-training offer the opportunity to share and learn the constructs, frameworks, theories, and methods of these fields, with the goal of offering fresh and innovative perspectives for tackling challenging problems in health and healthcare. In this article, we first describe the evolution of these fields, focusing on their convergence in the era of electronic health data, notably electronic medical records (EMRs). Next, we present how a collaborative team may design, analyze, and implement an EMR-based study. Finally, we review the curricula at leading epidemiology, biostatistics, and data science training programs, identifying gaps and offering suggestions for the fields moving forward.
