Published November 16, 2021 | Version Version presented at the Computational Intelligence Methods for Bioinformatics and Biostatistics 2021 (CIBB2021)
Conference paper Open

The need of standardised metadata to encode causal relationships: Towards safer data-driven machine learning biological solutions

Description

In this paper, we discuss the importance of considering causal relations in the development of machine learning solutions to prevent factors hampering the robustness and generalisation capacity of the models, such as induced biases. This issue often arises when the algorithm decision is affected by confounding factors. In this work, we argue that the integration of causal relationships can identify potential confounders. We call for standardised meta-information practices as a crucial step for proper machine learning solutions development, validation, and data sharing. Such practices include detailing the dataset generation process, aiming for automatic integration of causal relationships. 

 

Files

Authors__guidelines_for_CIBB_short_papers (1).pdf

Files (360.3 kB)

Name Size Download all
md5:e79ab151c2eae88041ddf91b58ca231d
360.3 kB Preview Download