Published May 27, 2022 | Version v1
Poster Open

Reproducing Deep Learning experiments: common challenges and recommendations for improvement

  • 1. University of São Paulo, BR
  • 2. FRB-CESAB, Montpellier, FR
  • 3. The University of Queensland, AU
  • 4. ERINHA (European Research Infrastructure on Highly Pathogenic Agents) AISBL, FR
  • 5. Research-Team ICAR, LIRMM, CNRS, Univ. Montpellier, FR
  • 6. IRD
  • 7. American Geophysical Union, USA
  • 8. LIRMM, CNRS
  • 9. Espace-Dev (IRD-UM-UG-UR-UA-UNC), Montpellier, FR
  • 10. MARBEC, University of Montpellier

Description

In computer science, efforts to improve reproducibility are growing. However, it is still difficult to reproduce the experiments of other scientists, and even more so when it comes to Deep Learning (DL). Making a DL research experiment reproducible requires considerable work to document, verify, and make the system usable. These challenges are compounded by the inherent complexity of DL: the number of (hyper)parameters, the huge amount of data, and the versioning of the learning model, among others. Based on the reproduction of three DL case studies on real-world tasks, such as poverty estimation from remote sensing imagery, we identified common problems in the reproduction. We therefore propose a set of recommendations ('fixes') for the issues a researcher may encounter, in order to improve reproducibility and replicability and to reduce the likelihood of wasted effort. These strategies can serve as a "Swiss army knife" that carries over from DL to more general areas, because they are organized around (i) the quality of the dataset (and associated metadata), (ii) the Deep Learning method, and (iii) the implementation and the infrastructure used.
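As an illustration only, the three groupings named above (dataset, method, implementation and infrastructure) could be captured in a small run manifest saved alongside each experiment. The sketch below is not taken from the poster: the function names, the manifest fields, and the example hyperparameters are all hypothetical, and it uses only the Python standard library.

```python
import hashlib
import json
import platform
import random
import sys


def sha256_of_bytes(data: bytes) -> str:
    """Return a checksum that identifies the exact dataset contents."""
    return hashlib.sha256(data).hexdigest()


def build_run_manifest(hyperparameters: dict, dataset_bytes: bytes, seed: int) -> dict:
    """Collect what a reader would need to repeat the run (hypothetical layout)."""
    return {
        # (i) dataset: a checksum and size pin down which data was used
        "dataset": {
            "sha256": sha256_of_bytes(dataset_bytes),
            "size_bytes": len(dataset_bytes),
        },
        # (ii) method: hyperparameters and the random seed
        "method": {"hyperparameters": hyperparameters, "seed": seed},
        # (iii) implementation and infrastructure: interpreter and platform
        "implementation": {"python": sys.version.split()[0]},
        "infrastructure": {
            "platform": platform.platform(),
            "machine": platform.machine(),
        },
    }


# Example values are placeholders, not settings from the case studies.
seed = 42
random.seed(seed)  # a DL framework would need its own seeding as well
manifest = build_run_manifest(
    {"learning_rate": 1e-3, "epochs": 10},
    b"example,data\n1,2\n",
    seed,
)
print(json.dumps(manifest, indent=2, sort_keys=True))
```

Writing such a manifest next to the trained model is one simple way to keep the dataset, method, and environment descriptions together; a real project would likely also record library versions and hardware details.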

Poster to be presented at the RDA 19th Plenary Meeting, part of International Data Week, 20–23 June 2022, Seoul, South Korea.

Notes

Acknowledgments: The PARSEC project is funded by the Belmont Forum, Collaborative Research Action on Science-Driven e-Infrastructures Innovation. J.M. is grateful for the support from FAPESP (grant 2020/03514-9).

Files

Reproducing Deep Learning experiments common challenges and recommendations for improvement.pdf