Conference paper Open Access

Adoption and Effects of Software Engineering Best Practices in Machine Learning - Supplementary Material

Alex Serban; Koen van der Blom; Holger Hoos; Joost Visser

DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="" xmlns="" xsi:schemaLocation="">
  <identifier identifierType="DOI">10.5281/zenodo.3961914</identifier>
      <creatorName>Alex Serban</creatorName>
      <affiliation>Radboud University</affiliation>
      <creatorName>Koen van der Blom</creatorName>
      <affiliation>Leiden University</affiliation>
      <creatorName>Holger Hoos</creatorName>
      <affiliation>Leiden University</affiliation>
      <creatorName>Joost Visser</creatorName>
      <affiliation>Leiden University</affiliation>
    <title>Adoption and Effects of Software Engineering Best Practices in Machine Learning - Supplementary Material</title>
    <date dateType="Issued">2020-07-15</date>
  <resourceType resourceTypeGeneral="Text">Conference paper</resourceType>
    <alternateIdentifier alternateIdentifierType="url"></alternateIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.3946453</relatedIdentifier>
    <rights rightsURI="">Creative Commons Attribution 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
    <description descriptionType="Abstract">&lt;p&gt;The increasing reliance on applications with machine learning (ML) components calls for mature engineering techniques that ensure these are built in a robust and future-proof manner.&amp;nbsp;We aim to empirically determine the state of the art in how teams develop, deploy and maintain software with ML&amp;nbsp;components.&amp;nbsp;We mined both academic and grey literature and identified 29 engineering best practices for ML&amp;nbsp;applications.&amp;nbsp;We conducted a survey among 313&amp;nbsp;practitioners to determine the degree of adoption for these practices and to validate their perceived effects.&amp;nbsp;Using the survey responses, we quantified practice adoption, differentiated along demographic characteristics, such as geography or team size.&amp;nbsp;Our findings indicate, for example, that larger teams tend to adopt more practices, and that traditional software engineering practices tend to have lower adoption than ML&amp;nbsp;specific practices.&amp;nbsp;Also, the statistical models can accurately predict perceived effects such as agility, software quality and traceability, from the degree of adoption for specific sets of practices.&amp;nbsp;Combining practice adoption rates with practice importance, as revealed by statistical models, we identify practices that are important but have low adoption and practices that are widely adopted but are less important for the effects we studied.&amp;nbsp;Overall, our survey and response analysis provide a quantitative basis for assessment and step-wise improvement of practice adoption by ML&amp;nbsp;teams.&lt;/p&gt;


&lt;p&gt;The supplementary material consist of the questionnaire (Questionnaire.pdf), the data (csvs/exports-clean.csv) and the code used to process it. The file contains all the information&amp;nbsp;for&amp;nbsp;running the scripts.&lt;/p&gt;</description>
All versions This version
Views 18979
Downloads 97
Data volume 4.0 MB3.1 MB
Unique views 12973
Unique downloads 97


Cite as