Assessing Computational Notebook Understandability through Code Metrics Analysis
Creators
Description
Computational notebooks have become the primary coding environment for data scientists. Despite their popularity, research on the code quality of these notebooks is still in its infancy, and the code shared in them is often of poor quality. Given the importance of maintenance and reusability, it is crucial to study how notebook code is comprehended and to identify the notebook metrics that play a significant role in comprehension. The level of code comprehension is a qualitative variable closely tied to users' opinions about the code. Previous studies have typically measured it in one of two ways: administering questionnaires over a small number of code snippets, or relying solely on repository metadata such as the number of likes and user votes a project receives.

In our approach, we measure the understandability of notebook code by leveraging user comments in a software repository. As a case study, we started with 248,761 Kaggle Jupyter notebooks introduced in previous studies, together with their metadata. To identify user comments related to code comprehension, we used a fine-tuned DistilBERT transformer. We then defined a *user-comment-based criterion* for code understandability that combines the number of comprehension-related comments, the upvotes on those comments, the notebook's total views, and the notebook's total upvotes. This criterion proved more effective than the alternative approaches, so we use it as the ground truth for evaluating the code comprehension of our notebook set.

In addition, we collected 34 metrics, categorized as script-based and notebook-based, for 10,857 notebooks and used them as features in our dataset. A Random Forest classifier trained on these features predicted code comprehension levels with 85% accuracy, identifying developer expertise and markdown-based metrics as key factors.
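As an illustration of the comment-classification step, the sketch below applies a fine-tuned DistilBERT model to user comments with the Hugging Face transformers library. The checkpoint name `comprehension-distilbert` and the example comments are hypothetical placeholders, not artifacts shipped in this dataset.

```python
# Minimal sketch: flag comments related to code comprehension with a
# fine-tuned DistilBERT binary classifier (label 1 = comprehension-related).
# "comprehension-distilbert" is a hypothetical fine-tuned checkpoint path.
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="comprehension-distilbert",
    tokenizer="comprehension-distilbert",
)

comments = [
    "The step-by-step markdown made this code really easy to follow!",
    "Thanks for sharing.",
]

# The pipeline returns one {"label": ..., "score": ...} dict per comment.
for comment, result in zip(comments, classifier(comments)):
    print(f"{result['label']} ({result['score']:.2f}): {comment}")
```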
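Similarly, here is a minimal sketch of the prediction step, assuming the 34 script- and notebook-based metrics and a binary understandability label (derived from the user-comment-based criterion) are stored in a flat table. The file name, column names, and hyperparameters are illustrative assumptions, not the exact setup used in the study.

```python
# Minimal sketch: predict notebook understandability from code metrics
# with a Random Forest. "notebook_metrics.csv" and the "understandable"
# column are hypothetical names for the feature table and label.
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

df = pd.read_csv("notebook_metrics.csv")
X = df.drop(columns=["understandable"])  # the 34 metric features
y = df["understandable"]                 # label from the comment-based criterion

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

model = RandomForestClassifier(n_estimators=300, random_state=42)
model.fit(X_train, y_train)
print("accuracy:", accuracy_score(y_test, model.predict(X_test)))

# Feature importances hint at which metrics drive comprehension; the study
# reports developer expertise and markdown-based metrics as key factors.
top = sorted(zip(X.columns, model.feature_importances_), key=lambda t: -t[1])
for name, imp in top[:5]:
    print(f"{name}: {imp:.3f}")
```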
Files
| Name | Size | MD5 |
|---|---|---|
| EMSE2023_CODE_DATA.zip | 21.8 MB | d580ccca6ae47a77edb7d373ad0866fd |