Dataset Open Access

A Dataset of Pull Requests and A Trained Random Forest Model for predicting Pull Request Acceptance

Tapajit Dey; Audris Mockus


JSON-LD (schema.org) Export

{
  "description": "<p>A Curated Dataset of 470,925 pull requests for 3349 popular NPM packages, description of the variables, code snippet for creating a Random Forest model for predicting pull request acceptance, and a pre-trained&nbsp;&nbsp;Random Forest model (in R). The dataset is for the ESEM-2020 paper: &quot;Impact of Technical and Social Factors on Pull Request Quality for the NPM Ecosystem&quot; (<a href=\"https://arxiv.org/abs/2007.04816\">https://arxiv.org/abs/2007.04816</a>)</p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "University of Tennessee", 
      "@id": "https://orcid.org/0000-0002-1379-8539", 
      "@type": "Person", 
      "name": "Tapajit Dey"
    }, 
    {
      "affiliation": "University of Tennessee", 
      "@type": "Person", 
      "name": "Audris Mockus"
    }
  ], 
  "url": "https://zenodo.org/record/3858046", 
  "datePublished": "2020-05-26", 
  "keywords": [
    "Pull Request", 
    "Random Forest"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/4e849a51-1a85-4d9e-b87d-55c087e3a82d/Curated_Pull_Request_Data.csv", 
      "encodingFormat": "csv", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/4e849a51-1a85-4d9e-b87d-55c087e3a82d/description.pdf", 
      "encodingFormat": "pdf", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/4e849a51-1a85-4d9e-b87d-55c087e3a82d/PRMODEL.Rdata", 
      "encodingFormat": "rdata", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/4e849a51-1a85-4d9e-b87d-55c087e3a82d/snippet.R", 
      "encodingFormat": "r", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.3858046", 
  "@id": "https://doi.org/10.5281/zenodo.3858046", 
  "@type": "Dataset", 
  "name": "A Dataset of Pull Requests and A Trained Random Forest Model for predicting Pull Request Acceptance"
}
57
54
views
downloads
All versions This version
Views 5754
Downloads 5453
Data volume 2.3 GB2.3 GB
Unique views 4343
Unique downloads 3131

Share

Cite as