Journal article Open Access

A Replication Package For The Paper "Improved Retrieval of Programming Solutions with Code Examples Using a Multi-featured Score"

Rodrigo F. Silva; Mohammad Masudur Rahman; CARLOS EDUARDO DE CARVALHO DANTAS; Chanchal Roy; Foutse Khomh; Marcelo A. Maia


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/08e2b522-9904-470c-8784-5b005a21a62b/backup2019crar-min.backup"
      }, 
      "checksum": "md5:38045fe56bbb37d3ea0898fec4ea5462", 
      "bucket": "08e2b522-9904-470c-8784-5b005a21a62b", 
      "key": "backup2019crar-min.backup", 
      "type": "backup", 
      "size": 11141927778
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/08e2b522-9904-470c-8784-5b005a21a62b/crar.jar"
      }, 
      "checksum": "md5:ecdf4c1cec9b721bfbe51e170b3aa4d3", 
      "bucket": "08e2b522-9904-470c-8784-5b005a21a62b", 
      "key": "crar.jar", 
      "type": "jar", 
      "size": 694716137
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/08e2b522-9904-470c-8784-5b005a21a62b/data.zip"
      }, 
      "checksum": "md5:3c249fd0045fc775ef28104f547643c6", 
      "bucket": "08e2b522-9904-470c-8784-5b005a21a62b", 
      "key": "data.zip", 
      "type": "zip", 
      "size": 920411
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/08e2b522-9904-470c-8784-5b005a21a62b/Readme%20-%20Instructions%20for%20reproduction.docx"
      }, 
      "checksum": "md5:63fe6529a8bb7a98a9624e4ac08d29fe", 
      "bucket": "08e2b522-9904-470c-8784-5b005a21a62b", 
      "key": "Readme - Instructions for reproduction.docx", 
      "type": "docx", 
      "size": 9703
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/08e2b522-9904-470c-8784-5b005a21a62b/replication-package-application.txt"
      }, 
      "checksum": "md5:ecdee892bf63fd47f3139416788269e9", 
      "bucket": "08e2b522-9904-470c-8784-5b005a21a62b", 
      "key": "replication-package-application.txt", 
      "type": "txt", 
      "size": 2873
    }
  ], 
  "owners": [
    220395
  ], 
  "doi": "10.5281/zenodo.5115300", 
  "stats": {
    "version_unique_downloads": 49.0, 
    "unique_views": 144.0, 
    "views": 193.0, 
    "version_views": 193.0, 
    "unique_downloads": 49.0, 
    "version_unique_views": 144.0, 
    "volume": 265998921605.0, 
    "version_downloads": 84.0, 
    "downloads": 84.0, 
    "version_volume": 265998921605.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.5115300", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.5115299", 
    "bucket": "https://zenodo.org/api/files/08e2b522-9904-470c-8784-5b005a21a62b", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.5115299.svg", 
    "html": "https://zenodo.org/record/5115300", 
    "latest_html": "https://zenodo.org/record/5115300", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.5115300.svg", 
    "latest": "https://zenodo.org/api/records/5115300"
  }, 
  "conceptdoi": "10.5281/zenodo.5115299", 
  "created": "2021-07-24T17:51:46.699110+00:00", 
  "updated": "2021-08-03T20:14:59.368812+00:00", 
  "conceptrecid": "5115299", 
  "revision": 4, 
  "id": 5115300, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.5115300", 
    "description": "<p>Developers often depend on code search engines to obtain solutions for their programming tasks. However, finding an expected solution that contains code examples along with their explanations is challenging due to several issues. There is a vocabulary mismatch between the search keywords (the query) and the appropriate solutions. The semantic gap may increase for similar bags of words due to antonyms and negation. Moreover, documents retrieved by search engines might not contain both code examples and their explanations. We therefore propose CRAR (Crowd Answer Recommender) to circumvent those issues, aiming to improve the retrieval of relevant answers from Stack Overflow that contain not only the expected code examples for the given task but also their explanations. Given a programming task, we investigate the effectiveness of combining information retrieval techniques with a set of features to enhance the ranking of important threads (i.e., the units containing questions along with their answers) for the given task and then select relevant answers contained in those threads. The feature set includes semantic features, such as word embeddings and sentence embeddings, for instance, a Convolutional Neural Network (CNN). CRAR also leverages social aspects of Stack Overflow discussions, such as popularity, to select relevant answers for the tasks. Our experimental evaluation shows that the combination of the different features performs better than each one individually. We also compare the retrieval performance with the state-of-the-art CROKAGE (Crowd Knowledge Answer Generator), which is also a system aimed at retrieving relevant answers from Stack Overflow. We show that CRAR outperforms CROKAGE in Mean Reciprocal Rank and Mean Recall with small and medium effect sizes, respectively.</p>", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "title": "A Replication Package For The Paper \"Improved Retrieval of Programming Solutions with Code Examples Using a Multi-featured Score\"", 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "5115299"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "5115300"
          }
        }
      ]
    }, 
    "keywords": [
      "Mining Crowd Knowledge, Stack Overflow, Word Embedding"
    ], 
    "publication_date": "2021-07-24", 
    "creators": [
      {
        "affiliation": "Faculty of Computing, Federal University of Uberl\u00e2ndia", 
        "name": "Rodrigo F. Silva"
      }, 
      {
        "orcid": "0000-0003-3821-5990", 
        "affiliation": "Faculty of Computer Science, Dalhousie University", 
        "name": "Mohammad Masudur Rahman"
      }, 
      {
        "affiliation": "Faculty of Computing, Federal University of Uberl\u00e2ndia", 
        "name": "CARLOS EDUARDO DE CARVALHO DANTAS"
      }, 
      {
        "affiliation": "Department of Computer Science, University of Saskatchewan, Canada", 
        "name": "Chanchal Roy"
      }, 
      {
        "affiliation": "\u00c9cole Polytechnique de Montr\u00e9al, Canada", 
        "name": "Foutse Khomh"
      }, 
      {
        "orcid": "0000-0003-3578-1380", 
        "affiliation": "Faculty of Computing, Federal University of Uberl\u00e2ndia", 
        "name": "Marcelo A. Maia"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "subtype": "article", 
      "type": "publication", 
      "title": "Journal article"
    }, 
    "related_identifiers": [
      {
        "scheme": "url", 
        "identifier": "https://github.com/ISEL-UFU/crar-replication-package", 
        "relation": "isSupplementTo"
      }, 
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.5115299", 
        "relation": "isVersionOf"
      }
    ]
  }
}
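
Each entry in the "files" array above carries a "checksum" of the form "md5:<hex digest>" and a "key" holding the file name. The following is a minimal sketch, not part of the replication package, of how a downloaded copy could be checked against those digests. It assumes the JSON export has already been parsed into a Python dict and that the files were saved under their "key" names in one directory; the function names `md5_of_file` and `verify_record_files` are hypothetical.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_record_files(record, download_dir):
    """Compare local files with the checksums listed in a parsed record.

    Returns a dict mapping each file key to True (digest matches),
    False (digest differs), or None (file not found in download_dir).
    """
    results = {}
    for entry in record["files"]:
        # Checksums in the export look like "md5:38045fe5...".
        algorithm, _, expected = entry["checksum"].partition(":")
        if algorithm != "md5":
            raise ValueError(f"unsupported checksum algorithm: {algorithm}")
        path = Path(download_dir) / entry["key"]
        if not path.is_file():
            results[entry["key"]] = None
        else:
            results[entry["key"]] = md5_of_file(path) == expected.lower()
    return results
```

With the export saved locally (for example as a hypothetical `record.json`), `verify_record_files(json.load(open("record.json")), "downloads")` would report one status per listed file. Chunked reading matters here because the largest file in the record is about 11 GB.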
                  All versions   This version
Views             193            193
Downloads         84             84
Data volume       266.0 GB       266.0 GB
Unique views      144            144
Unique downloads  49             49
