Dataset Open Access

JTeC: A Large Collection of Java Test Classes forTest Code Analysis and Processing

Corò, Federico; Verdecchia, Roberto; Cruciani, Emilio; Miranda, Breno; Bertolino, Antonia


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="DOI">10.5281/zenodo.3711509</identifier>
  <creators>
    <creator>
      <creatorName>Corò, Federico</creatorName>
      <givenName>Federico</givenName>
      <familyName>Corò</familyName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-7321-3467</nameIdentifier>
      <affiliation>Gran Sasso Science Institute</affiliation>
    </creator>
    <creator>
      <creatorName>Verdecchia, Roberto</creatorName>
      <givenName>Roberto</givenName>
      <familyName>Verdecchia</familyName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-9206-6637</nameIdentifier>
      <affiliation>Gran Sasso Science Institute &amp; Vrije Universiteit Amsterdam</affiliation>
    </creator>
    <creator>
      <creatorName>Cruciani, Emilio</creatorName>
      <givenName>Emilio</givenName>
      <familyName>Cruciani</familyName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-4744-5635</nameIdentifier>
      <affiliation>Gran Sasso Science Institute</affiliation>
    </creator>
    <creator>
      <creatorName>Miranda, Breno</creatorName>
      <givenName>Breno</givenName>
      <familyName>Miranda</familyName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-9608-9393</nameIdentifier>
      <affiliation>Federal University of Pernambuco</affiliation>
    </creator>
    <creator>
      <creatorName>Bertolino, Antonia</creatorName>
      <givenName>Antonia</givenName>
      <familyName>Bertolino</familyName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-8749-1356</nameIdentifier>
      <affiliation>Consiglio Nazionale delle Ricerche</affiliation>
    </creator>
  </creators>
  <titles>
    <title>JTeC: A Large Collection of Java Test Classes forTest Code Analysis and Processing</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2019</publicationYear>
  <subjects>
    <subject>Software Testing, GitHub, Test Suite, Large Scale</subject>
  </subjects>
  <dates>
    <date dateType="Issued">2019-05-19</date>
  </dates>
  <language>en</language>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/3711509</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.2558713</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/msr</relatedIdentifier>
  </relatedIdentifiers>
  <version>2.0</version>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;The recent push towards test automation and test-driven development continues to scale up the dimensions of test code that needs to be maintained, analysed, and processed side-by-side with production code. As a consequence, on the one side regression testing techniques, e.g., for test suite prioritization or test case selection, capable to handle such large-scale test suites become indispensable; on the other side, as test code exposes own characteristics, specific techniques for its analysis and refactoring are actively sought. We present JTeC, a large-scale dataset of test cases that researchers can use for benchmarking the above techniques or any other type of tool expressly targeting test code. JTeC collects more than 2.5M+ test classes belonging to 31K+ GitHub projects and summing up to more than 430 Million LOCs of ready-to-use real-world test code.&lt;/p&gt;</description>
    <description descriptionType="Other">Companion page for the JTeC dataset at https://github.com/JTeCDataset/JTeC</description>
  </descriptions>
</resource>
929
1,606
views
downloads
All versions This version
Views 929148
Downloads 1,606211
Data volume 155.9 GB76.0 GB
Unique views 800134
Unique downloads 1,270138

Share

Cite as