Published September 2, 2024 | Version v1
Other Open

Archival data for Kerblam Project structure

Description

This is a snapshot of the fetched data used to produce the plot in the Kerblam! paper. The code used to generate these files is available on GitHub at https://github.com/MrHedmad/ds_project_structure, at commit `18de8d62bb19e849f6034db1bd159c5af16e6d38`.

The data was generated using Kerblam! version 1.0.0 and the `make_plot.sh` workflow. This record contains:
- `data_cookies.json`: The list of repositories as fetched by the GitHub CLI utility (version 2.55.0) on 2024-07-12 with the command `gh search repos cookiecutter data --sort stars --json stargazersCount,url --visibility public -L 50`;
- `data_generic.json`: The same as above, but fetched with the command `gh search repos research project template --sort stars --json stargazersCount,url --visibility public -L 50`;
- `repos.tar.gz`: The resulting (fetched) repositories;
- `data.json`: The combination of `data_cookies.json` and `data_generic.json`;
- `plot.png` and `plot.pdf`: The plots generated from the information in `results.csv`, as depicted in the Kerblam! paper;
- `results.csv`: The result of the folder and file enumeration of the repositories in the `repos.tar.gz` file, with the following columns:
  - `path`: The full path from the root of the `repos` directory to the file;
  - `count`: The frequency of this specific item in the various repositories;
  - `types`: An enumeration of either "directory" for directories or "file" for files.
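
As an illustration of how the columns described above might be used, the following is a minimal Python sketch that lists the most frequent items of one kind. It assumes that `results.csv` has a header row with exactly the column names `path`, `count` and `types`, and that `count` holds integers; neither detail is confirmed by this record. The function name `top_items` and the sample rows are hypothetical and are not taken from the archived data.

```python
import csv
import io


def top_items(csv_file, kind="directory", n=10):
    """Return the n most frequent (path, count) pairs of the given kind.

    Assumes a header row with the columns `path`, `count` and `types`.
    """
    reader = csv.DictReader(csv_file)
    rows = [
        (row["path"], int(row["count"]))
        for row in reader
        if row["types"] == kind
    ]
    # Highest count first; ties are broken alphabetically by path.
    rows.sort(key=lambda item: (-item[1], item[0]))
    return rows[:n]


# Made-up sample rows, for illustration only (not real results).
sample = io.StringIO(
    "path,count,types\n"
    "src,12,directory\n"
    "README.md,30,file\n"
    "data,9,directory\n"
)
print(top_items(sample))  # [('src', 12), ('data', 9)]
```

To run it on the real file, open it with `open("results.csv", newline="")` and pass the resulting handle to `top_items`.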

Files

Files (367.2 MB)

Checksum and size of each file:
- md5:dd575bf3297c04e88de213b6e26d95ac (10.2 kB)
- md5:5d3b4e4eb4db43cd40775a8205b8d19b (4.3 kB)
- md5:ddb52982def773466054274456622068 (4.1 kB)
- md5:f8f59af7f3f4d110015f3f80b2a6e088 (697.2 kB)
- md5:acaadf450edd7464f7573a67cd4c352d (1.3 MB)
- md5:405958af31187e44f51bcbfc5ca0d730 (365.0 MB)
- md5:be401eaa3387f616da630f19e4ac0050 (222.3 kB)
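
The MD5 checksums listed above can be used to check that a download is intact. The following is a minimal Python sketch using only the standard library. The helper names `md5_of` and `matches` are hypothetical, and which checksum belongs to which file must be taken from the record itself.

```python
import hashlib


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read 1 MiB at a time so large archives are not loaded into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or bare hex."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of(path) == expected_hex
```

For example, `matches("repos.tar.gz", "md5:<checksum listed for that file>")` should return `True` for an uncorrupted download.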