Published August 11, 2020 | Version v2
Dataset Open

Polish is quantitatively different on quartzite flakes used on different worked materials [Python analysis]

  • 1. TraCEr, MONREPOS, RGZM
  • 2. School of Life Sciences, University of Bradford
  • 3. Scientifc Computing and Bioinformatics, Institute of Computer Science, Johannes Gutenberg University
  • 4. IPHES

Description

This upload includes the following files related to the Python analysis:
    1.    Raw data as a XLSX table (processing-quartzite-final-2020-04-29.xlsx) is the output from R Script #1 (see https://doi.org/10.5281/zenodo.3979139), even though the filename is slightly different.

Plus, for each analysis (full and restricted datasets), included in the corresponding ZIP archive:
    2.    Jupyter notebooks of the analysis (Classification_RandSplitFeature_Revision_VXX.ipynb) rendered to HTML file (Classification_RandSplitFeature_Revision_VXX.html)
    3.    Dataframe including the artificially filled datapoints
    4.    Output of the analysis as PDF:
    •    Confusion matrices ("CM")
    •    Decision trees on selected features ("DecisionTreeSel")
    •    Balanced accuracy vs. maximum depth ("depth")
    •    Mutual information ("MI")
    •    Pairplots of selected features ("pairplot")
    •    Performance on training sets ("performance")

Instructions to download all files at once are given here: https://doi.org/10.5281/zenodo.4011952

Files

Analysis_full_dataset.zip

Files (3.5 MB)

Name Size Download all
md5:0fa17adabec61117a981b7dd163e64fe
1.9 MB Preview Download
md5:0f4a60261dd7c71a83530b663446b758
1.6 MB Preview Download
md5:e0512bf1464578f7dc576f4bc382fee2
47.6 kB Download

Additional details

Related works

Is supplement to
Journal article: 10.1371/journal.pone.0243295 (DOI)
Is supplemented by
Software documentation: 10.5281/zenodo.4011952 (DOI)