Published April 21, 2026 | Version 1.0
Dataset Open

Simulated dataset for sparse sampling filtering

Authors/Creators

  • 1. University of North Carolina at Charlotte

Description

Dataset title:
Simulation inputs and results for Plasmodium falciparum genomic data analyses

Overview:
This dataset contains input files and simulation results used for analyses of Plasmodium falciparum genomic data under different spatial pattern settings, together with results from an application based on real data from Cambodia.

The repository includes five archives: simulation input data for a 3-hex spatial pattern and for a random spatial pattern, the simulation results generated from each of these two patterns, and the simulation results for the Cambodia real-data application.

The target parasite in this dataset is Plasmodium falciparum (P. falciparum).

Contents:

  1. 3-hex_pattern_simulation.zip
    Description: Input data for simulations based on a 3-hex spatial pattern using simulated P. falciparum genomic data.
    MD5: 16047297ea79519adf17f3a49ead521e
    Size: 426.06 MB
  2. random_pattern_simulation.zip
    Description: Input data for simulations based on a random spatial pattern using simulated P. falciparum genomic data.
    MD5: 368e70b4d8faf7cb09ff7de9c0680b22
    Size: 426.07 MB
  3. 3-hex_simulation_results.zip
    Description: Simulation results generated from the 3-hex spatial pattern using simulated P. falciparum genomic data.
    MD5: 2a7f411a45acd41aa64347b6de01a428
    Size: 24.13 MB
  4. random_pattern_simulation_results.zip
    Description: Simulation results generated from the random spatial pattern using simulated P. falciparum genomic data.
    MD5: 9d30e1612f6e7b0883b514b74ff0b51d
    Size: 800.23 MB
  5. Cambodia_simulation_results.zip
    Description: Simulation results for the real-data application based on P. falciparum genomic data from Cambodia.
    MD5: ea565bf803a743377d4e5aa9479add11
    Size: 1.29 MB

File organization:
The dataset is organized into two main components. The first component contains simulation input data used to generate scenarios under different spatial patterns. The second component contains the corresponding simulation outputs, including results from both simulated datasets and the Cambodia real-data application.

Suggested use:
These files are intended to support reproduction of the simulation experiments, comparison of results across different spatial pattern settings, and analyses involving P. falciparum parasite genomic data.

Data integrity:
Users can verify file integrity using the MD5 checksums provided above after downloading the files.
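
As one possible way to run this check, the sketch below computes each archive's MD5 digest with Python's standard hashlib module and compares it with the checksums listed under Contents. The function names (md5_of_file, verify_downloads) and the assumption that the archives sit in a single download directory under their original file names are illustrative choices, not part of the dataset.

```python
import hashlib
from pathlib import Path

# MD5 checksums copied from the Contents list of this record.
EXPECTED_MD5 = {
    "3-hex_pattern_simulation.zip": "16047297ea79519adf17f3a49ead521e",
    "random_pattern_simulation.zip": "368e70b4d8faf7cb09ff7de9c0680b22",
    "3-hex_simulation_results.zip": "2a7f411a45acd41aa64347b6de01a428",
    "random_pattern_simulation_results.zip": "9d30e1612f6e7b0883b514b74ff0b51d",
    "Cambodia_simulation_results.zip": "ea565bf803a743377d4e5aa9479add11",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(download_dir="."):
    """Map each archive name to True (match), False (mismatch), or None (missing).

    Assumes (hypothetically) that all archives were saved to download_dir
    under the file names shown in the Contents list.
    """
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(download_dir) / name
        results[name] = md5_of_file(path) == expected if path.is_file() else None
    return results


if __name__ == "__main__":
    for name, ok in verify_downloads().items():
        status = {True: "OK", False: "MISMATCH", None: "not found"}[ok]
        print(f"{name}: {status}")
```

Reading in chunks keeps memory use low for the larger archives (up to about 800 MB). A file reported as MISMATCH is most likely an incomplete or corrupted download and should be fetched again.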

Related study:
Please cite the associated publication if you use this dataset in your work.

Files

Files (1.7 GB)

Name | Size | MD5
3-hex_pattern_simulation.zip | 426.1 MB | md5:16047297ea79519adf17f3a49ead521e
3-hex_simulation_results.zip | 24.1 MB | md5:2a7f411a45acd41aa64347b6de01a428
Cambodia_simulation_results.zip | 1.3 MB | md5:ea565bf803a743377d4e5aa9479add11
random_pattern_simulation.zip | 426.1 MB | md5:368e70b4d8faf7cb09ff7de9c0680b22
random_pattern_simulation_results.zip | 800.2 MB | md5:9d30e1612f6e7b0883b514b74ff0b51d