Published January 16, 2024 | Version v1.0.0
Dataset Open

Bactria v1.0.0 data release - COI-5P phylogeny of Primates at BIN level

  • 1. ROR icon Naturalis Biodiversity Center

Description

Results produced by the bactria pipeline v1.0.0 based on the following configuration from config.yml:


# Used for parallelization by snakemake
cpu_cores: 8

# Used for scatter/gather processing, corresponds with the number of families within the taxon defined in fasta_filter
nfamilies: 17

# Number of outgroups to include in each family-level analysis. Minimum is 2.
outgroups: 3

# Used for verbosity, see `logging` from stdlib
log_level: 'INFO'

# Either use COI-5P or matK_rbcL
marker: COI-5P

# Substitution model in RAxML CLI syntax
model: GTR+G

# Minimum length to include (this fits COI-5P)
minlength: 600

# Minimum number of sequences per alignment
minseq: 3

# Maximum number of sequences per alignment
maxseq: 12000

# Nucleotide alignments (NT) or aminoacid (AA) for MACSE alignment
datatype: NT

# Choose which records to use from the database for the pipeline. filter_name only takes one name, so does filter level.
# filter levels: class, order, family, genus, all (no filter)
fasta_filter:
filter_level: order
filter_name: Primates

name: phylogeny

dependencies:
- pip:
- -r requirements.txt

file_names:
bold_tsv: resources/BOLD_Public.18-Dec-2023/BOLD_Public.18-Dec-2023.tsv
open_tre: resources/opentree/opentree14.9_tree/labelled_supertree/labelled_supertree.tre
hmm: resources/hmm/COI-5P.hmm
fasta_dir: results/fasta/family

Files

results.zip

Files (730.9 MB)

Name Size Download all
md5:9ca53e26081bc61735281ab68fc2e3d0
730.9 MB Preview Download

Additional details