Published October 10, 2022 | Version 1.1
Dataset Open

FunTaxDB database for uBin software

  • 1. University Duisburg-Essen, Germany
  • 2. Tim Burg, Im Acker 59, 56072 Koblenz, Germany

Description

This dataset contains the FunTaxDB database used by the uBin software, which is designed to facilitate the curation of metagenome-assembled genomes (MAGs). A preprint describing uBin is available at https://www.biorxiv.org/content/10.1101/2020.07.15.204776v2 . FunTaxDB is based on the UniRef100 database, with taxonomic strings added to the FASTA headers. Entries without a taxonomic affiliation were additionally searched with BLAST against the NCBI-nr database; those with a 100% similarity match received the taxonomic affiliation of the matched NCBI-nr record. Special characters within the taxonomic levels, which are separated by ';', were replaced by underscores to make regex-based processing less error-prone.
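The underscore replacement described above can be illustrated with a minimal sketch. It is not the code used to build FunTaxDB: the function name `sanitize_taxonomy`, the example string, and the definition of "special character" (anything other than ASCII letters, digits, and underscore) are assumptions made for illustration, since the description does not list the exact characters that were replaced.

```python
import re

# Assumption: a "special character" is anything other than ASCII letters,
# digits, and underscore. The dataset description does not specify the set.
_SPECIAL = re.compile(r"[^A-Za-z0-9_]")


def sanitize_taxonomy(tax_string: str) -> str:
    """Replace special characters inside each ';'-separated level with '_'.

    The ';' separators themselves are preserved, so the string can still be
    split into taxonomic levels afterwards.
    """
    levels = tax_string.split(";")
    return ";".join(_SPECIAL.sub("_", level.strip()) for level in levels)


# Made-up taxonomic string, used only to show the effect of the replacement.
example = "Bacteria;Candidatus Patescibacteria;Parcubacteria (OD1)"
print(sanitize_taxonomy(example))
# prints: Bacteria;Candidatus_Patescibacteria;Parcubacteria__OD1_
```

Because the only remaining non-word character is the ';' separator, a simple pattern such as `[^;]+` is enough to pick out individual levels.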

Notes

This study was funded by the Ministerium für Kultur und Wissenschaft des Landes Nordrhein-Westfalen ("Nachwuchsgruppe Dr. Alexander Probst").

Files

Files (55.8 GB)

Name Size
md5:9d1b603d59f3278a103640a3a5834bc9
55.8 GB
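Given the size of the download, it may be worth checking the file against the MD5 checksum listed above. The following is a minimal sketch using only the Python standard library; the function name `md5_of_file` and the chunk size are illustrative choices, and the local file name is left to the user because it is not shown on this page.

```python
import hashlib

# MD5 checksum as listed in the Files section of this record.
EXPECTED_MD5 = "9d1b603d59f3278a103640a3a5834bc9"


def md5_of_file(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use constant, which matters for a
    file of tens of gigabytes.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

To verify a download, compare `md5_of_file(path_to_downloaded_file)` with `EXPECTED_MD5`; a mismatch suggests an incomplete or corrupted transfer.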

Additional details

Related works

Is cited by
Software: https://github.com/ProbstLab/uBin-helperscripts (URL)
Is compiled by
Preprint: 10.1101/2020.07.15.204776v2 (Handle)