Published July 30, 2021 | Version 0.10.0
Software
Open
vanheeringen-lab/genomepy: Version 0.10.0
Authors/Creators
- 1. Radboud University
- 2. @bhi-kimlab
- 3. @github
Description
[0.10.0] - 2021-07-30
Added
- Annotation class, containing
  - regex filter (`genomepy.Annotation.filter_regex()`)
  - sanitize functions (`genomepy.Annotation.sanitize()`)
    - option to skip filtering and/or matching the annotation to the genome (also on CLI)
  - gene name remapping to various formats (`genomepy.Annotation.map_genes()`)
    - using MyGene.info. Can be queried separately (`genomepy.annotation.query_mygene()`)
  - contig name remapping to other provider formats (`genomepy.Annotation.map_locations()`)
  - get the annotations, or gene locations, as dataframes (`genomepy.Annotation.gtf`, `bed` or `gene_coords()` respectively)
  - get the gene names as a list (`genomepy.Annotation.genes("gtf")` or `genomepy.Annotation.genes("bed")`)
- `genomepy install` now attempts to install the NCBI assembly report
- NCBI provider also indexes the NCBI `genbank_historical` summary
- `genomepy search` now shows if the genome has an annotation
  - this slows down the results a bit
  - to compensate, results are now shown as soon as they are found
  - for UCSC, availability of any of the 4 annotations is shown
- `genomepy annotation` shows the first line(s) of each gene annotation.gtf
- for developers:
  - pre-commit-hooks for linting
  - formatting/linting script `tests/format.sh` (optional argument `lint`)
  - isort & autoflake formatters
- provider module split per provider
  - ProviderBase overhauled, now called Provider
  - regex filtering separated from `Provider.download_genome`
- utils module split into utils, files and online
- now using loguru for pretty logging
- accession `search` improved
  - now finds GCA and GCF accessions
  - now ignores patch levels
- `genomepy install` automatic provider selection refactored
  - `Provider.online_providers` returns a generator (faster!)
- `genomepy install` uses a combined filter function (faster!)
- `genomepy install` only zips annotation files if the genome is zipped (with the bgzip flag) (faster!)
- NCBI provider should be parsed faster (faster!)
- new dependency: pandas
- tests no longer format code
- broken URLs should stall genomepy for less time (`check_url` will immediately return on "Not Found" errors 404/450) (faster!)
- the `Genome` class now passes arguments to the parent `Fasta` class
- the `Genome` class now regenerates the sizes and gaps files similarly to the `Fasta` class and its index (when the genome file is newer) (faster!)
- somewhat more pythonic tests
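The note above that `Provider.online_providers` returns a generator can be illustrated with a small, self-contained sketch. This is not genomepy's implementation: every name below (`PROVIDERS`, `providers_with`, `first_provider`, `CHECKED`, and the provider and genome names) is hypothetical and serves only to show why lazy iteration can be faster. Providers after the first match are never queried.

```python
from typing import Callable, Dict, Iterator, List, Optional

# Hypothetical bookkeeping: records which providers were actually queried.
CHECKED: List[str] = []


def _make_provider(name: str, genomes: set) -> Callable[[str], bool]:
    """Build a stand-in provider check (assumption: real checks hit the network)."""
    def has_genome(genome: str) -> bool:
        CHECKED.append(name)  # stands in for a slow remote lookup
        return genome in genomes
    return has_genome


# Hypothetical providers, queried in insertion order.
PROVIDERS: Dict[str, Callable[[str], bool]] = {
    "ProviderA": _make_provider("ProviderA", {"genome1"}),
    "ProviderB": _make_provider("ProviderB", {"genome1", "genome2"}),
    "ProviderC": _make_provider("ProviderC", {"genome2"}),
}


def providers_with(genome: str) -> Iterator[str]:
    """Lazily yield the names of providers that host `genome`."""
    for name, has_genome in PROVIDERS.items():
        if has_genome(genome):
            yield name


def first_provider(genome: str) -> Optional[str]:
    """Return the first provider hosting `genome`, or None if there is none."""
    # next() pulls a single item, so later providers are not queried.
    return next(providers_with(genome), None)
```

Calling `first_provider("genome1")` in this sketch queries only `ProviderA`. A list-based version would have queried all three providers before returning.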
Files
| Name | Size |
|---|---|
| vanheeringen-lab/genomepy-0.10.0.zip (md5:73e912422cf135c475037a1275023ef9) | 834.5 kB |
Additional details
Related works
- Is supplement to
- https://github.com/vanheeringen-lab/genomepy/tree/0.10.0 (URL)