Acanthamoeba castellanii genome assembly and infection by Legionella pneumophila
Description
Data associated with the publication "Regulation of the Acanthamoeba castellanii genome upon infection by Legionella pneumophila". The record contains four archives, each associated with a GitHub repository, a "shared assets" archive containing processed files used by some of the repositories, and a supplementary analyses archive. The code from each GitHub repository is embedded in the corresponding tarball, along with input and output data. Analyses are organized as independent Snakemake pipelines, one per part.
For convenient reanalysis, the genomes, annotations and merged contact maps used in the publication can be found in the `shared_assets.tar.gz` archive. The infection analysis results are located in the `data/output` folder of `Acastellanii_legionella_infection.tar.gz`.
All archives can be downloaded at the bottom of the page.
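Once an archive has been downloaded, its contents can be inspected without unpacking it. The following is a minimal sketch using Python's standard `tarfile` module; the helper name `members_under` is hypothetical, and the assumption that `data/output` may sit below a top-level directory inside the tarball has not been checked against the actual archives.

```python
import tarfile
from pathlib import PurePosixPath


def members_under(archive_path, folder="data/output"):
    """Return the paths of regular files stored under `folder` in a tar archive.

    The folder may sit below a top-level directory inside the tarball
    (an assumption), so the match is done on consecutive path components
    rather than on a fixed prefix.
    """
    wanted = PurePosixPath(folder).parts
    hits = []
    with tarfile.open(archive_path, "r:*") as tar:
        for member in tar:
            if not member.isfile():
                continue
            parts = PurePosixPath(member.name).parts
            # Look for `wanted` as a contiguous run of components; the file
            # name itself must come after the run, hence the strict bound.
            for i in range(len(parts) - len(wanted)):
                if parts[i:i + len(wanted)] == wanted:
                    hits.append(member.name)
                    break
    return hits
```

For example, `members_under("Acastellanii_legionella_infection.tar.gz")` would list the files found in any `data/output` folder of that archive, assuming it has been downloaded to the working directory.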
Hybrid genome assembly:
Pipeline code and output data for the genome assembly of two A. castellanii strains (Neff and C3), using a hybrid approach that combines Illumina shotgun, Hi-C and Oxford Nanopore long reads.
GitHub: https://github.com/cmdoret/Acastellanii_hybrid_assembly
Archive: Acastellanii_hybrid_assembly.tar.gz
Genome annotation:
Genome annotation pipeline used for functional annotation of A. castellanii strains C3 and Neff, and associated output files.
GitHub: https://github.com/cmdoret/Acastellanii_genome_annotation
Archive: Acastellanii_genome_annotation.tar.gz
Genome analyses:
Code and data related to general analyses of genomic properties of A. castellanii strains C3 and Neff.
GitHub: https://github.com/cmdoret/Acastellanii_genome_analysis
Archive: Acastellanii_genome_analysis.tar.gz
Infection analyses:
Code and data related to the analysis of structural changes in the A. castellanii C3 genome during infection by L. pneumophila.
GitHub: https://github.com/cmdoret/Acastellanii_legionella_infection
Archive: Acastellanii_legionella_infection.tar.gz
Shared assets:
This archive contains processed files (genomes, annotations, Hi-C matrices, differential expression results) which can be useful for reanalysis, and are automatically pulled when executing the pipeline of some repositories.
Archive: shared_assets.tar.gz
Supp. analyses:
Code and data related to short ad-hoc analyses on the genomic location of specific sequences in the genomes of C3 and Neff. The archive contains two subfolders: `telomere_repeats` where we analyse the distribution of TTAGGG subtelomeric repeats throughout the A. castellanii assemblies, and `C3_exclusive_regions` where we visualize the genomic distribution of C3-specific sequences (i.e. absent from Neff) along the C3 assembly.
Archive: supp_analyses.tar.gz
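To illustrate what a repeat-distribution analysis of this kind involves, here is a minimal sketch that counts TTAGGG occurrences (on either strand) in fixed-size windows along a sequence. It is not the code from the `telomere_repeats` subfolder: the function names, the window size and the choice to count both strands are assumptions made for illustration only.

```python
def revcomp(seq):
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def motif_counts(sequence, motif="TTAGGG", window=1000):
    """Count motif occurrences per window, on both strands.

    Each occurrence is assigned to the window containing its start position.
    The window size is an arbitrary illustrative default.
    """
    sequence = sequence.upper()
    # Ceiling division, with at least one window for an empty sequence.
    n_windows = max(1, -(-len(sequence) // window))
    counts = [0] * n_windows
    # A set avoids double counting when the motif is its own reverse complement.
    for pattern in {motif, revcomp(motif)}:
        start = sequence.find(pattern)
        while start != -1:
            counts[start // window] += 1
            start = sequence.find(pattern, start + 1)
    return counts
```

Applied to each contig of an assembly, the resulting per-window counts could then be plotted along the chromosome to show where the repeats cluster.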
Files
(1.8 GB)
Checksum | Size |
---|---|
md5:bec100203738537d473e831b170f3924 | 334.8 MB |
md5:fce9f2abc7418cb60331f7ee8cee758f | 249.0 MB |
md5:a8ad10a883eac18eff795372a707424e | 116.4 MB |
md5:50c319c46ca40afcc43b1ee5a9d3210c | 347.6 MB |
md5:f9b73075fec00ec70f12639d8179f6c4 | 691.0 MB |
md5:73b98972850641171ae3a84aaa6689f3 | 30.7 MB |
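The MD5 checksums listed for the files can be used to confirm that a download completed without corruption. Below is a minimal sketch using Python's standard `hashlib`; the helper names are hypothetical. Because the listing above does not pair each checksum with a file name, the check only tests whether a file's digest matches any of the published values.

```python
import hashlib

# MD5 digests as listed in the file table of this record.
PUBLISHED_MD5 = {
    "bec100203738537d473e831b170f3924",
    "fce9f2abc7418cb60331f7ee8cee758f",
    "a8ad10a883eac18eff795372a707424e",
    "50c319c46ca40afcc43b1ee5a9d3210c",
    "f9b73075fec00ec70f12639d8179f6c4",
    "73b98972850641171ae3a84aaa6689f3",
}


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path):
    """Return True if the file's digest is one of the published checksums."""
    return md5_of(path) in PUBLISHED_MD5
```

For example, `matches_record("shared_assets.tar.gz")` should return `True` for an intact download of that archive; reading in chunks keeps memory use low for the larger files.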