Dataset Open Access

Acanthamoeba castellanii genome assembly and infection by Legionella pneumophila

Cyril Matthey-Doret

Data associated with the publication "Regulation of the Acanthamoeba castellanii genome upon infection by Legionella pneumophila". The record contains four archives, each associated with a GitHub repository, a "shared assets" archive containing processed files used by some repositories, and a supplementary analyses archive. The code from each GitHub repository is embedded in the corresponding tarball, along with input and output data. Analyses are organized as independent Snakemake pipelines, one for each part.

 

For convenient reanalysis, genomes, annotations and merged contact maps used in the publication can be found in the `shared_assets.tar.gz` archive. The infection analysis results are located in the `data/output` folder of `Acastellanii_legionella_infection.tar.gz`.

All archives can be downloaded at the bottom of the page.
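To locate the infection results without unpacking a whole archive, the tarball can be inspected in place. The following is a minimal sketch using only the Python standard library. The helper name `list_members_under` is hypothetical, and because the name of the top-level directory inside each tarball is not stated in this record, the helper matches `data/output` anywhere in a member's path rather than assuming a fixed prefix.

```python
import tarfile
from pathlib import PurePosixPath


def list_members_under(archive_path, subfolder="data/output"):
    """Return names of tarball members whose path contains `subfolder`.

    Hypothetical helper: the internal layout of the archives is an
    assumption, so the subfolder is searched for at any depth.
    """
    wanted = PurePosixPath(subfolder).parts
    matches = []
    with tarfile.open(archive_path, "r:gz") as tar:
        for member in tar:
            parts = PurePosixPath(member.name).parts
            # Look for the subfolder components as a contiguous run of
            # path components, at any position in the member path.
            for start in range(len(parts) - len(wanted) + 1):
                if parts[start:start + len(wanted)] == wanted:
                    matches.append(member.name)
                    break
    return matches
```

For example, `list_members_under("Acastellanii_legionella_infection.tar.gz")` would list the entries found under `data/output`, assuming the archive has been downloaded to the working directory.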

 

Hybrid genome assembly:

Genome assembly pipeline code and output data used for the assembly of 2 A. castellanii strains (Neff and C3) through a hybrid pipeline combining Illumina shotgun, Hi-C and Oxford Nanopore long reads.

GitHub: https://github.com/cmdoret/Acastellanii_hybrid_assembly

Archive: Acastellanii_hybrid_assembly.tar.gz

 

Genome annotation:

Genome annotation pipeline used for functional annotation of A. castellanii strains C3 and Neff, and associated output files.

GitHub: https://github.com/cmdoret/Acastellanii_genome_annotation

Archive: Acastellanii_genome_annotation.tar.gz

 

Genome analyses:

Code and data related to general analyses of genomic properties of A. castellanii strains C3 and Neff.

GitHub: https://github.com/cmdoret/Acastellanii_genome_analysis

Archive: Acastellanii_genome_analysis.tar.gz

 

Infection analyses:

Code and data related to the analysis of structural changes in the A. castellanii C3 genome during infection by L. pneumophila.

GitHub: https://github.com/cmdoret/Acastellanii_legionella_infection

Archive: Acastellanii_legionella_infection.tar.gz
 

Shared assets:

This archive contains processed files (genomes, annotations, Hi-C matrices, differential expression results) which can be useful for reanalysis, and are automatically pulled when executing the pipeline of some repositories.

Archive: shared_assets.tar.gz

 

Supplementary analyses:

Code and data related to short ad-hoc analyses on the genomic location of specific sequences in the genomes of C3 and Neff. The archive contains two subfolders: `telomere_repeats` where we analyse the distribution of TTAGGG subtelomeric repeats throughout the A. castellanii assemblies, and `C3_exclusive_regions` where we visualize the genomic distribution of C3-specific sequences (i.e. absent from Neff) along the C3 assembly.

 

Archive: supp_analyses.tar.gz
 

Files (1.8 GB)

Name                                      Size      md5
Acastellanii_genome_analysis.tar.gz       334.8 MB  bec100203738537d473e831b170f3924
Acastellanii_genome_annotation.tar.gz     249.0 MB  fce9f2abc7418cb60331f7ee8cee758f
Acastellanii_hybrid_assembly.tar.gz       116.4 MB  a8ad10a883eac18eff795372a707424e
Acastellanii_legionella_infection.tar.gz  347.6 MB  50c319c46ca40afcc43b1ee5a9d3210c
shared_assets.tar.gz                      691.0 MB  f9b73075fec00ec70f12639d8179f6c4
supp_analyses.tar.gz                      30.7 MB   73b98972850641171ae3a84aaa6689f3
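
Since the record lists an md5 checksum for each archive, downloads can be checked for corruption before extraction. Below is a minimal sketch using Python's standard `hashlib` module. The checksums are copied from the file list of this record, while the function names `md5sum` and `verify_downloads` and the assumption that all archives sit in one directory are illustrative only.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file list of this record.
EXPECTED_MD5 = {
    "Acastellanii_genome_analysis.tar.gz": "bec100203738537d473e831b170f3924",
    "Acastellanii_genome_annotation.tar.gz": "fce9f2abc7418cb60331f7ee8cee758f",
    "Acastellanii_hybrid_assembly.tar.gz": "a8ad10a883eac18eff795372a707424e",
    "Acastellanii_legionella_infection.tar.gz": "50c319c46ca40afcc43b1ee5a9d3210c",
    "shared_assets.tar.gz": "f9b73075fec00ec70f12639d8179f6c4",
    "supp_analyses.tar.gz": "73b98972850641171ae3a84aaa6689f3",
}


def md5sum(path, chunk_size=1 << 20):
    """Compute the md5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each archive name to True (match), False (mismatch) or None (not found).

    Hypothetical helper: assumes the archives were saved under their
    original names in `directory`.
    """
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = (md5sum(path) == expected) if path.is_file() else None
    return results
```

Calling `verify_downloads("downloads")` returns one entry per archive, so a `False` value flags a file that should be downloaded again. Reading in chunks keeps memory use low for the larger archives.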