Published September 30, 2021 | Version v7
Dataset Open

Acanthamoeba castellanii genome assembly and infection by Legionella pneumophila

  • 1. Institut Pasteur, Department of Genomes and Genetics

Description

Data associated with the publication "Regulation of the Acanthamoeba castellanii genome upon infection by Legionella pneumophila". The record contains 4 archives, each associated with a github repository, and a "shared assets" archive, which contains processed files used by some repositories. The code from github repositories is embedded in each tarball, along with input and output data. Analyses are organized as independent snakemake pipelines for each part.

 

For convenient reanalysis, genomes, annotations and merged contact maps used in the publication can be found in the `shared_assets.tar.gz` archive. The infection analysis results are located in the `data/output` folder of Acastellanii_legionella_infection.tar.gz.

All archives can be downloaded at the bottom of the page.

 

Hybrid genome assembly:

Genome assembly pipeline code and output data used for the assembly of 2 A. castellanii strains (Neff and C3) through a hybrid pipeline combining Illumina shotgun, Hi-C and Oxford Nanopore long reads.

Github: https://github.com/cmdoret/Acastellanii_hybrid_assembly

Archive: Acastellanii_hybrid_assembly.tar.gz

 

Genome annotation:

Genome annotation pipeline used for functional annotation of A. castellanii strains C3 and Neff, and associated output files.

Github: https://github.com/cmdoret/Acastellanii_genome_annotation

Archive: Acastellanii_genome_annotation.tar.gz

 

Genome analyses:

Code and data related to general analyses of genomic properties of A. castellanii strains C3 and Neff.

Github: https://github.com/cmdoret/Acastellanii_genome_analysis

Archive: Acastellanii_genome_analysis.tar.gz

 

Infection analyses:

Code and data related to the analysis of structural changes in the A. castellanii C3 genome during infection by L. pneumophila.

Github: https://github.com/cmdoret/Acastellanii_legionella_infection

Archive: Acastellanii_legionella_infection.tar.gz
 

Shared assets:

This archive contains processed files (genomes, annotations, Hi-C matrices, differential expression results) which can be useful for reanalysis, and are automatically pulled when executing the pipeline of some repositories.

Archive: shared_assets.tar.gz

 

Supp. analyses:

Code and data related to short ad-hoc analyses on the genomic location of specific sequences in the genomes of C3 and Neff. The archive contains two subfolders: `telomere_repeats` where we analyse the distribution of TTAGGG subtelomeric repeats throughout the A. castellanii assemblies, and `C3_exclusive_regions` where we visualize the genomic distribution of C3-specific sequences (i.e. absent from Neff) along the C3 assembly.

 

Archive: supp_analyses.tar.gz
 

Files

Files (1.8 GB)

Name Size Download all
md5:bec100203738537d473e831b170f3924
334.8 MB Download
md5:fce9f2abc7418cb60331f7ee8cee758f
249.0 MB Download
md5:a8ad10a883eac18eff795372a707424e
116.4 MB Download
md5:50c319c46ca40afcc43b1ee5a9d3210c
347.6 MB Download
md5:f9b73075fec00ec70f12639d8179f6c4
691.0 MB Download
md5:73b98972850641171ae3a84aaa6689f3
30.7 MB Download