Spermatogenesis across mammals - Ensembl Release 112 mapping
Description
This repository contains remapped single-cell AnnData (h5ad) objects from the original publication "The molecular evolution of spermatogenesis across mammals" (Florent Murat, Noe Mbengue et al. 2023; https://doi.org/10.1038/s41586-022-05547-7). The data were (pseudo-)remapped against Ensembl Release 112 genomes and annotations with kb-python (v0.28.2). For Macaca mulatta, the NCBI files GCF_003339765.1_Mmul_10_genomic.fna and GCF_003339765.1_Mmul_10_genomic.gtf were used instead, because the Ensembl Release 112 file Macaca_mulatta.Mmul_10.pep.all.fa.gz contains a low number of proteins. Only filtered counts are provided.
If you use this data, please cite Murat and Mbengue et al. 2023.
Series information
organism | name | sample | age | chemistry | source | genome | gtf |
---|---|---|---|---|---|---|---|
bonobo | Pan paniscus | SN219 | 36yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700457[accn] | Pan_paniscus.panpan1.1.dna.toplevel.fa.gz | Pan_paniscus.panpan1.1.112.gtf.gz |
bonobo | Pan paniscus | SN224 | 15yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700458[accn] | Pan_paniscus.panpan1.1.dna.toplevel.fa.gz | Pan_paniscus.panpan1.1.112.gtf.gz |
chicken | Gallus gallus | SN264 | adult | v3 | https://www.ncbi.nlm.nih.gov/sra/ERX6700407[accn] | Gallus_gallus.bGalGal1.mat.broiler.GRCg7b.dna.toplevel.fa.gz | Gallus_gallus.bGalGal1.mat.broiler.GRCg7b.112.gtf.gz |
chicken | Gallus gallus | SN265 | adult | v3 | https://www.ncbi.nlm.nih.gov/sra/ERX6700408[accn] | Gallus_gallus.bGalGal1.mat.broiler.GRCg7b.dna.toplevel.fa.gz | Gallus_gallus.bGalGal1.mat.broiler.GRCg7b.112.gtf.gz |
chimp | Pan troglodytes | SN074 | 45yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700425[accn] | Pan_troglodytes.Pan_tro_3.0.dna.toplevel.fa.gz | Pan_troglodytes.Pan_tro_3.0.112.gtf.gz |
chimp | Pan troglodytes | SN112 | 14yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700426[accn] | Pan_troglodytes.Pan_tro_3.0.dna.toplevel.fa.gz | Pan_troglodytes.Pan_tro_3.0.112.gtf.gz |
chimp | Pan troglodytes | SN193 | 21yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700427[accn] | Pan_troglodytes.Pan_tro_3.0.dna.toplevel.fa.gz | Pan_troglodytes.Pan_tro_3.0.112.gtf.gz |
gibbon | Nomascus leucogenys | SN181 | 5yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700418[accn] | Nomascus_leucogenys.Nleu_3.0.dna.toplevel.fa.gz | Nomascus_leucogenys.Nleu_3.0.112.gtf.gz |
gibbon | Nomascus leucogenys | SN194 | 5yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700419[accn] | Nomascus_leucogenys.Nleu_3.0.dna.toplevel.fa.gz | Nomascus_leucogenys.Nleu_3.0.112.gtf.gz |
gorilla | Gorilla gorilla | SN180 | 43yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700420[accn] | Gorilla_gorilla.gorGor4.dna.toplevel.fa.gz | Gorilla_gorilla.gorGor4.112.gtf.gz |
gorilla | Gorilla gorilla | SN223 | 51yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700421[accn] | Gorilla_gorilla.gorGor4.dna.toplevel.fa.gz | Gorilla_gorilla.gorGor4.112.gtf.gz |
human | Homo sapiens | SN007 | 32yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700428[accn] | Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz | Homo_sapiens.GRCh38.112.gtf.gz |
human | Homo sapiens | SN011 | 32yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700429[accn] | Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz | Homo_sapiens.GRCh38.112.gtf.gz |
human | Homo sapiens | SN052 | 32yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700430[accn] | Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz | Homo_sapiens.GRCh38.112.gtf.gz |
human | Homo sapiens | SN111 | 28yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700431[accn] | Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz | Homo_sapiens.GRCh38.112.gtf.gz |
human | Homo sapiens | SN142 | 28yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700432[accn] | Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz | Homo_sapiens.GRCh38.112.gtf.gz |
macaque | Macaca mulatta | SN116 | 7yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700422[accn] | GCF_003339765.1_Mmul_10_genomic.fna.gz | GCF_003339765.1_Mmul_10_genomic.gtf.gz |
macaque | Macaca mulatta | SN143 | 9yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700423[accn] | GCF_003339765.1_Mmul_10_genomic.fna.gz | GCF_003339765.1_Mmul_10_genomic.gtf.gz |
marmoset | Callithrix jacchus | SN117 | 10yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700543[accn] | Callithrix_jacchus.mCalJac1.pat.X.dna.toplevel.fa.gz | Callithrix_jacchus.mCalJac1.pat.X.112.gtf.gz |
marmoset | Callithrix jacchus | SN130 | 10yo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700544[accn] | Callithrix_jacchus.mCalJac1.pat.X.dna.toplevel.fa.gz | Callithrix_jacchus.mCalJac1.pat.X.112.gtf.gz |
mouse | Mus musculus | SN090 | 9wo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700409[accn] | Mus_musculus.GRCm39.dna.primary_assembly.fa.gz | Mus_musculus.GRCm39.112.gtf.gz |
mouse | Mus musculus | SN115 | 9wo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700410[accn] | Mus_musculus.GRCm39.dna.primary_assembly.fa.gz | Mus_musculus.GRCm39.112.gtf.gz |
opossum | Monodelphis domestica | SN067 | adult | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700414[accn] | Monodelphis_domestica.ASM229v1.dna.toplevel.fa.gz | Monodelphis_domestica.ASM229v1.112.gtf.gz |
opossum | Monodelphis domestica | SN071 | 15.5mo | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700415[accn] | Monodelphis_domestica.ASM229v1.dna.toplevel.fa.gz | Monodelphis_domestica.ASM229v1.112.gtf.gz |
opossum | Monodelphis domestica | SN277 | adult | v2 | https://www.ncbi.nlm.nih.gov/sra/ERX6700416[accn] | Monodelphis_domestica.ASM229v1.dna.toplevel.fa.gz | Monodelphis_domestica.ASM229v1.112.gtf.gz |
platypus | Ornithorhynchus anatinus | SN253 | adult | v3 | https://www.ncbi.nlm.nih.gov/sra/ERX6700411[accn] | Ornithorhynchus_anatinus.mOrnAna1.p.v1.dna.toplevel.fa.gz | Ornithorhynchus_anatinus.mOrnAna1.p.v1.112.gtf.gz |
platypus | Ornithorhynchus anatinus | SN260 | adult | v3 | https://www.ncbi.nlm.nih.gov/sra/ERX6700412[accn] | Ornithorhynchus_anatinus.mOrnAna1.p.v1.dna.toplevel.fa.gz | Ornithorhynchus_anatinus.mOrnAna1.p.v1.112.gtf.gz |
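For scripted use, the rows of the series table can be turned into records. The following is a minimal sketch that assumes each row is available as one pipe-delimited line exactly as printed above; the function names `parse_series_row` and `experiment_accession` are hypothetical helpers, not part of any tool used in this record.

```python
import re

# Column names copied from the header row of the series table.
COLUMNS = ["organism", "name", "sample", "age", "chemistry", "source", "genome", "gtf"]


def parse_series_row(line: str) -> dict:
    """Split one pipe-delimited table row into a column -> value mapping."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    if len(cells) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} cells, got {len(cells)}: {line!r}")
    return dict(zip(COLUMNS, cells))


def experiment_accession(source_url: str) -> str:
    """Extract the ERX accession from an SRA link such as .../sra/ERX6700457[accn]."""
    match = re.search(r"/sra/(ERX\d+)", source_url)
    if match is None:
        raise ValueError(f"no ERX accession found in {source_url!r}")
    return match.group(1)


# Example with the first bonobo row of the table.
row = parse_series_row(
    "bonobo | Pan paniscus | SN219 | 36yo | v2 | "
    "https://www.ncbi.nlm.nih.gov/sra/ERX6700457[accn] | "
    "Pan_paniscus.panpan1.1.dna.toplevel.fa.gz | Pan_paniscus.panpan1.1.112.gtf.gz |"
)
print(row["sample"], row["chemistry"], experiment_accession(row["source"]))
```

The `chemistry` value of each record (v2 or v3) is the piece of information needed later when choosing the 10x technology for counting.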
Methods
Extract fastq:
fastq-dump --split-files ERRXXX
Create index (kb-python 0.28.2):
kb ref -i organism.idx -g organism.t2g.txt -f1 organism.cdna.fa organism.genome.fa organism.gtf
Get counts (kb-python 0.28.2), with -x set to either 10XV2 or 10XV3 according to the sample's chemistry:
kb count -i organism.idx -g organism.t2g.txt -x 10XV2/10XV3 --h5ad --filter bustools ERRXXX_2.fastq ERRXXX_3.fastq
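The counting step can be assembled per sample from the chemistry listed in the series table. The sketch below only builds the argument list and prints it; it assumes that chemistry v2 corresponds to `-x 10XV2` and v3 to `-x 10XV3`, and the helper name `kb_count_command` is hypothetical. It uses only the flags shown in the command above.

```python
import shlex

# Assumed mapping from the "chemistry" column to the value passed to kb count -x.
TECHNOLOGY = {"v2": "10XV2", "v3": "10XV3"}


def kb_count_command(index, t2g, chemistry, fastqs):
    """Return the kb count argument list for one sample (nothing is executed)."""
    if chemistry not in TECHNOLOGY:
        raise ValueError(f"unknown chemistry: {chemistry!r}")
    return [
        "kb", "count",
        "-i", index,
        "-g", t2g,
        "-x", TECHNOLOGY[chemistry],
        "--h5ad",
        "--filter", "bustools",
        *fastqs,
    ]


# Placeholder file names, as in the Methods commands.
cmd = kb_count_command(
    "organism.idx", "organism.t2g.txt", "v3",
    ["ERRXXX_2.fastq", "ERRXXX_3.fastq"],
)
print(shlex.join(cmd))
```

The resulting list could be passed to `subprocess.run(cmd, check=True)` on a machine where kb-python is installed; building the list first keeps the choice of technology explicit and easy to inspect.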
Files
Total size: 2.0 GB
Checksum | Size |
---|---|
md5:a8131703ec20cb4ad1a4011f1fa7437f | 78.2 MB |
md5:8afac90497e3b8c41f8b66587f1f1478 | 104.2 MB |
md5:beaff0a2099d571f9640498037fb7139 | 63.9 MB |
md5:e2e8477a452ac6ef23ded11527153ee2 | 57.2 MB |
md5:e0c86e4d3a5316d252ec14ab3c5b825f | 34.2 MB |
md5:986a4caf8dadeef30e44ab761bb6e7f3 | 62.3 MB |
md5:cabc84382841131a25492f4873897c3f | 67.2 MB |
md5:1922f209b04bb33e775b1f6edf060435 | 19.4 MB |
md5:cd4cceef37cac2ea3cbe3e3268ccca18 | 26.3 MB |
md5:8ccf449c0aacbbdc7b72ff4c134aa6b2 | 22.9 MB |
md5:2285700f28337b23362cbce9968621b6 | 77.8 MB |
md5:ebc4e6192276863a539d6c8b5448cbe9 | 70.6 MB |
md5:cf741b7586f076b3ef023b8f5da562e8 | 99.0 MB |
md5:94e55ce09568ba87ea1846eed731fd74 | 54.5 MB |
md5:0fb927abd0aa87890190a713d0034344 | 48.5 MB |
md5:835a3be3fe2ea45a6eb9574f236fc844 | 103.6 MB |
md5:b5a4eb94af602e484093d11a02a43da5 | 29.1 MB |
md5:619278c57d40940ef7bb004705d79f02 | 37.2 MB |
md5:95090d71263d3a7b23be420a1b4cbe89 | 23.5 MB |
md5:235c34368f80f4a8e90cac8c737cd879 | 27.2 MB |
md5:c7ed4b7ced0e070340c3c37cc6ce031c | 54.7 MB |
md5:078ff113582faaf67e8075872724764f | 49.9 MB |
md5:65ce3565d18faa4c028d2ec18d6d83a0 | 17.1 MB |
md5:2d132fd8eeca367566aff930e1ad3414 | 18.9 MB |
md5:2543b63a33b00bbd408ca36579b40a22 | 27.4 MB |
md5:90ac96c974d5537a319fb34c06fcd976 | 321.4 MB |
md5:d13b50b9b17753e88b2922ecc51ed39b | 363.9 MB |
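After downloading, a file can be checked against the md5 value listed for it in the Files section. The following is a minimal sketch using only the Python standard library; the helper names and the file name in the usage comment are hypothetical, since the file names are not shown in the listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a listed value such as 'md5:a813...' (prefix optional)."""
    expected = listed.strip().split(":", 1)[-1].lower()
    return md5_of_file(path) == expected


# Usage (hypothetical local file name):
# matches_listed_checksum("sample.h5ad", "md5:a8131703ec20cb4ad1a4011f1fa7437f")
```

Reading in chunks keeps memory use flat even for the largest files in the listing, which are several hundred megabytes.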
Additional details
Related works
- References: Publication, 10.1038/s41586-022-05547-7 (DOI)
Dates
- Created: 2024-07-17
References
- Murat, Florent, et al. "The molecular evolution of spermatogenesis across mammals." Nature 613.7943 (2023): 308-316.