Published February 8, 2023 | Version v1
Dataset Open

CPA-Perturb-seq: Multiplexed single-cell characterization of alternative polyadenylation regulators (PBMC data)

  • 1. New York Genome Center, Center for Genomics and Systems Biology, New York University, New York University Grossman School of Medicine
  • 2. New York Genome Center, Center for Genomics and Systems Biology, New York University
  • 3. Department of Genetics, Stanford University, Department of Computer Science, Stanford University
  • 4. New York Genome Center

Description

This site provides access to datasets from the CPA-Perturb-seq manuscript (Kowalski*, Wessels*, Linder* et al.), including PBMC data to replicate the analyses in Figure 6. We release these data as Seurat objects, where each object contains single-cell quantifications of gene expression (RNA assay) and of polyA site usage (polyA site assay). To explore these data, please install the PASTA (PolyA Site analysis using relative Transcript Abundance) package, which provides infrastructure and analytical tools for studying alternative polyadenylation at single-cell resolution. For each dataset, we also include a fragment file, which enables visualization of read coverage plots across groups of cells.

To replicate the analysis in Figure 6, in which we analyze a dataset of circulating human peripheral blood mononuclear cells (PBMCs), we provide a vignette available here. The following files are used:

  1. matrix.mtx: 10X file containing RNA counts for the PBMC dataset.

  2. barcodes.tsv: 10X file containing cell barcodes for the PBMC dataset.

  3. genes.tsv: 10X file containing genes for the PBMC dataset.

  4. PBMC_meta_data.csv: Metadata for the PBMC dataset.

  5. PBMC_pA_counts.tab.gz: Counts file containing polyA site quantification for the PBMC dataset.

  6. PBMC_fragments.tsv.gz: Fragment file to visualize the PBMC dataset.

  7. PBMC_fragments.tsv.gz.tbi: Fragment file index for the PBMC dataset.

  8. PBMC_polyA_peaks.gff: GFF file containing the locations of polyA site read regions.

  9. human_PAS_hg38.txt: Text file containing information from the polyAdbv3 resource.
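
As a quick sanity check on the three 10X files before starting the vignette, the sketch below sums the RNA counts per cell barcode using only the Python standard library. It is not the authors' workflow, which uses Seurat and PASTA in R. It assumes that matrix.mtx, barcodes.tsv and genes.tsv sit uncompressed in one directory, that matrix.mtx is a Matrix Market coordinate file with genes as rows and barcodes as columns, and that genes.tsv holds one gene per line. The function names are hypothetical.

```python
from collections import defaultdict
from pathlib import Path


def read_lines(path):
    """Return the non-empty lines of a text file, stripped of whitespace."""
    with open(path) as handle:
        return [line.strip() for line in handle if line.strip()]


def umi_totals_per_barcode(data_dir):
    """Sum counts per cell barcode from a 10X-style triplet directory.

    Assumption: matrix.mtx uses 1-based (gene_row, barcode_column, count)
    entries. Barcodes without any entry are absent from the result.
    """
    data_dir = Path(data_dir)
    barcodes = read_lines(data_dir / "barcodes.tsv")
    genes = read_lines(data_dir / "genes.tsv")
    totals = defaultdict(int)

    with open(data_dir / "matrix.mtx") as handle:
        # Skip the banner and comment lines, which start with '%'.
        size_line = ""
        for line in handle:
            if not line.startswith("%"):
                size_line = line
                break
        n_rows, n_cols, _n_entries = (int(x) for x in size_line.split())
        if n_rows != len(genes) or n_cols != len(barcodes):
            raise ValueError("matrix dimensions do not match genes/barcodes")

        # Remaining lines are the non-zero entries of the sparse matrix.
        for line in handle:
            fields = line.split()
            if not fields:
                continue
            _row, col, value = fields
            totals[barcodes[int(col) - 1]] += int(float(value))
    return dict(totals)
```

Calling `umi_totals_per_barcode("path/to/download")` returns a dictionary keyed by barcode. Streaming the file line by line keeps memory use low, but for real analysis a sparse-matrix reader is the better choice.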

Files

Files (33.7 GB)

Checksum                                Size
md5:aedae2961e28d9959c78c014c8228839    1.6 MB
md5:39b9982990a0e2e13a6a9b9e1e332fe7    216.1 kB
md5:1424b1f92ca47e4c15dc31ddfc0c7263    43.1 MB
md5:281ff58b562a442091b952a500033c3f    1.5 GB
md5:603e9e333051c3a11402c1f8d64b38b0    29.8 GB
md5:f06b4f43b675914dbcc61bc8b409f41d    2.0 MB
md5:d5209506b182d61b3437092c5286b61e    4.9 MB
md5:0e78b25109bee70c92e0fbf0c0441653    2.4 GB
md5:620111deb83bc3a6217a2353720c4226    4.3 MB
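
Because the full download is 33.7 GB, it can be worth checking each file against its listed md5 checksum after transfer. The following is a minimal sketch using Python's standard `hashlib` module. The function names are hypothetical, and the file is read in chunks so that the largest files do not need to fit in memory.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A call such as `matches_listed_checksum(local_path, "md5:<hex from the record>")` returns `True` when the file arrived intact. The listing above does not show which checksum belongs to which file name, so take that pairing from the record page itself.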