cGTEx_dataset: A multi-tissue atlas of regulatory variants in cattle
Creators
- Liu, Shuli (1)
- Gao, Yahui (2)
- Canela-Xandri, Oriol (3)
- Wang, Sheng (4)
- Yu, Ying (5)
- Cai, Wentao (6)
- Li, Bingjie (7)
- Xiang, Ruidong (8)
- Chamberlain, Amanda J. (9)
- Pairo-Castineira, Erola (10)
- D'Mellow, Kenton (3)
- Rawlik, Konrad (11)
- Xia, Charley (11)
- Yao, Yuelin (3)
- Navarro, Pau (3)
- Rocha, Dominique (12)
- Li, Xiujin (13)
- Yan, Ze (5)
- Li, Congjun (14)
- Rosen, Benjamin D. (14)
- Tassell, Curtis P. Van (14)
- Vanraden, Paul M. (14)
- Zhang, Shengli (5)
- Ma, Li (15)
- Cole, John B. (14)
- Liu, George E. (14)
- Tenesa, Albert (10)
- Fang, Lingzhao (16)
- 1. Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA; National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China; School of Life Sciences, Westlake University, Hangzhou, China
- 2. Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA; Department of Animal and Avian Sciences, University of Maryland, College Park, MD, USA
- 3. MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK
- 4. State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- 5. National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
- 6. Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
- 7. Scotland's Rural College (SRUC), Roslin Institute Building, Midlothian, UK
- 8. Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, Victoria, Australia; Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, Australia
- 9. Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, Australia
- 10. MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK; The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, UK
- 11. The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, UK
- 12. INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas, France
- 13. Guangdong Provincial Key Laboratory of Waterfowl Healthy Breeding, College of Animal Science & Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
- 14. Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA
- 15. Department of Animal and Avian Sciences, University of Maryland, College Park, MD, USA
- 16. Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA; MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK
Description
The files are the raw data of the cGTEx dataset used in the publication https://doi.org/10.1038/s41588-022-01153-5. For details, please read the Methods section of that publication.
1. cGTEx_meta_data_8646sample.xlsx
Sample metadata: sample names and their sample accessions, together with information such as data size, cleaned reads, mapping rate, and age. The data were extracted from SRA (https://www.ncbi.nlm.nih.gov/sra/) and BIGD (https://bigd.big.ac.cn/bioproject/); BIGD samples are those starting with CRS.
2. cGTEx_count_8646sample_27607gene.txt.gz
Raw RNA-seq read counts for 27607 genes (column names are Ensembl gene IDs) in 8646 samples (row names are sample names).
3. cGTEx_TPM_8646sample_27607gene.txt.gz
TPM values for 27607 genes (column names are Ensembl gene IDs) in 8646 samples (row names are sample names).
4. cGTEx_imputed_vcf.tar.gz
Imputed SNP genotypes for 7297 RNA-seq samples across the 29 autosomes.
5. cGTEx_exon_junction_8646sample.tar.gz
Exon junction files for the 8646 samples.
Note: Small discrepancies in some sample names, and missing headers in some data sets, relative to https://cgtex.roslin.ed.ac.uk/ have been resolved in this upload.
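Because the count and TPM files are large (8646 samples by 27607 genes), it can help to inspect only the first few rows before loading a whole matrix. The following is a minimal Python sketch of that idea, not a documented loader for this dataset. It assumes the files are tab-delimited with one header row of gene IDs; the delimiter and exact header layout are not stated in this record, so the function name `read_expression_head` and its parameters are hypothetical and may need adjusting.

```python
import csv
import gzip
from itertools import islice


def read_expression_head(path, n_samples=5, delimiter="\t"):
    """Read the header and the first n_samples rows of a gzipped matrix.

    Assumed layout (not confirmed by the dataset record): one header row
    of gene IDs, then one row per sample whose first field is the sample
    name and whose remaining fields are numeric values.
    """
    with gzip.open(path, "rt", newline="") as handle:
        reader = csv.reader(handle, delimiter=delimiter)
        header = next(reader)
        records = list(islice(reader, n_samples))

    # Some writers omit the label above the row-name column, so the header
    # is one field shorter than the data rows; handle both layouts.
    if records and len(header) == len(records[0]) - 1:
        gene_ids = header
    else:
        gene_ids = header[1:]

    # Map each sample name to its list of per-gene values.
    values = {rec[0]: [float(v) for v in rec[1:]] for rec in records}
    return gene_ids, values
```

Calling `read_expression_head("cGTEx_TPM_8646sample_27607gene.txt.gz", n_samples=3)` would return the gene IDs and the values for the first three samples, provided the layout assumptions above hold for the downloaded file.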
Notes
Files
(17.9 GB)
Additional details
Related works
- Compiles
- Journal article: 10.1038/s41588-022-01153-5 (DOI)
- Is identical to
- Dataset: https://cgtex.roslin.ed.ac.uk/ (URL)
- Is supplemented by
- Workflow: https://zenodo.org/record/6510550 (URL)
Funding
- TRAINEd – TRAIN@Ed 801215
- European Commission
- Prediction of genes and regulatory elements in farm animal genomes BBS/E/D/10002070
- UK Research and Innovation
- Genetic improvement of farmed animals BBS/E/D/30002275
- UK Research and Innovation
- Vast-scale linear mixed modelling genetic discovery approaches for genome- and exome-wide association analyses to enable therapeutic target validation MR/R025851/1
- UK Research and Innovation
- Understanding disease through environment-wide association studies MR/P015514/1
- UK Research and Innovation