Published November 10, 2023 | Version v1
Dataset Open

Haplotype-aware reference genome reveals hidden somatic mutations of sweet orange

Creators

Description

Filename: ASE_in_five_fruit_development.txt

Description: Based on our haplotype sequences, we confirmed biallelic genes showed significant expression difference between two alleles in at least one fruit developmental stage. We collected the RNA-seq data from fruit of Newhall navel orange at five developmental stages (90, 120, 150, 180 and 210 days after bloom). RNA-seq data from previous project GSE108930 in NCBI database.

 

Filename: Biallelic_genes_haplogenomes.tsv

Description: The biallelic genes were identified using the Genespace program.

 

Filename: Haplogenomes_CENH3_chip_peaks.bw

Description: The CENH3 sequences were collected from BankIt ID 2305947. These reads (including the input library as a control) were aligned to the two assembled haplotypes using Bowtie2 (v2.5.1) with default parameters. MACS2 (v2.2.7.1) with the additional parameters “-f BAM -ghs -B -q 0.01” was used to perform peak calling. The peaks generated from CENH3 chip-seq.

 

Filename: Haplogenomes_Control_chip_peaks.bw

Description: The peaks generated from Control chip-seq.

 

Filename: Haplotype_based_79accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the haplotype-based method. The derived somatic mutations were identified based on nine samples from the outgroup (Earlier Clade I).

 

Filename: HaplotypeA_CuteSV.vcf

Description: The HiFi reads were mapped to haplotype A. We called SVs using the CuteSV program.

 

Filename: HaplotypeA_gene_function_annotation.tsv

Description: The gene annotations of haplotype A.

 

Filename: HaplotypeA_gene_model.gff3

Description: The gene structure model of haplotype A.

 

Filename: HaplotypeA_genome.fa

Description: The genome sequences of haplotype A.

 

Filename: HaplotypeA_PEPPER_OUTPUT.zip

Description: The small variations of sweet orange using the haplotype A as the reference genome.

 

Filename: HaplotypeA_TEs_annotation.gff3

Description: The TE annotations of haplotype A.

 

Filename: HaplotypeB_gene_function_annotation.tsv

Description: The gene annotations of haplotype B.

 

Filename: HaplotypeB_gene_model.gff3

Description: The gene structure model of haplotype B.

 

Filename: HaplotypeB_genome.fa

Filename: ASE_in_five_fruit_development.txt

Description: Based on our haplotype sequences, we confirmed biallelic genes showed significant expression difference between two alleles in at least one fruit developmental stage. We collected the RNA-seq data from fruit of Newhall navel orange at five developmental stages (90, 120, 150, 180 and 210 days after bloom). RNA-seq data from previous project GSE108930 in NCBI database.

 

Filename: Biallelic_genes_haplogenomes.tsv

Description: The biallelic genes were identified using the Genespace program.

 

Filename: Haplogenomes_CENH3_chip_peaks.bw

Description: The CENH3 sequences were collected from BankIt ID 2305947. These reads (including the input library as a control) were aligned to the two assembled haplotypes using Bowtie2 (v2.5.1) with default parameters. MACS2 (v2.2.7.1) with the additional parameters “-f BAM -ghs -B -q 0.01” was used to perform peak calling. The peaks generated from CENH3 chip-seq.

 

Filename: Haplogenomes_Control_chip_peaks.bw

Description: The peaks generated from Control chip-seq.

 

Filename: Haplotype_based_79accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the haplotype-based method. The derived somatic mutations were identified based on nine samples from the outgroup (Earlier Clade I).

 

Filename: HaplotypeA_CuteSV.vcf

Description: The HiFi reads were mapped to haplotype A. We called SVs using the CuteSV program.

 

Filename: HaplotypeA_gene_function_annotation.tsv

Description: The gene annotations of haplotype A.

 

Filename: HaplotypeA_gene_model.gff3

Description: The gene structure model of haplotype A.

 

Filename: HaplotypeA_genome.fa

Description: The genome sequences of haplotype A.

 

Filename: HaplotypeA_PEPPER_OUTPUT.zip

Description: The small variations of sweet orange using the haplotype A as the reference genome.

 

Filename: HaplotypeA_TEs_annotation.gff3

Description: The TE annotations of haplotype A.

 

Filename: HaplotypeB_gene_function_annotation.tsv

Description: The gene annotations of haplotype B.

 

Filename: HaplotypeB_gene_model.gff3

Description: The gene structure model of haplotype B.

 

Filename: HaplotypeB_genome.fa

Description: The genome sequences of haplotype B.

 

Filename: HaplotypeB_TEs_annotation.gff3

Description: The TE annotations of haplotype B.

 

Filename: Single_reference_87accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the single reference genome (Haplotype A).

 

Filename: Somatic_material_RNA_seq_matrix.txt

Description: The expression matrix of BT_3 and BT_5 (a set of somatic mutation material).

 

Filename: ASE_in_five_fruit_development.txt

Description: Based on our haplotype sequences, we confirmed biallelic genes showed significant expression difference between two alleles in at least one fruit developmental stage. We collected the RNA-seq data from fruit of Newhall navel orange at five developmental stages (90, 120, 150, 180 and 210 days after bloom). RNA-seq data from previous project GSE108930 in NCBI database.

 

Filename: Biallelic_genes_haplogenomes.tsv

Description: The biallelic genes were identified using the Genespace program.

 

Filename: Haplogenomes_CENH3_chip_peaks.bw

Description: The CENH3 sequences were collected from BankIt ID 2305947. These reads (including the input library as a control) were aligned to the two assembled haplotypes using Bowtie2 (v2.5.1) with default parameters. MACS2 (v2.2.7.1) with the additional parameters “-f BAM -ghs -B -q 0.01” was used to perform peak calling. The peaks generated from CENH3 chip-seq.

 

Filename: Haplogenomes_Control_chip_peaks.bw

Description: The peaks generated from Control chip-seq.

 

Filename: Haplotype_based_79accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the haplotype-based method. The derived somatic mutations were identified based on nine samples from the outgroup (Earlier Clade I).

 

Filename: HaplotypeA_CuteSV.vcf

Description: The HiFi reads were mapped to haplotype A. We called SVs using the CuteSV program.

 

Filename: HaplotypeA_gene_function_annotation.tsv

Description: The gene annotations of haplotype A.

 

Filename: HaplotypeA_gene_model.gff3

Description: The gene structure model of haplotype A.

 

Filename: HaplotypeA_genome.fa

Description: The genome sequences of haplotype A.

 

Filename: HaplotypeA_PEPPER_OUTPUT.zip

Description: The small variations of sweet orange using the haplotype A as the reference genome.

 

Filename: HaplotypeA_TEs_annotation.gff3

Description: The TE annotations of haplotype A.

 

Filename: HaplotypeB_gene_function_annotation.tsv

Description: The gene annotations of haplotype B.

 

Filename: HaplotypeB_gene_model.gff3

Description: The gene structure model of haplotype B.

 

Filename: HaplotypeB_genome.fa

Description: The genome sequences of haplotype B.

 

Filename: HaplotypeB_TEs_annotation.gff3

Description: The TE annotations of haplotype B.

 

Filename: Single_reference_87accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the single reference genome (Haplotype A).

 

Filename: Somatic_material_RNA_seq_matrix.txt

Description: The expression matrix of BT_3 and BT_5 (a set of somatic mutation material).

 

Filename: ASE_in_five_fruit_development.txt

Description: Based on our haplotype sequences, we confirmed biallelic genes showed significant expression difference between two alleles in at least one fruit developmental stage. We collected the RNA-seq data from fruit of Newhall navel orange at five developmental stages (90, 120, 150, 180 and 210 days after bloom). RNA-seq data from previous project GSE108930 in NCBI database.

 

Filename: Biallelic_genes_haplogenomes.tsv

Description: The biallelic genes were identified using the Genespace program.

 

Filename: Haplogenomes_CENH3_chip_peaks.bw

Description: The CENH3 sequences were collected from BankIt ID 2305947. These reads (including the input library as a control) were aligned to the two assembled haplotypes using Bowtie2 (v2.5.1) with default parameters. MACS2 (v2.2.7.1) with the additional parameters “-f BAM -ghs -B -q 0.01” was used to perform peak calling. The peaks generated from CENH3 chip-seq.

 

Filename: Haplogenomes_Control_chip_peaks.bw

Description: The peaks generated from Control chip-seq.

 

Filename: Haplotype_based_79accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the haplotype-based method. The derived somatic mutations were identified based on nine samples from the outgroup (Earlier Clade I).

 

Filename: HaplotypeA_CuteSV.vcf

Description: The HiFi reads were mapped to haplotype A. We called SVs using the CuteSV program.

 

Filename: HaplotypeA_gene_function_annotation.tsv

Description: The gene annotations of haplotype A.

 

Filename: HaplotypeA_gene_model.gff3

Description: The gene structure model of haplotype A.

 

Filename: HaplotypeA_genome.fa

Description: The genome sequences of haplotype A.

 

Filename: HaplotypeA_PEPPER_OUTPUT.zip

Description: The small variations of sweet orange using the haplotype A as the reference genome.

 

Filename: HaplotypeA_TEs_annotation.gff3

Description: The TE annotations of haplotype A.

 

Filename: HaplotypeB_gene_function_annotation.tsv

Description: The gene annotations of haplotype B.

 

Filename: HaplotypeB_gene_model.gff3

Description: The gene structure model of haplotype B.

 

Filename: HaplotypeB_genome.fa

Description: The genome sequences of haplotype B.

 

Filename: HaplotypeB_TEs_annotation.gff3

Description: The TE annotations of haplotype B.

 

Filename: Single_reference_87accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the single reference genome (Haplotype A).

 

Filename: Somatic_material_RNA_seq_matrix.txt

Description: The expression matrix of BT_3 and BT_5 (a set of somatic mutation material).

Filename: ASE_in_five_fruit_development.txt

Description: Based on our haplotype sequences, we confirmed biallelic genes showed significant expression difference between two alleles in at least one fruit developmental stage. We collected the RNA-seq data from fruit of Newhall navel orange at five developmental stages (90, 120, 150, 180 and 210 days after bloom). RNA-seq data from previous project GSE108930 in NCBI database.

 

Filename: Biallelic_genes_haplogenomes.tsv

Description: The biallelic genes were identified using the Genespace program.

 

Filename: Haplogenomes_CENH3_chip_peaks.bw

Description: The CENH3 sequences were collected from BankIt ID 2305947. These reads (including the input library as a control) were aligned to the two assembled haplotypes using Bowtie2 (v2.5.1) with default parameters. MACS2 (v2.2.7.1) with the additional parameters “-f BAM -ghs -B -q 0.01” was used to perform peak calling. The peaks generated from CENH3 chip-seq.

 

Filename: Haplogenomes_Control_chip_peaks.bw

Description: The peaks generated from Control chip-seq.

 

Filename: Haplotype_based_79accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the haplotype-based method. The derived somatic mutations were identified based on nine samples from the outgroup (Earlier Clade I).

 

Filename: HaplotypeA_CuteSV.vcf

Description: The HiFi reads were mapped to haplotype A. We called SVs using the CuteSV program.

 

Filename: HaplotypeA_gene_function_annotation.tsv

Description: The gene annotations of haplotype A.

 

Filename: HaplotypeA_gene_model.gff3

Description: The gene structure model of haplotype A.

 

Filename: HaplotypeA_genome.fa

Description: The genome sequences of haplotype A.

 

Filename: HaplotypeA_PEPPER_OUTPUT.zip

Description: The small variations of sweet orange using the haplotype A as the reference genome.

 

Filename: HaplotypeA_TEs_annotation.gff3

Description: The TE annotations of haplotype A.

 

Filename: HaplotypeB_gene_function_annotation.tsv

Description: The gene annotations of haplotype B.

 

Filename: HaplotypeB_gene_model.gff3

Description: The gene structure model of haplotype B.

 

Filename: HaplotypeB_genome.fa

Description: The genome sequences of haplotype B.

 

Filename: HaplotypeB_TEs_annotation.gff3

Description: The TE annotations of haplotype B.

 

Filename: Single_reference_87accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the single reference genome (Haplotype A).

 

Filename: Somatic_material_RNA_seq_matrix.txt

Description: The expression matrix of BT_3 and BT_5 (a set of somatic mutation material).

Description: The genome sequences of haplotype B.

 

Filename: HaplotypeB_TEs_annotation.gff3

Description: The TE annotations of haplotype B.

 

Filename: Single_reference_87accessions_somatic_variations.vcf

Description: The small somatic variations generated based on the single reference genome (Haplotype A).

 

Filename: Somatic_material_RNA_seq_matrix.txt

Description: The expression matrix of BT_3 and BT_5 (a set of somatic mutation material).

Files

ASE_in_five_fruit_development.txt

Files (1.3 GB)

Name Size Download all
md5:7e69c3195c044106ee57a8d60db6575d
1.5 MB Preview Download
md5:2ce66571bf509bd34f882f6abab88bb9
766.8 kB Download
md5:0dba379d4c19ffd3feb1c01d32930ee5
76.6 MB Download
md5:2dc2178998f41b112db382297486c979
107.8 MB Download
md5:5cb0f38665fd424be769e22b75888d1b
27.9 MB Download
md5:de2ec2f85d0d6531c30fd316b91583e1
82.3 MB Download
md5:3097f45420818c65c61ca210a3bf8572
41.2 MB Download
md5:668ab20c075e9a47ba4b514645f040a2
26.1 MB Download
md5:861786795711e47465717d44be5d0134
326.2 MB Download
md5:18daff9776ab8b797f552f4422859a66
68.3 MB Preview Download
md5:b19c28ecb45fc1ea7db430b9c4dddf08
46.9 MB Download
md5:837d3b8406c995dd86658200cc196c17
40.1 MB Download
md5:bdda650b7de146dda19d993734cc183d
25.7 MB Download
md5:d7436e2114e47e0fb1859a2ec4dc2ce7
310.8 MB Download
md5:3a5256706c25aa64dfd40d00b7b0b0c5
46.4 MB Download
md5:971bcdcf22854ee7de190e72d6082c89
25.6 MB Download
md5:f1df660d1e5ecdbc4191247531b523e9
16.6 MB Preview Download