Published August 31, 2015 | Version v1
Dataset Open

Codon similarity data in ATTED-II ver 8.0 (Bra, Mtr)

  • 1. Tohoku University

Description

Codon similarity data in ATTED-II ver 8.0

The gene-to-gene codon similarity data are organized as tables, one per query gene, each named after the Entrez Gene ID of that query gene. Each table has three columns: the Entrez Gene ID of the partner gene, the MR (Mutual Rank) value (smaller values indicate stronger similarity), and the Pearson correlation coefficient (larger values indicate stronger similarity).
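As a rough illustration of the three-column layout described above, the following sketch parses one table and lists the partners with the smallest MR. The record does not state the delimiter or whether a header line is present, so whitespace-separated columns without a header are assumed here; the gene IDs and values in `sample` are made up, and `parse_table` and `top_partners` are hypothetical helper names.

```python
def parse_table(text):
    """Parse one per-gene table into (partner_gene_id, mr, pcc) tuples.

    Assumption: columns are whitespace-separated and there is no header line.
    """
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines, if any
        gene_id, mr, pcc = line.split()[:3]
        rows.append((gene_id, float(mr), float(pcc)))
    return rows


def top_partners(rows, n=3):
    """Return the n partners with the smallest MR (strongest similarity)."""
    return sorted(rows, key=lambda row: row[1])[:n]


# Made-up example content; real tables are named by the query gene's Entrez Gene ID.
sample = """\
100000001 1.0 0.98
100000002 12.5 0.91
100000003 3.2 0.95
"""

rows = parse_table(sample)
best = top_partners(rows, n=2)
for gene_id, mr, pcc in best:
    print(gene_id, mr, pcc)
```

Sorting on the MR column rather than the PCC column follows the description, which presents MR as the index of similarity.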

Protein-coding sequences were retrieved from NCBI's RefSeq database. For each gene, a 61-dimensional vector was derived from the codon counts in its protein-coding sequence. When multiple RefSeq sequences were associated with a single gene, the longest was used for the codon usage calculation. Pearson correlation coefficients (PCCs) were calculated between the vectors of every pair of genes. These PCCs were then converted into MRs, which serve as the index of codon usage similarity between genes.
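The procedure above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the pipeline used to build the dataset: the 61 dimensions are taken to be the sense codons of the standard genetic code (64 codons minus the three stop codons), the MR of a gene pair is taken to be the geometric mean of the two directional PCC ranks, and ties are broken by input order. The gene names and sequences are toy data, and all function names are hypothetical.

```python
from itertools import product
from math import sqrt

# Assumption: the 61 dimensions are the sense codons of the standard genetic code.
STOPS = {"TAA", "TAG", "TGA"}
SENSE_CODONS = ["".join(c) for c in product("TCAG", repeat=3)
                if "".join(c) not in STOPS]


def codon_vector(cds):
    """Count each sense codon in a coding sequence, read in frame from position 0."""
    cds = cds.upper().replace("U", "T")
    counts = dict.fromkeys(SENSE_CODONS, 0)
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if codon in counts:  # stop codons and ambiguous codons are ignored
            counts[codon] += 1
    return [counts[c] for c in SENSE_CODONS]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    dx = [a - mx for a in x]
    dy = [b - my for b in y]
    num = sum(a * b for a, b in zip(dx, dy))
    den = sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    return num / den if den else 0.0


def mutual_ranks(vectors):
    """Convert pairwise PCCs to MRs (assumed: geometric mean of the two ranks)."""
    genes = list(vectors)
    pcc = {(a, b): pearson(vectors[a], vectors[b])
           for a in genes for b in genes if a != b}
    rank = {}
    for a in genes:
        # Rank 1 is the partner with the highest PCC from gene a's point of view.
        partners = sorted((b for b in genes if b != a), key=lambda b: -pcc[(a, b)])
        for r, b in enumerate(partners, start=1):
            rank[(a, b)] = r
    return {(a, b): sqrt(rank[(a, b)] * rank[(b, a)]) for (a, b) in pcc}


# Toy data: several candidate sequences per gene; the longest one is used.
candidates = {
    "g1": ["ATGGCCTAA", "ATGGCCGCCAAATAA"],
    "g2": ["ATGGCCGCCAAGTAA"],
    "g3": ["ATGTTTTTTCTGTAA"],
}
vectors = {g: codon_vector(max(seqs, key=len)) for g, seqs in candidates.items()}
mr = mutual_ranks(vectors)
print(mr[("g1", "g2")])
```

Because each rank is computed from one gene's point of view, the two directional ranks can differ; combining them makes the resulting MR symmetric for a pair of genes.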

Files

codon_Bra.v15-08.G45949-S61.codon.mrgeo.d.zip

Files (36.2 GB)

Name Size
md5:14f44de7f2f68e651ed33ad8af8d44c9 19.0 GB
md5:25f450ba8534139dee52e8b07a3d8d49 17.2 GB

Additional details

Related works

Is supplement to
Journal article: 10.1093/pcp/pcv165 (DOI)

References

  • Aoki Y, Okamura Y, Tadaka S, Kinoshita K, Obayashi T. (2016) ATTED-II in 2016: a plant coexpression database towards lineage-specific coexpression. Plant and Cell Physiology, 57, e5.