Dataset and Scripts for: RefPlantNLR: a comprehensive collection of experimentally validated plant NLRs (v.20200528_415)
Description
RefPlantNLR v.20200528_415
See bioRxiv 2020.07.08.193961; doi: https://doi.org/10.1101/2020.07.08.193961
SUPPLEMENTAL DATA
Table S1: Description of RefPlantNLR.
Table S2: Plant orders represented in RefPlantNLR.
Supplemental dataset 1: Amino acid sequences of RefPlantNLR entries (fasta format). This file contains 415 amino acid sequences.
Supplemental dataset 2: CDS sequences of RefPlantNLR entries (fasta format). This file contains 400 CDS sequences. CDS sequences could not be retrieved for 15 RefPlantNLR entries.
Supplemental dataset 3: Annotated genomic sequences of RefPlantNLR entries (GenBank flat file format). This file contains 329 genomic loci containing the gene models of 344 RefPlantNLR entries and 56 RefPlantNLR mRNA entries lacking genomic information.
Supplemental dataset 4: InterProScan annotation of the RefPlantNLR amino acid sequences (GFF3 format). This file contains the InterProScan annotation of 415 amino acid sequences.
Supplemental dataset 5: InterProScan annotation of the RefPlantNLR CDS sequences (GFF3 format). This file contains the InterProScan annotation of the 400 CDS sequences.
Supplemental dataset 6: Amino acid sequences of the extracted RefPlantNLR NB-ARC domains (fasta format). This file contains 424 NB-ARC domain (SUPERFAMILY signature SSF52540) amino acid sequences belonging to 415 RefPlantNLR entries.
Supplemental dataset 7: Amino acid sequences of the unique RefPlantNLR extracted NB-ARC domains (fasta format). This file contains 347 unique NB-ARC domain (SUPERFAMILY signature SSF52540) amino acid sequences.
Supplemental dataset 8: Clustal Omega alignment of the unique RefPlantNLR extracted NB-ARC domains (PHYLIP format). This file contains the Clustal Omega alignment of 346 unique NB-ARC domains (SUPERFAMILY signature SSF52540) with all positions with less than 95% coverage removed. Pb1 was omitted from this alignment.
Supplemental dataset 9: NB-ARC domain phylogeny of the RefPlantNLR entries using the Maximum likelihood method (Newick format). This file contains the phylogenetic analysis of the NB-ARC domain of the RefPlantNLR entries using the JTT method.
Supplemental dataset 10: Amino acid sequences of the non-redundant RefPlantNLR entries (fasta format). This file contains 235 amino acid sequences representing the non-redundant RefPlantNLR entries at a 90% amino acid identity threshold per genus according to the NB-ARC domain.
Supplemental dataset 11: Amino acid sequences of the NB-ARC domains of the non-redundant RefPlantNLR entries (fasta format). This file contains 241 amino acid sequences representing the extracted NB-ARC domains of the 235 non-redundant RefPlantNLR.
Appendix S1: R script used to generate annotations and figures.
Appendix S2: InterProScan descriptions used for generating annotations.
Files
Appendix_S2_20200423_InterPro_v5.44-79.0_Description.zip
Files
(20.3 MB)
Name | Size | Download all |
---|---|---|
md5:e2f6b34ee65466560d805895b4c55519
|
31.3 kB | Download |
md5:250e08dff79c19173d8bc8abd126e069
|
1.3 MB | Preview Download |
md5:670e352e14d17d4bff7760b1634f7af1
|
263.1 kB | Download |
md5:02c443fd07a25013d300d83626674257
|
65.6 kB | Download |
md5:5a5d1ccc0cbefc51b48c0ed57d8d89ee
|
463.6 kB | Download |
md5:6d2ce37f98869017227516c75c6ef7f2
|
1.3 MB | Download |
md5:c274c245d1ae62e16a9d49fa8bd1f02b
|
14.7 MB | Download |
md5:a0ba30529bcd1ad0c43fd63f49e15c08
|
909.9 kB | Download |
md5:705acd3d2bfdb5d9ed3f7df6972b08b5
|
886.2 kB | Download |
md5:fe986de09b89188201a25cfde67d854a
|
116.5 kB | Download |
md5:a19a09a970feef41bfa831e3d0d438ac
|
95.6 kB | Download |
md5:86599105e0b9fadf8d27dd068db6484d
|
121.4 kB | Download |
md5:bd4b4864e14dbddac562f79b5ef8afc8
|
21.1 kB | Download |
md5:0c06e48c2478cb4be8de7f75c1d2c5a5
|
66.6 kB | Download |
md5:fc2eaa15a669f3afdc400e6316037d00
|
11.8 kB | Download |