Published September 4, 2020 | Version v2
Journal article Open

Cayla et al., 2020, Wellcome Open Research - Extented data

  • 1. University of Edinburgh

Description

Extended data for the identification of low complexity regions (LCRs) in the proteome of Trypanosoma brucei.

- Supplement Figure S1 (Cumulative distribution functions of the entropy values. Representation of the empirical cumulative distribution functions (ecdf) of the entropy values of the T. brucei proteome, calculated with the Shannon’s formula as implemented in the entropy.plugin() function, for the different window sizes. The vertical lines represent the different possible thresholds: 0.5, 1, 1.5, 2, 3.5, 4, 4.5, 5% under which LCR have been called.)

- Supplement Figure S2 (Statistics on the LCRs obtained from different thresholds.) Statistics obtained from the cumulative ecdf distributions for each window size (Windows). Values = numbers of LCRs identified, Mean and SD = mean and standard-deviation obtained from the cumulative ecdf, the remainder of the numbers are the different possible thresholds: 0.5, 1, 1.5, 2, 3.5, 4, 4.5, 5% under which LCR have been called, with their value indicated for each window size.

- Supplement File 1.pdf (Visualisation of LCRs, InterPro domain (InterPro) and PTMs for every protein (excluding VSGs) of the T. brucei proteome. Each plot represents a protein (ID and product). The X-axis indicates the protein size in amino acids and on the plot are represented the final combined LCRs (in red), the identified InterPro domains in blue and the overlap regions between LCR and InterPro domain indicated in yellow. Post-translational modifications (PTMs) identified in experimental analysis by different studies are in dictated above by “+” symbol. Each modification is coloured in blue when present in an InterPro domain, in red when present in an LCR or in black if present in neither.)

Notes

This work was also supported by the Marie Sklodowska Curie postdoctoral fellowship (proposal number 65470) and a Royal Society Research merit award [WM140045].

Files

supplement_file_1.pdf

Files (23.0 MB)

Name Size Download all
md5:1cef557bc0921c5399369816273de9ca
3.1 MB Download
md5:426e4700b041a4c06e57db6f4f32c2a9
1.0 MB Download
md5:47c244556caa3cb8189dec498583b317
907.7 kB Download
md5:404f76646f7201fca3840eb7ac9ecab6
18.1 MB Preview Download

Additional details

Related works

Is supplement to
Journal article: 10.12688/wellcomeopenres.16286.1 (DOI)

Funding

Wellcome Trust
Environmental sensing and cell-cell communication in African trypanosomes. 103740

References

  • Cayla et al (2020). A global analysis of low-complexity regions in the Trypanosoma brucei proteome reveals enrichment in the C-terminus of nucleic acid binding proteins providing potential targets of phosphorylation. Wellcome Open Res, 5:219 (https://doi.org/10.12688/wellcomeopenres.16286.1)