Hydractinia strain 236-21 genome assembly and Alr domain predictions
Creators
- 1. University of Pittsburgh
- 2. National Institutes of Health
- 3. University of Florida
Description
This dataset is related to the preprint "A family of unusual A family of unusual immunoglobulin superfamily genes in an invertebrate histocompatibility complex" (https://www.biorxiv.org/content/10.1101/2022.03.04.482883v2).
Preprint Abstract:
Most colonial marine invertebrates are capable of allorecognition, the ability to distinguish between themselves and conspecifics. One long-standing question is whether invertebrate allorecognition genes are homologous to vertebrate histocompatibility genes. In the cnidarian Hydractinia symbiolongicarpus, allorecognition is controlled by at least two genes, Allorecognition 1 (Alr1) and Allorecognition 2 (Alr2), which encode highly polymorphic cell surface proteins that serve as markers of self. Here, we show that Alr1 and Alr2 are part of a family of 41 Alr genes, all of which reside a single genomic interval called the Allorecognition Complex (ARC). Using sensitive homology searches and highly accurate structural predictions, we demonstrate that the Alr proteins are members of the immunoglobulin superfamily (IgSF) with V-set and I-set Ig domains unlike any previously identified in animals. Specifically, their primary amino acid sequences lack many of the motifs considered diagnostic for V-set and I-set domains, yet they adopt secondary and tertiary structures nearly identical to canonical Ig domains. Thus, the V-set domain, which played a central role in the evolution of vertebrate adaptive immunity, was present in the last common ancestor of cnidarians and bilaterians. Unexpectedly, several Alr proteins also have immunoreceptor tyrosine-based activation motifs (ITAMs) and immunoreceptor tyrosine-based inhibitory motifs (ITIMs) in their cytoplasmic tails, suggesting they could participate in pathways homologous to those that regulate immunity in humans and flies. This work expands our definition of the IgSF with the addition of a family of unusual members, several of which play a role in invertebrate histocompatibility.
This dataset contains:
- Hsym-236-21-genome-assembly.fa.gz: A gzip-compressed FASTA-formatted file of the genome assembly generated in the paper.
- Alr-domain-structure-predictions.zip: a zip-compressed file with structural predictions produced with Colabfold for all domains of the Alr proteins described in that manuscript.
Files
Alr-domain-structure-predictions.zip
Files
(125.5 MB)
Name | Size | Download all |
---|---|---|
md5:424c8bdb4b7865317e5f5a7a8a6509b6
|
1.9 MB | Preview Download |
md5:0c119300cd52b5e53451b9f053808559
|
123.6 MB | Download |
Additional details
Related works
- Is cited by
- Preprint: 10.1101/2022.03.04.482883 (DOI)