Characterising a Hybrid Zone between a Cryptic Species Pair of Freshwater Snails

README for structure_input.txt

This is the infile used in the STRUCTURE analysis.

This is a tab-delimited text file containing the nuclear marker genotypes at 9 microsatellite markers and 3 coding genes for 409 Radix snails from the core and extended population data sets. 

Note that the text file was created in Windows 8.1. The data have been formatted for use with the program STRUCTURE (http://pritchardlab.stanford.edu/structure.html). To convert the genotype 
data for use with other programs (e.g. FSTAT, GDA), I recommend PGDSpider (http://www.cmpg.unibe.ch/software/PGDSpider/).

The first row contains self-explanatory headers indicating the contents of their respective columns. The subsequent rows contain genotype data for each individual.
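
For readers who want to load the file outside STRUCTURE, the following is a minimal Python sketch of reading the layout described above (tab-delimited, one header row, one individual per subsequent row). It is an illustration only and was not part of the original analysis. The function name read_structure_table and the header labels in the example are hypothetical; the actual header text in structure_input.txt may differ. Because the file was created in Windows, lines may end in CRLF, which splitlines() handles.

```python
import csv

def read_structure_table(text):
    """Split tab-delimited text into (header, rows).

    Hypothetical helper: assumes the first line is the header row and
    every later non-empty line describes one individual.
    """
    # splitlines() accepts both Windows (CRLF) and Unix (LF) line endings.
    lines = [line for line in text.splitlines() if line.strip()]
    table = list(csv.reader(lines, delimiter="\t"))
    return table[0], table[1:]

# Made-up four-column example; the real file has 26 columns.
sample = "ID\tPop\tM1a\tM1b\r\nALS1\t1\t150\t152\r\nALS2\t1\t-9\t-9\r\n"
header, rows = read_structure_table(sample)
print(len(rows), rows[0][0])
```

To apply the sketch to the real file, pass it the contents of structure_input.txt, e.g. read_structure_table(open("structure_input.txt").read()).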

Column 1: Individual identifiers

Most individuals are identified by the three-letter population code given in Table S1 of the supporting information file [available at http://onlinelibrary.wiley.com/journal/10.1111/%28ISSN%291365-294X] 
followed by a number, e.g. ALS1. Individuals in rows 331-410 have different labels, which correspond to samples in storage in our laboratory.

Column 2: Population identifiers

The integers correspond to the following:

1=ALS
2=BLL
3=BSC
4=CMT
5=CPG
6=DAS
7=DSP
8=DUN
9=ESP
10=FCT
11=FEL
12=FVI
13=GNL
14=HEN
15=ISG
16=JRC
17=LES
18=LMY
19=MBK
20=MBZ
21=ORN
22=PYO
23=RSL
24=SSB
25=TLR
26=TYR
27=URG
28=VDB
29=SWA (N.B. individuals from this population have prefix 159m* in column 1)
30=POR (N.B. individuals from this population have prefix 301m* in column 1)
31=DIJ (N.B. individuals from this population have prefix 303m* in column 1)
32=ABO (N.B. individuals from this population have prefix 304mn in column 1)
33=LAB (N.B. individuals from this population have prefix 305m* in column 1)
34=CSV (N.B. individuals from this population have prefix CH_D0* in column 1)
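
The list above can be turned into a lookup table, which may be convenient when relabelling STRUCTURE output. The sketch below is one way to do this in Python; the names POPULATION_CODES and population_code are hypothetical and chosen for illustration. The codes and their order are taken directly from the list above.

```python
# Population codes in the order listed above (integer 1 = first code).
_CODES = (
    "ALS BLL BSC CMT CPG DAS DSP DUN ESP FCT FEL FVI GNL HEN ISG JRC LES "
    "LMY MBK MBZ ORN PYO RSL SSB TLR TYR URG VDB SWA POR DIJ ABO LAB CSV"
).split()

POPULATION_CODES = {i: code for i, code in enumerate(_CODES, start=1)}

def population_code(pop_id):
    """Return the three-letter code for an integer (or numeric string) from column 2."""
    return POPULATION_CODES[int(pop_id)]

print(population_code("29"))
```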

Columns 3-20: Microsatellite marker genotypes.

Microsatellite alleles are given as fragment lengths in base pairs. -9 represents missing data. Each individual's diploid genotype for each microsatellite marker is given in one row, 
so there are two columns per marker (18 columns for the 9 markers).
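
As an illustration of this layout, the Python sketch below pairs the 18 microsatellite columns of one row into nine diploid genotypes and converts -9 to None. It assumes that the two columns of each marker are adjacent, as is usual for STRUCTURE's one-row-per-individual format; the function name microsatellite_genotypes is hypothetical and the allele sizes in the example row are made up.

```python
MISSING = -9  # missing-data code stated in this README

def microsatellite_genotypes(row):
    """Return nine (allele_1, allele_2) tuples from one data row.

    Columns 3-20 (1-based) are list indices 2-19 (0-based).
    A missing allele (-9) is returned as None.
    """
    alleles = [int(value) for value in row[2:20]]
    alleles = [None if a == MISSING else a for a in alleles]
    # Assumption: consecutive columns belong to the same marker.
    return [(alleles[i], alleles[i + 1]) for i in range(0, 18, 2)]

# Made-up row: ID, population, 18 allele columns, 6 coding-gene columns.
example = ["ALS1", "1"] + ["150", "152"] * 8 + ["-9", "-9"] + ["-9"] * 6
genotypes = microsatellite_genotypes(example)
print(len(genotypes), genotypes[0], genotypes[-1])
```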

Columns 21-26: Coding gene genotypes for act1a, HSPA2 and Psmd2.

Alleles are given as haplotype numbers for act1a and as nested haplotype numbers for HSPA2 and Psmd2 (see Figure S4 of the supporting information file, available at 
http://onlinelibrary.wiley.com/journal/10.1111/%28ISSN%291365-294X). -9 represents missing data. Again, there are two columns for each gene, representing diploid genotypes.
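
The coding gene columns can be read in the same spirit. The Python sketch below labels the six columns by gene, assuming that the genes appear in the order given in the heading above (act1a, HSPA2, Psmd2) and that the two columns of each gene are adjacent; please check this against the header row before relying on it. The function name coding_gene_genotypes is hypothetical and the haplotype numbers in the example are made up.

```python
GENES = ("act1a", "HSPA2", "Psmd2")  # assumed column order

def coding_gene_genotypes(row):
    """Map each gene name to its (haplotype_1, haplotype_2) pair.

    Columns 21-26 (1-based) are list indices 20-25 (0-based).
    A missing haplotype (-9) is returned as None.
    """
    values = [None if v.strip() == "-9" else int(v) for v in row[20:26]]
    return {gene: (values[2 * i], values[2 * i + 1]) for i, gene in enumerate(GENES)}

# Made-up row: ID, population, 18 microsatellite columns, 6 gene columns.
gene_example = ["ALS1", "1"] + ["-9"] * 18 + ["1", "2", "3", "3", "-9", "-9"]
gene_genotypes = coding_gene_genotypes(gene_example)
print(gene_genotypes)
```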
