Published December 7, 2017 | Version v1
Dataset Open

Data from: Reducing cryptic relatedness in genomic datasets via a central node exclusion algorithm

Description

Cryptic relatedness is a confounding factor in genetic diversity and genetic association studies. Development of strategies to reduce cryptic relatedness in a sample is a crucial step for downstream genetic analyzes. The present study uses a node selection algorithm, based on network degrees of centrality, to evaluate its applicability and impact on evaluation of genetic diversity and population stratification. 1,036 Guzerá (Bos indicus) females were genotyped using Illumina Bovine SNP50 v2 BeadChip. Four strategies were compared. The first and second strategies consists on a iterative exclusion of most related individuals based on PLINK kinship coefficient (φij) and VanRaden's φij, respectively. The third and fourth strategies were based on a node selection algorithm. The fourth strategy, Network G matrix, preserved the larger number of individuals with a better diversity and representation from the initial sample. Determining the most probable number of populations was directly affected by the kinship metric. Network G matrix was the better strategy for reducing relatedness due to producing a larger sample, with more distant individuals, a more similar distribution when compared with the full dataset in the MDS plots and keeping a better representation of the population structure. Resampling strategies using VanRaden's φij as a relationship metric was better to infer the relationships among individuals. Moreover, the resampling strategies directly impact the genomic inflation values in Genome-wide association studies. The use of the node selection algorithm also implies better selection of the most central individuals to be removed, providing a more representative sample.

Notes

Files

Gwas_simulations_All_Animals.zip

Files (1.9 GB)

Name Size Download all
md5:3de4c333b7dcb6ea510b120e0cada955
375.5 MB Preview Download
md5:38af44549e09a713bd07e23ac369d44a
372.0 MB Preview Download
md5:1d044155e10ae2424543ceca53659cb8
369.1 MB Preview Download
md5:d5fae145ee7621caaf7a4e7c34769ff3
362.5 MB Preview Download
md5:777dafdff62e1906dd33f0468387cf7d
361.3 MB Preview Download
md5:38debd261e7dd64b5c42c4a799c3a509
2.1 kB Download
md5:8087c38bf3429aed23d18ac66a0b0bf7
19.9 MB Preview Download
md5:880d40be5d3a73baed26c594b6c86bc4
39.1 MB Preview Download

Additional details

Related works

Is cited by
10.1111/1755-0998.12746 (DOI)