Published February 21, 2023 | Version v2023.03.01
Dataset Open

[Dataset] Data for the course "Population Genomics" at Aarhus University

  • 1. Aarhus University

Description

Datasets, conda environments and Softwares for the course "Population Genomics" of Prof Kasper Munch. This course material is maintained by the health data science sandbox. This webpage shows the latest version of the course material.

  1.  Data.tar.gz Contains the datasets and executable files for some of the softwares
    You can unpack by simply doing
    tar -zxf Data.tar.gz -C ./
    This will create a folder called Data with the uncompressed material inside
  2. Course_Env.packed.tar.gz Contains the conda environment used for the course. This needs to be unpacked to adjust all the prefixes (Note this environment is created on Ubuntu 22.10). You do this in the command line by
    1. creating the folder Course_Env:  mkdir Course_Env
    2. untar the file: tar -zxf Course_Env.packed.tar.gz -C Course_Env
    3. Activate the environment: conda activate ./Course_Env
    4. Run the unpacking script (it can take quite some time to get it done): conda-unpack
  3. Course_Env.unpacked.tar.gz The same environment as above, but will work only if untarred into the folder /usr/Material - so use the version above if you are using it in another folder. This file is mostly to execute the course in our own cloud environment.
  4. environment_with_args.yml The file needed to generate the conda environment. Create and activate the environment with the following commands:
    1. conda env create -f environment_with_args.yml -p ./Course_Env
    2. conda activate ./Course_Env

 

The data is connected to the following repository: https://github.com/hds-sandbox/Popgen_course_aarhus. The original course material from Prof Kasper Munch is at https://github.com/kaspermunch/PopulationGenomicsCourse.

 

Description

The participants will after the course have detailed knowledge of the methods and applications required to perform a typical population genomic study.

The participants must at the end of the course be able to:

  • Identify an experimental platform relevant to a population genomic analysis.
  • Apply commonly used population genomic methods.
  • Explain the theory behind common population genomic methods.
  • Reflect on strengths and limitations of population genomic methods.
  • Interpret and analyze results of population genomic inference.
  • Formulate population genetics hypotheses based on data

The course introduces key concepts in population genomics from generation of population genetic data sets to the most common population genetic analyses and association studies. The first part of the course focuses on generation of population genetic data sets. The second part introduces the most common population genetic analyses and their theoretical background. Here topics include analysis of demography, population structure, recombination and selection. The last part of the course focus on applications of population genetic data sets for association studies in relation to human health.

Curriculum

The curriculum for each week is listed below. "Coop" refers to a set of lecture notes by Graham Coop that we will use throughout the course.

Course plan

  1. Course intro and overview:
  2. Drift and the coalescent:
    • Coop chapter 4; Paper: Platypus
    • Exercise: Read mapping and base calling
  3. Recombination:
  4. Population strucure and incomplete lineage sorting:
  5. Hidden Markov models:
  6. Ancestral recombination graphs:
  7. Past population demography:
  8. Direct and linked selection:
  9. Admixture:
  10. Genome-wide association study (GWAS):
  11. Heritability:
    • Lecture: Coop Lecture notes Sec. 2.2 (p23-36) + Chap. 7 (p119-142)
    • Exercise: Association testing
  12. Evolution and disease:
    • Lecture:  Coop Lecture notes Sec. 11.0.1 (p217-221)
    • Exercise: Estimating heritability

Files

Files (12.8 GB)

Name Size Download all
md5:8ff44ad0e3ef46e4a6bd4385c82ab155
845.1 MB Download
md5:b2850992cd083141862e2dcad6206d4c
845.1 MB Download
md5:d5ece0d6bd64cfabcc034fab6a679287
11.1 GB Download
md5:0ac6d902adba0806d6ccd178b79f2039
615 Bytes Download