Problem Set 2 - Ancestry

Create a directory for Problem Set 2 inside your working directory, with subdirectories for code and results:

mkdir /oasis/projects/nsf/csd524/$USER/ps2
mkdir /oasis/projects/nsf/csd524/$USER/ps2/code
mkdir /oasis/projects/nsf/csd524/$USER/ps2/results
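
The three commands above fail if a directory already exists. As a minimal sketch of a re-runnable alternative, the helper below creates the same layout with mkdir -p, which also creates missing parent directories and does not complain about existing ones. The function name make_ps2_dirs and its base-directory argument are hypothetical and used only for illustration.

```shell
# Hypothetical helper (not part of the course materials):
# create ps2/code and ps2/results under the base directory given as $1.
make_ps2_dirs() {
    base="$1"
    # -p creates intermediate directories (including ps2 itself)
    # and succeeds silently if they already exist.
    mkdir -p "$base/ps2/code" "$base/ps2/results"
}
```

For the layout used here, it would be called as: make_ps2_dirs /oasis/projects/nsf/csd524/$USER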

Installing Python packages

Use the following command to install the Python packages used in this problem set:

pip install --user scikit-learn pandas pyvcf
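
To confirm the install worked, one option is to try importing each package. The sketch below assumes the usual import names (sklearn for scikit-learn, pandas, and vcf for PyVCF) and that the interpreter is reachable as python. The function name check_py_modules and the PYTHON override variable are hypothetical.

```shell
# Hypothetical helper: report whether each named Python module can be imported.
# PYTHON is an assumed override for the interpreter name (defaults to "python").
check_py_modules() {
    rc=0
    for mod in "$@"; do
        if "${PYTHON:-python}" -c "import $mod" 2>/dev/null; then
            echo "ok: $mod"
        else
            echo "MISSING: $mod"
            rc=1
        fi
    done
    # Non-zero exit status if any module failed to import.
    return $rc
}
```

It could then be run as: check_py_modules sklearn pandas vcf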

PS2 data

The data directory contains the following files you will use in the problem set:

PS2 templates

Using 23andMe data

and edit the paths appropriately.