--- tags: ggg, ggg2020, ggg298 --- # GGG 201(b), Lab Homework 1 Due at the beginning of class on Friday, 1/31. ## 1. Sign up for GitHub Classroom Sign up for GitHub Classroom using https://classroom.github.com/a/VoyiLDCW. This will probably involve creating a (free) GitHub account. ## 2. Log into farm & clone your github repo for hw1 On farm, clone your assignment repository, which will be something like `https://github.com/dib-lab/2020-ggg-201b-lab-hw1-ctb`; do so like this, ``` git clone YOUR_REPO_URL 201b-lab-hw1 ``` Change into that directory: ``` cd 201b-lab-hw1/ ``` ## 3. Activate the `vc` conda environment. ``` conda activate vc ``` ## 4. Edit your Snakefile to do variant calling on three more sample data sets. Add these three data sets to the Snakefile; they are all in `~ctbrown/data/ggg201b`. ``` SRR2584403_1.fastq.gz SRR2584404_1.fastq.gz SRR2584405_1.fastq.gz ``` Please make sure of the following: * the VCF output should be sent to `SAMPLE-variants.vcf` for each of the four samples. * Running `snakemake` by itself should go from raw data to final results for all samples (see 'default rule' instructions from [week 3](https://hackmd.io/cGYzxz07SseGxH0y2gjYJw?view#Create-a-good-%E2%80%9Cdefault%E2%80%9D-rule)) * All of the generated files (including intermediates) are in at least one 'output:' annotation, so that e.g. `snakemake --delete-all-output` removes all of the generated files in the directory. (See [this section of lab 3](https://hackmd.io/cGYzxz07SseGxH0y2gjYJw?view#Re-running-everything)) It turns out that the copy_data rule for all four files is a bit tricky; here's a freebie, courtesy of Ashleigh: ``` # copy data from /home/ctbrown/data/ggg201b rule copyData: output: "SRR2584857_1.fastq.gz", "SRR2584403_1.fastq.gz", "SRR2584404_1.fastq.gz", "SRR2584405_1.fastq.gz" shell: "ln -s /home/ctbrown/data/ggg201b/*.fastq.gz ." ``` ## 5. Commit and push your changes back to github. At any time, do the following to save changes. ``` git commit -am "updated Snakefile" git push origin master ``` (you can do this as many times as you want, and save intermediate changes, etc. etc.) ## 6. Relax in knowledge of a job well done. If you run into any trouble with submission, that's ok - we'll debug it on Friday's class. Reminder - you can ask questions and request help [on the MAD site](https://mad.oxli.org/c/GGG-lab-courses-for-winter-2020/9)!