* During the workshop, you will be able to register for the second day if you want to attend.
To register, follow the instructions on the workshop's UPPMAX web page: https://www.uppmax.uu.se/support/courses-and-workshops/awk-workshop-winter-2022/
Linux command line tools survey: https://forms.gle/8brjbNEav6uPSKBe6
Results (including previous surveys)
Course material: https://pmitev.github.io/to-awk-or-not/
Q&A: https://hackmd.io/@pmitev/to-awk-or-not-QA
Suggest a topic or check recent suggestions
Have a brief look at the course web page https://pmitev.github.io/to-awk-or-not/ to preview the contents of the workshop. The course will not cover all of the available material, but will instead present typical examples and solutions to some common problems.
On the first day in the morning, the course will start with a general introduction and basic concepts of the tool. The material is not organized linearly, so we can try to adapt the material of the course guided by your questions and particular interests.
In the afternoon exercise session, we will practice awk on typical situations that are likely to resemble problems relevant to your work. The material covered on the first day should be enough to learn how to use awk for its most common purposes: awk "one-liners" and small scripts.
If you decide to attend the second day, we will start with a task from the bioinformatics field and try to go through a tutorial which will demonstrate how one can combine awk with other common command-line tools to analyze and manipulate the output from genome analysis.
Then we will focus on more advanced features of awk, illustrated with some easy-to-follow "case studies" from the materials science field, mixed again with examples from the bioinformatics field. The order will depend on your expressed interests.
The Zoom meeting will be active 30 minutes before the workshop, so we can help you with trivial setup problems and general questions. If you have serious trouble with the setup, please let us know in advance (by replying to this email), so we can try to resolve the problem.
We will start with this tutorial https://pmitev.github.io/to-awk-or-not/Case_studies/manipulating_vcf/
There are 3 large files that need to be downloaded if you want to follow the tutorial yourself. Here are the direct links to download them with wget or another program.
If you work on Rackham, they will be available in /tmp/awk-course/.
Documentation for the file formats:
GFF3: http://genome.ucsc.edu/FAQ/FAQformat.html#format3
FASTA: http://genetics.bwh.harvard.edu/pph/FASTA.html
VCF: http://genome.ucsc.edu/FAQ/FAQformat.html#format10.1
AWK is an interpreted programming language designed for text processing and typically used as a data extraction and reporting tool.
This two-day workshop aims to promote and demonstrate the flexibility of the tool in situations where the overhead of more sophisticated approaches and programming languages is not worth the bother.
Learn how to use awk like a grep command that is capable of arithmetic selection rules, with control over the content of the matched lines.
Use what you learn and dive into the basic concepts of bioinformatics with simple exercises on typical scientific problems and tasks.
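As a rough illustration of the grep-like use mentioned above, here is a minimal sketch. The sample data, the column layout and the threshold are invented for this example and are not taken from the course material.

```shell
# Hypothetical sample data: name, chromosome, score (whitespace separated)
data='geneA chr1 12
geneB chr2 87
geneC chr1 45
geneD chr1 3'

# grep-like: print every line that matches a regular expression
matched=$(printf '%s\n' "$data" | awk '/chr1/')

# Arithmetic selection rule plus control over the output:
# lines on chr1 with a score (field 3) above 10, printing only name and score
selected=$(printf '%s\n' "$data" | awk '$2 == "chr1" && $3 > 10 { print $1, $3 }')

printf '%s\n' "$selected"
```

The second command is where awk goes beyond grep: the condition compares a numeric field, and the action decides which fields of the matched line are printed.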
Second day case studies
Use awk to ease typical computational setup scenarios
$ script.awk filename parameter1 parameter2
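One possible way to make a call of this form work is sketched below. The script body, the file name data.txt and the meaning of the two parameters (a scale factor and an offset) are assumptions made for illustration only; the scripts used in the workshop may handle their arguments differently.

```shell
# Work in a temporary directory so nothing is left behind
workdir=$(mktemp -d)

# Hypothetical input file: one number per line
printf '1\n5\n10\n' > "$workdir/data.txt"

# Hypothetical script.awk: multiply each value by parameter1, then add parameter2
cat > "$workdir/script.awk" <<'EOF'
#!/usr/bin/awk -f
BEGIN {
    scale  = ARGV[2]    # parameter1
    offset = ARGV[3]    # parameter2
    # Empty ARGV entries are skipped, so awk will not try to open
    # the parameters as input files
    ARGV[2] = ""
    ARGV[3] = ""
}
{ print $1 * scale + offset }
EOF
chmod +x "$workdir/script.awk"

# Run through "awk -f" here; calling the script directly also works
# on systems where awk is located at /usr/bin/awk
result=$(awk -f "$workdir/script.awk" "$workdir/data.txt" 2 0.5)
printf '%s\n' "$result"

rm -rf "$workdir"
```

With the sample input, this prints 2.5, 10.5 and 20.5 on separate lines. An alternative design is to pass values as `name=value` arguments or with `-v`, which avoids editing ARGV by hand.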
The system-provided awk version will work for most of the examples during the workshop, with a few exceptions, which are noted in the online material.
Tilde (~) sign on Mac with Swedish keyboard layout: Alt + ^
Several distributions have other awk flavors installed by default. The easiest fix is to install the GNU version, gawk, e.g. on Ubuntu: sudo apt install gawk
Just follow a tutorial on how to set up and use the virtual Linux environment.
The meeting will be active 30 minutes before the workshop, so we can help you with some trivial setup problems and general questions.
The workshop is intended to be interactive, which can be challenging on computers with small screens. It is best to leave space for both the shared Zoom window and an active terminal. The material for the workshop is available online at https://pmitev.github.io/to-awk-or-not/