Hackathon March 2021


DSL2


Primary focus

  1. Add new modules
  2. Add tests to existing modules
  3. Add overhead memory requirements

Group members

  • Harshil Patel
  • Kevin Menden
  • Friederike Hanssen
  • Gregor Sturm
  • Jose Espinosa
  • Michael Heuer
  • Francesco Lescai
  • Robert Petit
  • Mark S. Hill
  • Carlos Ruiz
  • Batool Almarzouq
  • Santiago Revale
  • Aron Skaftason
  • Hédia Tnani
  • Anthony Fullam
  • Suzanne Jin
  • Ravneet Bhuller
  • Kevin Brick
  • Pernilla Ericsson
  • Alex Peltzer
  • Maxime Borry
  • Yuk Kei Wan
  • Nick Toda
  • Edmund Miller

New modules - Achievements

Update test files for all modules

  • New test_data.config
  • Requires ALL modules to be updated
  • Everyone involved

New module sequenzautils/bam2seqz

  • opened a new issue
  • work in progress

New module sequenzautils/gcwiggle

New module cnvkit

New module picard/collectwgsmetrics

  • Answered issue https://github.com/nf-core/modules/issues/264
    • Most of the heavy-lifting had already been done by hpatel
  • PR submitted: https://github.com/nf-core/modules/pull/304
    • Currently failing 1 EditorConfig lint test
    • Info: Can't use md5sum checks for this picard module with pytest due to time stampe in output. Used contain arguments instead.
    • Info: Pytest still complains about exit code locally, even though running the nextflow command on it's own seems to work fine

New module prokka

Update module shovill

New module vcftools

New module samtools/merge

New module samtools/fastq

New module strelka/germline

New module ucsc/bed12tobigbed

New module freebayes/single

  • Created the software
  • Running the tests, error encountered

New module fgbio/fastqtobam

  • Added module FastqToBam from FGBIO toolkit, to be used as part of workflow to handle UMIs
  • Responds to issue https://github.com/nf-core/modules/issues/189
  • PR submitted as https://github.com/nf-core/modules/pull/306
  • Tuesday update:
    • quite some work into fixing a strange change in the pytest yaml, and mismatch with upstream (had to revert changes, rebase, checkout a specific file, add missing test manually)
    • pytests are failing with different messages
    • docker: complaining a file that should be there is not actually there. running test locally (but error message is not transparent with pytest) seems there might be a problem with the docker image of fgbio in biocontainers
    • conda: test runs just fine locally, but on github CI pytest it complains the md5sum is not equal to expected
    • singularity: test complains md5sum not equal to expected (same as with conda)
  • Wednesday update
    • after further testing (all nextflow local run worked, while pytest variable depending on environment) I have removed the md5sum for the output bam file
    • now both conda and singularity CI tests pass
    • further conflicts resolved
  • docker remains an issue, with likely the biocontainer running into a library problem: this might require modification of the original container and therefore cannot be resolved within the timeframe of the hackathon.

New module adam/markduplicates

New module unicycler

New module: AdapterRemoval

New module: bismark/align

  • added module for alignment with bismark

New module: bismark/report

  • added module for bismark_report
    • generates single-sample reports from bismark alignments

New module: bismark/summary

  • added module for bismark_summary report
    • generates summary reports over many samples

New module: allelecounter

New module Kallisto/quant

New module: DamageProfiler

New module: prodigal [Gregor]

New module: gatk4/applybqsr

New module: gatk4/baserecalibrator

New module: gatk4/indexfeaturefile

new module: gatk4/fastqtosam

  • Added gatk4 module for converting fastq to sam files
  • Merged with master branch

New module: gatk4/genotypegvcfs

New module: gatk4/markduplicates

New module: ensembl-VEP

New module: msisensor/scan

New module: msisensor/msi

New module: kb/ref

  • Submitted by (@flowuenne)
  • Issue: https://github.com/nf-core/modules/issues/358
  • Module for creating a reference for scrnaseq pipeline processing with kb
  • In progress:
    • test data does not seem to work with indexing of kb ref
    • Need human test data. Local test with only chr21 worked. Simply need a small file with few genes and transcripts and subset genomic fasta.
    • Will wait for new human test data added by @Friederike Hanssen to creat PR

New module: kb/count

  • Submitted by (@flowuenne)
  • Issue: https://github.com/nf-core/modules/issues/360
  • Module for quantifying scRNA-seq data (fastq files) using kb-python
  • In progress:
    • For proper tests, will need the index created by kb/ref from new test data. Local test done with index of chromosome 21 work.
    • Will wait for new human test data added by @Friederike Hanssen to creat PR

New module: NanoPlot

  • Yuk Kei (Wednesday): this module also uses sequencing_summary.txt in addition to viralcon's NanoPlot module, which only uses the fastq option
  • Submitted PR: https://github.com/nf-core/modules/pull/364
  • PyTest working locally but not on GitHub checks

Modules Testing - Achievements

Human data set

  • Rike (Monday): Search for a suitable human data set
  • Rike (Tuesday): Download data, convert to fastqs, determine UMI Adapter, start sarek
  • Rike (Wednesday): Data is finally running now to generate all the indices, unfortunately not enough reads mapped to chr6, so chr22 it is for now

Nanopore SARS-CoV2 test dataset

Changes to test data

  • Add readgroup to existing test bam files for compatibility with gatk tools
  • Add recalibration table (.table) for testing gatk tools.
  • Updated fastp tests to deal with non-deterministic ouputs

Remove runOptions from docker profile

Other - Achievements

  • Re-organised test data for different platforms

  • New test data config (@everyone!🎉)

  • Update CI tests to add linting and to bypass limit (Edmund)

  • Added more docs about local installation (properly tested modules has helped!)

  • Updated PR template to encourage linking the corresponding issues [#312 by Gregor]

  • Add docs for running tests locally with pytest [#338 by Harshil]

modules with custom scripts


Select a repo