---
title: "nf-core/genomeassembler assembly validation and curation workflow planning"
tags: pipeline,genomeassembler,planning,plans
---
# nf-core/genomeassembler assembly validation and curation discussion
## Desired components:
- K-mer copy number spectra profile - Merqury
- Contamination profile - Blobtools
- Gene space checks - Busco ( Dogma too?)
- Assembly comparison and stats - Quast
- Other accuracy metrics ( Inspector ? FRCBam ? )
- Pairwise genome plots
- Assembly graph plots ( Bandage )
- Read coverage ( Qualimap, Blobtools )
- Output genome browser tracks with useful information, like polished areas, missassembly signals, coverage, etc
## Workflow diagram draft:
```mermaid
flowchart LR
Assemblies --> Quast
Assemblies --> BUSCO
Assemblies --> on-demand:Bakta
Assemblies --> Merqury
PacBioHifi --> Merqury
```
## Implementations:
- [Pairwise genome comparison workflow](https://github.com/oist/plessy_pairwiseGenomeComparison)
- Generate list of pairwise inputs:
```nextflow
workflow {
Channel.of( 'A', 'B', 'C' )
.toSortedList()
.flatMap {
it.subsequences()
.findAll { it.size() == 2 }
}.view()
// Output is [B, C], [A, B], [A, C]
}
```