Author Diana Moreno (dmorenos@ttu.edu)
Flye is a de novo assembler for single molecule sequencing reads, such as those produced by PacBio and Oxford Nanopore Technologies. It is designed for a wide range of datasets, from small bacterial projects to large mammalian-scale assemblies. It takes raw PB / ONT reads as input and outputs polished contigs.
To install flye with conda, simply run this command:
Warning
Flye is a bioconda package, therefore we need to have bioconda enabled first.
If bioconda is not enabled do the following:
Before running flye, check the available memory. For a human genome with 30x coverage, you will need ~800Gb at peak.
Flye can be easily run with a simple command line:
For nanopore raw sequences (i.e no corrected)
For pacbio raw sequences (i.e no corrected)
Flye run can take from 1 to 2 weeks on a 2GB mammal genome, if the run stops you can always restarted with the –resume-from option (e.g. –resume-from polishing)
The results will be saved on the output directory.
Genome analysis
, Diana