---
tags: BMMB554-23
---
# Lecture 22: Assembly from long reads
-------
Before we begin, some fundamental terms:
- **Contig** - A contiguous sequence reconstructed by assembling overlapping sequencing reads.
- **Scaffold** - An ordered collection of contigs. The sequence within the gaps between the contigs is usually not known.
- **N50** - A statistic for assessing the contiguity of a genome assembly. The contigs are sorted by length, longest first, and their lengths are added up cumulatively. N50 is the length of the contig at which the running total first reaches or exceeds 50% of the total assembly size. (When 50% of the estimated genome size is used as the threshold instead, the statistic is called NG50.)
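
A minimal sketch of the N50 calculation described above may help make the definition concrete. The function name `n50`, its optional `total` argument, and the toy contig lengths are made up for illustration and do not come from any particular assembly tool. By default the 50% threshold is taken relative to the sum of the contig lengths; passing a genome size estimate as `total` uses that as the reference instead.

```python
def n50(contig_lengths, total=None):
    """Return the N50 of a collection of contig lengths.

    total: reference length for the 50% threshold. Defaults to the sum of
    the contig lengths; pass a genome size estimate to use that instead.
    Returns None if the contigs never reach half of `total`.
    """
    # Sort contigs from longest to shortest.
    lengths = sorted(contig_lengths, reverse=True)
    if total is None:
        total = sum(lengths)

    running = 0
    for length in lengths:
        running += length
        # Stop at the first contig that brings the running total to >= 50%.
        if 2 * running >= total:
            return length
    return None


# Hypothetical toy assembly: seven contigs, 300 bp in total.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))             # threshold = 50% of the 300 bp assembly
print(n50(toy_contigs, total=400))  # threshold = 50% of an assumed 400 bp genome
```

In the toy example the two longest contigs (80 + 70 = 150) already cover half of the 300 bp assembly, so the first call reports 70. With an assumed genome size of 400, a third contig is needed to reach 200, so the second call reports 50.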

<small>Image credit: Mike Schatz</small>

Galaxy histories containing the zebra finch assembly are [here](https://galaxyproject.org/projects/vgp/workflows/).

---
<iframe src="https://docs.google.com/presentation/d/e/2PACX-1vSOWjQT9zSrrPQmmBZtLtGG-FmONWFlqa_Dc5Y2AqeGOM-CfFb9W8EV1GBYzMmx-0T4649oLWqkuL88/embed?start=false&loop=false&delayms=3000" frameborder="0" width="683" height="541" allowfullscreen="true" mozallowfullscreen="true" webkitallowfullscreen="true"></iframe>
