- Course overview
- Search within this course
- What is antimicrobial resistance?
- How do we study pathogens?
- Public pathogen data
- A guide to the Pathogens Portal
- Identification and investigation of antimicrobial resistance genes
- Looking for antimicrobial resistance genes in different environments
- Data sharing
- The future of AMR
- Crossword: Test your knowledge
- Your feedback
- Further resources
- Help and support
- Glossary
- References
Assembly
Assembly is the process of reconstructing an organism’s complete genome from sequencing reads. The goal is to generate multiple contiguous sequences, known as contigs, which can be further combined to represent the entire genome. There are two main approaches used for generating genome assemblies from sequencing data:
- Reference-based assembly – The genome is assembled by aligning sequencing reads to an existing reference genome of the same strain, species, or a closely related organism. This approach is efficient and accurate when a high-quality reference genome is available, and is less computationally intensive than de novo assembly.
- De novo assembly – The genome is assembled from scratch, without requiring a reference. Sequencing reads are pieced together based on overlapping regions, allowing the original genome to be reconstructed. This approach is used when a reference genome for an organism does not exist, or if the organism has a highly variable genome.
