Consider a genome whose length is 1000 bp. "Shotgun" sequencing techniques are applied to the genome, resulting in 20 reads, with an average length of 50 bp. A very important point is that, even though 20×50 = 1000, there is no guarantee that ALL 1000 bp of the genome are represented in the fragments. Calculate the coverage. What does this value mean? Why would it be a good idea to have a coverage greater than 1?

Human Heredity: Principles and Issues (MindTap Course List)
11th Edition
ISBN:9781305251052
Author:Michael Cummings
Publisher:Michael Cummings
Chapter15: Genomes And Genomics
Section: Chapter Questions
Problem 6QP: Which of the following best describes the process of DNA sequencing? a. DNA is separated on a gel,...
icon
Related questions
Question
Consider a genome whose length is 1000 bp. "Shotgun" sequencing techniques are applied to the genome, resulting in 20 reads, with an average length of 50 bp.
A very important point is that, even though 20×50 = 1000, there is no guarantee that ALL 1000 bp of the genome are represented in the fragments. Calculate the coverage. What does this value mean? Why would it be a good idea to have a coverage greater than 1?
Objective: Get a sense of how genomics, the study of the genome in its entirety,
needs to think about how to go about its research.
Geonomic DNA is broken up into fragments. The 5' and 3' ends of each fragment
(a "read") are sequenced. The sequenced reads are assembled together into
contiguous sequences (“contigs") based on sequence similarity.
The idea is to sequence enough random fragments so that every nucleotide in the
genome is represented on some read. The number of such fragments needed is
called the coverage, c.
The coverage c can be calculated by the formula RL/G, where R is the number of
reads sequenced, L is the average length of a read and G is the total length of the
genome. The units of length are bases (b) or base pairs (bp).
1. Consider a genome whose length is 1000 bp. "Shotgun" sequencing techniques
are applied to the genome, resulting in 20 reads, with an average length of 50 bp.
A very important point is that, even though 20 × 50 1000, there is no guarantee
that ALL 1000 bp of the genome are represented in the fragments. Calculate the
coverage. What does this value mean?
Transcribed Image Text:Objective: Get a sense of how genomics, the study of the genome in its entirety, needs to think about how to go about its research. Geonomic DNA is broken up into fragments. The 5' and 3' ends of each fragment (a "read") are sequenced. The sequenced reads are assembled together into contiguous sequences (“contigs") based on sequence similarity. The idea is to sequence enough random fragments so that every nucleotide in the genome is represented on some read. The number of such fragments needed is called the coverage, c. The coverage c can be calculated by the formula RL/G, where R is the number of reads sequenced, L is the average length of a read and G is the total length of the genome. The units of length are bases (b) or base pairs (bp). 1. Consider a genome whose length is 1000 bp. "Shotgun" sequencing techniques are applied to the genome, resulting in 20 reads, with an average length of 50 bp. A very important point is that, even though 20 × 50 1000, there is no guarantee that ALL 1000 bp of the genome are represented in the fragments. Calculate the coverage. What does this value mean?
Expert Solution
steps

Step by step

Solved in 3 steps

Blurred answer
Knowledge Booster
Genome annotation
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, biology and related others by exploring similar questions and additional content below.
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
Human Heredity: Principles and Issues (MindTap Co…
Human Heredity: Principles and Issues (MindTap Co…
Biology
ISBN:
9781305251052
Author:
Michael Cummings
Publisher:
Cengage Learning