Concept explainers
The genome of the bacterium Neisseria gonorrhoeae consists of one double-stranded DNA molecule that contains 2220 kilobase pairs. If 85% of this DNA molecule is made up of the open reading frames of genes encoding proteins, and the average protein is 300 amino acids long, how many protein-encoding genes does Neisseria have? What kind of genetic information is present in the other 15% of the DNA?
To discuss:
The Neisseria gonorrhoeae bacterial genome consists of one double stranded DNA and that contains 2220 Kbp. If 85% of this DNA sequence is made up of open reading frames of genes encoding proteins, and the average protein size is 300 amino acids long, Neisseria contains how many protein-encoding genes. Other 15% of the DNA sequence contains what kind of genetic information.
Concept introduction:
Open reading frames or ORF is a specific segment of DNA or RNA molecule and that part can be translated into a protein sequence. An open reading frame (ORF) (sequence of nucleotides) contains a start codon (AUG) followed by a stretch of various codons and end with a stop codon (UAA, UGA, or UAG). An ORF region in the mRNA is essential for its translation.
Explanation of Solution
The Neisseria gonorrhoeae bacterial genome contains 2220 Kbp or 2,220,000 bp.
Each base pair size is 0.34 nm.
Therefore 2220 Kbp x 0.34 nm is 754, 800 nm.
The length of DNA is 754, 800 nm or 0.07548 cm
If 85% of the bacterial genome is composed of open reading frames, 1,887 Kbp of DNA sequence could be the open reading frames.
(
Average size of the protein is 300 amino acids. Each amino acid is encoded by three nucleotides (one codon). Therefore, 900 bp of DNA sequence in the open reading frame encode proteins.
Number of protein coding bacterial gene is 2097. The remaining 15 % of the bacterial DNA can be non-coding genes, which may regulate gene expression.
Want to see more full solutions like this?
Chapter 4 Solutions
Brock Biology of Microorganisms (15th Edition)
- You are studying a large eukaryotic gene that is 439,515 base pairs long. You find the polypeptide that this gene produces in liver cells is 46,771 amino acids long. Your colleague studies the function of this gene in brain cells, and finds the polypeptide produced in the brain is much larger – 61,438 amino acids long. How do you explain this difference? Possible Answers: A. The cell cycle of liver cells is much longer than that of brain cells. B. This is due to alternative splicing. in the brain C. There was a different complement of sequence-specific transcription factor binding sites in the CRM of the brain cells. D. There is no 5' cap added to the gene product from the liver cells.arrow_forwardDuchenne muscular dystrophy is caused by a mutation in a gene that comprises 2.5 million base pairs and specifies a protein called dystrophin. However, less than 1% of the gene actually encodes the amino acids in the dystrophin protein. On the basis of what you now know about gene structure and RNA processing in eukaryotic cells, provide a possible explanation for the large size of the dystrophin gene.arrow_forwardIn addition to the standard base-paired helical structures, DNA can form X-shaped hairpin structures called cruciforms in which most bases are involved in Watson–Crick pairs. Such structures tend to occur at sequences with inverted repeats. Draw the cruciform structure formed by the DNA sequence TCAAGTCCACGGTGGACTTGC.arrow_forward
- The mass of a gene is 32,400 units. The amount of introns is twice the amount of exons. What is the mass of protein that encodes this gene, if the mass of the amino acid is 110 and the nucleotide is 300 units.arrow_forwardPeople who carry a theoretical genetic disorder (called B-disease) can be identified from a 2kb DNA sequence.People who carry this genetic disorder have a single nucleotide polymorphism that results in a change of GTATCC to GGATCC, a site that only occurs once at nucleotide number 750 in this DNA sequence. Answer the following questions based on the information provided. (a) How can you develop a simple molecular test to identify the genetic disorder? (b) If you have carried out the molecular test (based on the information above) on a 100 individual and found that 24 were healthy (BB) and 26 were carriers (bb); 1) What is the ratio of heterozygous? 2) Show how can you identify the three types from the agarose gelarrow_forwardPeople who carry a theoretical genetic disorder (called B-disease) can be identified from a 2kb DNA sequence. People who carry this genetic disorder have a single nucleotide polymorphism that results in a change of GTATTC to GGATTC, a site that only occurs once at nucleotide number 750 in this DNA sequence. Answer the following questions based on the information provided. (a) How can you develop a simple molecular test to identify the genetic disorder? (b) If you have carried out the molecular test (based on the information above) on a 100 individuals and found that 24 were healthy (BB) and 26 were carriers (bb); 1) What is the ratio of heterozygous? 2) Show how can you identify the three types from the agarose gel.arrow_forward
- You have sequenced the genome of the bacterium Salmonella typhimurium and find a protein that is 100 percent identical to a protein in the bacterium Escherichia coli. When you compare nucleotide sequences of the S. typhimurium and E. coli genes, you find that their nucleotide sequences are only 87 percent identical. How would you interpret the observations? Please make sure to select ALL correct answer options. Because genetic code is redundant, changes in the DNA nucleotide sequence can occur without change to its encoded protein. Due to the flexibility in the third positions of most codons, the DNA sequence can accumulate changes without affecting protein structure. Natural selection will eliminate many deleterious amino acid changes. This will reduce the rate of change in the amino acid sequence and lead to sequence conservation of the proteins. Protein sequences are expected to evolve and diverge more slowly than the genes that encode them.arrow_forwardPeople who carry a theoretical genetic disorder (called B-disease) can be identified from a 2kb DNA sequence. People who carry this genetic disorder have a single nucleotide polymorphism that results in a change of GTATCC to GGATCC, a site that only occurs once at nucleotide number 750 in this DNA sequence. Answer the following questions based on the information provided. (a) How can you develop a simple molecular test to identify the genetic disorder?r B-dif w. (41 (b) If you have carried out the molecular test (based on the information above) on a 100 individual and found that 24 were healthy (BB) and 26 were carriers (bb); 1) What is the ratio of heterozygous? 2) Show how can you identify the three types from the agarose gel (H focaiarrow_forwardHere is a eukaryotic gene. The numbers given are base pairs of exon and intron. How long in bases will the pre mRNA transcript be? Explain briefly. What is the maximum number of amino acids that could make up the protein product from the final mRNA? Explain briefly.arrow_forward
- A 2500 bp region of the human genome encodes two genes. One of the genes encodes a protein of 600 amino acids and the other gene encodes a protein of 280 amino acids. The mRNA sequences of the two genes do not contain any of the same nucleotide sequences (i.e. they do not overlap). How is this possible? Fully explain your answer.arrow_forwardIn the table below, there are four versions of gene A, one of which is normal, and the other three which contain mutations that make the gene product nonfunctional. Focus on the shaded region of the sequence. Use the genetic code table to answer the question. How would you describe Mutation #2? Partial DNA sequence for gene A ("..." indicates many nucleotides of sequence not shown) 5' ... ATG GTG AGC AAG GAG GAG CTG TTC ACC TGT AAA TAG ... Normal Mutation #1 5' ... ATG GTG AGC AAG GAG AAG CTG TTC ACC TGT AAA TAG ... Mutation #2 5' ... ATG GTG AGC AAG TAG GAG CTG TTC ACC TGT AAA TAG ... Mutation #3 5' ... ATG GTG AGC AAG GAG CTG TTC ACC TGT AAA TAG ... Silent mutation Nonsense mutation Frameshift mutations Missense mútationarrow_forwardCystic fibrosis (CF) is an inherited disorder caused by different types of mutations, many of which prevent ions from moving across cell membranes. Normally there are channel proteins that allow passage of the ions, but in patients with one kind of CF these proteins seem odd. Closer examination shows that these proteins display the correct amino acid sequence. However, they fail to do their job. A) Given that the primary structure of the protein is correct, what can you infer about the DNA sequence for the gene coding this protein on this patient, is there a mutation? Explain. B) Why is the primary structure insufficient to guarantee the proper function of the protein?arrow_forward
- Biology: The Dynamic Science (MindTap Course List)BiologyISBN:9781305389892Author:Peter J. Russell, Paul E. Hertz, Beverly McMillanPublisher:Cengage LearningHuman Heredity: Principles and Issues (MindTap Co...BiologyISBN:9781305251052Author:Michael CummingsPublisher:Cengage Learning