A. Explain the reason why the cell requires a mechanism for identifying specific sequences of DNA

B. On average, how often would the nucleotide sequence CGATTG be expected to occur in a DNA strand 4000 bases long? Show your work and explain your answer.
C. Molecular processes depend upon sequence-specific interactions of proteins with DNA. Recognition sequences can be 4, 5, 6, 7, or even 8 base pairs in length for a single protein. What might be the advantages of a short recognition sequence? What might be the advantage of a longer recognition sequence?


A. Sequence information contains indicators important for the regulation of gene expression and DNA packaging. Examples include sequence indicators for where a gene starts and ends, where transcription begins, and where to assemble specific protein complexes at specialized sequences such as those found in telomeric or centromeric DNA.
B. Because 46 (= 4096) different sequences of six nucleotides can occur in DNA, any given sequence of six nucleotides would be expected to occur on average once in a DNA strand 4000 bases long, assuming a random distribution of sequences.
C. Short recognition sequences do not have as many sequence-specific contacts (which means they don't bind as tightly to the binding site in question), and they are more likely to be found randomly throughout the genome. Using the same type of calculation from part B, there are 256 possible combinations for a 4-base-pair recognition sequence, which could be found 15–16 times over a 4000-base-pair segment by random chance. This could be useful for proteins that need to bind to a large number of sites with low affinity. If we take the case of the 8-base-pair sequence, there are 65,536 different possible sequences. So, not only do they represent high-affinity binding sites, they are much less likely to be found by random chance.

Biology & Microbiology

You might also like to view...

Explain why infectious diseases are much less lethal in developed countries than in underdeveloped countries

What will be an ideal response?

Biology & Microbiology

Mutations in the TATA box increase the rate of transcription

__________________ Indicate whether the statement is true or false.

Biology & Microbiology

The density of Douglas firs in an old-growth forest is estimated by counting the Douglas firs in four sample plots of 1 hectare each. The number of fir trees in the plots is 10, 12, 7, and 11, respectively. What is the estimated density of firs in the

forest? A) 5 trees per hectare B) 10 trees per hectare C) 20 trees per hectare D) 25 trees per hectare

Biology & Microbiology

Trypanosoma and Leishmania are called ________ because they often live in the blood and tissues of their hosts.

Fill in the blank(s) with the appropriate word(s).

Biology & Microbiology