Question

In: Statistics and Probability

Nucleotide Pairs The human genome is composed of the four DNA nucleotides: A, T, G, and...

Nucleotide Pairs

The human genome is composed of the four DNA nucleotides: A, T, G, and C.

Some regions of the human genome are extremely G–C rich (i.e., a high proportion of the DNA nucleotides there are guanine, G, and cytosine, C).

Other regions are relatively A–T rich (i.e., a high proportion of the DNA nucleotides there are adenine, A, and thymine, T).

Imagine that you want to compare nucleotide sequences from two regions of the genome.

Sixty percent of the nucleotides in the first region are G–C (30% each of guanine and cytosine) and 40% are A–T (20% each of adenine and thymine).

The second region has 25% of each of the four nucleotides.

If you choose a single nucleotide at random from each of the two regions, what is the probability that they are the same nucleotide? (Hint: Where X is any of the 4 DNA nucleotides calculate Pr(X|X) for all four and sum.)

On a separate sheet draw a probability tree (first branch will have 4 limbs).

Assume that nucleotides over a single strand of DNA occur independently within regions and that you randomly sample a two-nucleotide sequence from each of the two regions. List all the possible 2-nucleotide sequences for each region and their probabilities (include pairs like XX and assume XY is the same as YX):

First region pairs:  

Second region pairs:

What is the probability that the two pairs chosen from different regions are the same?

Solutions

Expert Solution


Related Solutions

Sort the following structures in order decreasing complexity Nucleotide DNA polymerase Adenine Y Chromosome Human Genome...
Sort the following structures in order decreasing complexity Nucleotide DNA polymerase Adenine Y Chromosome Human Genome Nitrogen Neutron
Complete the genetic information (DNA base pairs, t-RNA and mRNA nucleotide bases, and the amino acids...
Complete the genetic information (DNA base pairs, t-RNA and mRNA nucleotide bases, and the amino acids this gene codes for, in the following DNA strand1 :    ATG     _____    _____   _____    _____     _____    CGC DNA strand 2 : *_____    GCC    _____   _____    _____    AGT     _____    mRNA : _____   _____    AUA    _____    UUU   _____    _____    tRNA : _____    _____   _____     UAC     _____   _____    _____ Amino acids :   ______   ______   _____   _____   ______   ______   ______ (Remember which type of RNA actually...
A. The vast majority of the DNA sequence of a plant nuclear genome is composed of...
A. The vast majority of the DNA sequence of a plant nuclear genome is composed of repetitive DNA with only a small fraction of the genome space representing protein coding gene sequences. Discuss the structure of chromatin in repeat-rich and gene-rich regions of the plant nuclear genome, including the methylation status of the DNA in these two distinct genome regions. Also, describe the conformational change in chromatin structure required to promote gene expression and explain why this conformational change is...
The total mass of DNA in the body is 50.0 g. If the number of nucleotides...
The total mass of DNA in the body is 50.0 g. If the number of nucleotides in ONE STRAND of DNA is approximately 3.0 x 106 , and the average length of a single nucleotide is 0.34 nm, what is the length (in km) of one strand of DNA when it is stretched out to its maximum length (not in a helix)?
What is DNA? What is an easy way to remember what the four nucleotides are in...
What is DNA? What is an easy way to remember what the four nucleotides are in DNA, which are purines and which are pyrimidines, what nucleotides are single versus double ringed structures, and why certain nucleotides complement one another? If an organism developed a mutation in the gene that codes for helicase, how would it affect cell division? Why did DNA evolve for genetic storage if RNA preceded it?
Describe the hierarchical approach to determining the DNA sequence of the human genome used by the...
Describe the hierarchical approach to determining the DNA sequence of the human genome used by the Human Genome Project (HGP). Your answer should include descriptions of how physical maps were established and how BAC (bacterial artificial chromosome) libraries facilitated sequencing? (Min 2 and a half pages)
Calculate the weight of DNA (in grams) contained in the nuclear genome of a human sperm...
Calculate the weight of DNA (in grams) contained in the nuclear genome of a human sperm cell. Assume the G/C content is 41%. Show your calculations.
What does it mean when we say the DNA is two complimentary, and is composed of antiparallel chains of nucleotides?
What does it mean when we say the DNA is two complimentary, and is composed of antiparallel chains of nucleotides?
What does it mean when we say the DNA is two complimentary, and is composed of antiparallel chains of nucleotides?
What does it mean when we say the DNA is two complimentary, and is composed of antiparallel chains of nucleotides?
Use the words replication, DNA, semi-conservative, complementary base pairs, enzymes, nucleotides, cell cycle and errors in...
Use the words replication, DNA, semi-conservative, complementary base pairs, enzymes, nucleotides, cell cycle and errors in a 2-4 sentences in a way that shows you know what each word means.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT