Question

In: Biology

Describe the approaches and techniques used to generate the first draft of the human genome sequence...

Describe the approaches and techniques used to generate the first draft of the human genome sequence in Feb 2001?

2 pages min with diagrams . thanks

Solutions

Expert Solution

Genome sequencing is figuring out the order of DNA nucleotides, or bases, in a genome—the order of As, Cs, Gs, and Ts that make up an organism's DNA. The human genome is made up of over 3 billion of these genetic letters.

Today, DNA sequencing on a large scale—the scale necessary for ambitious projects such as sequencing an entire genome—is mostly done by high-tech machines. Much as your eye scans a sequence of letters to read a sentence, these machines "read" a sequence of DNA bases.

A DNA sequence that has been translated from life's chemical alphabet into our alphabet of written letters might look like this

:


That is, in this particular piece of DNA, an adenine (A) is followed by a guanine (G), which is followed by a thymine (T), which in turn is followed by a cytosine (C), another cytosine (C), and so on.

APPROACHES

In 1998, a similar, privately funded quest was launched by the American researcher Craig Venter, and his firm Celera Genomics. Venter was a scientist at the NIH during the early 1990s when the project was initiated. The $300,000,000 Celera effort was intended to proceed at a faster pace and at a fraction of the cost of the roughly $3 billion publicly funded project. The Celera approach was able to proceed at a much more rapid rate, and at a lower cost than the public project because it relied upon data made available by the publicly funded project.

Celera used a technique called whole genome shotgun sequencing, employing pairwise end sequencing, which had been used to sequence bacterial genomes of up to six million base pairs in length, but not for anything nearly as large as the three billion base pair human genome.

Celera initially announced that it would seek patent protection on "only 200–300" genes, but later amended this to seeking "intellectual property protection" on "fully-characterized important structures" amounting to 100–300 targets. The firm eventually filed preliminary ("place-holder") patent applications on 6,500 whole or partial genes. Celera also promised to publish their findings in accordance with the terms of the 1996 "Bermuda Statement", by releasing new data annually (the HGP released its new data daily), although, unlike the publicly funded project, they would not permit free redistribution or scientific use of the data. The publicly funded competitors were compelled to release the first draft of the human genome before Celera for this reason. On July 7, 2000, the UCSC Genome Bioinformatics Group released a first working draft on the web. The scientific community downloaded about 500 GB of information from the UCSC genome server in the first 24 hours of free and unrestricted access.

In March 2000, President Clinton announced that the genome sequence could not be patented, and should be made freely available to all researchers. The statement sent Celera's stock plummeting and dragged down the biotechnology-heavy Nasdaq. The biotechnology sector lost about $50 billion in market capitalization in two days.

Although the working draft was announced in June 2000, it was not until February 2001 that Celera and the HGP scientists published details of their drafts. Special issues of Nature (which published the publicly funded project's scientific paper) described the methods used to produce the draft sequence and offered analysis of the sequence. These drafts covered about 83% of the genome (90% of the euchromatic regions with 150,000 gaps and the order and orientation of many segments not yet established). In February 2001, at the time of the joint publications, press releases announced that the project had been completed by both groups. Improved drafts were announced in 2003 and 2005, filling in to approximately 92% of the sequence currently.

TECHNIQUE

The process of identifying the boundaries between genes and other features in a raw DNA sequence is called genome annotation and is in the domain of bioinformatics. While expert biologists make the best annotators, their work proceeds slowly, and computer programs are increasingly used to meet the high-throughput demands of genome sequencing projects. Beginning in 2008, a new technology known as RNA-seq was introduced that allowed scientists to directly sequence the messenger RNA in cells. This replaced previous methods of annotation, which relied on the inherent properties of the DNA sequence, with direct measurement, which was much more accurate. Today, annotation of the human genome and other genomes relies primarily on deep sequencing of the transcripts in every human tissue using RNA-seq. These experiments have revealed that over 90% of genes contain at least one and usually several alternative splice variants, in which the exons are combined in different ways to produce 2 or more gene products from the same locus.

The genome published by the HGP does not represent the sequence of every individual's genome. It is the combined mosaic of a small number of anonymous donors, all of the European origin. The HGP genome is a scaffold for future work in identifying differences among individuals. Subsequent projects sequenced the genomes of multiple distinct ethnic groups, though as of today there is still only one "reference genome."


Related Solutions

Describe how the first draft of the human genome was obtained and compare this with the...
Describe how the first draft of the human genome was obtained and compare this with the next generation sequencing (NGS) technologies used to sequence human genomes. (Min 2 and a half pages.)
Describe the hierarchical approach to determining the DNA sequence of the human genome used by the...
Describe the hierarchical approach to determining the DNA sequence of the human genome used by the Human Genome Project (HGP). Your answer should include descriptions of how physical maps were established and how BAC (bacterial artificial chromosome) libraries facilitated sequencing? (Min 2 and a half pages)
A competing commercial effort to sequence the human genome was initiated by the company Celera in...
A competing commercial effort to sequence the human genome was initiated by the company Celera in 1997. How was their approach different from the Human Genome Project? A. Libraries of individual chromosomes were generated and overlapping clones were isolated before sequencing. B. They used sequence tagged sites (STS) and expression sequence tags (EST) to order contigs prior to sequencing. C. They sequenced the entire genome from one female donor. D. They generated overlapping clones from YAC and BAC libraries that...
Read the following paragraph: “The human genome sequence provides the underlying code for human biology. Despite...
Read the following paragraph: “The human genome sequence provides the underlying code for human biology. Despite intensive study, especially in identifying protein-coding genes, our understanding of the genome is far from complete, particularly with regard to non-coding RNAs, alternatively spliced transcripts and regulatory sequences. Systematic analyses of transcripts and regulatory information are essential for the identification of genes and regulatory regions and are an important resource for the study of human biology and disease. Such analyses can also provide comprehensive...
Describe the classes of repetitive elements that are present in the human genome?
Describe the classes of repetitive elements that are present in the human genome?
Describe various tools and techniques used in human resource management for a project. Use PMBOK as...
Describe various tools and techniques used in human resource management for a project. Use PMBOK as your main source of information List and describe some techniques for conflict management.
Describe the range of techniques that can be used to monitor human gene expression. Compare high-resolution...
Describe the range of techniques that can be used to monitor human gene expression. Compare high-resolution techniques, used to analyze individual genes and high-throughput techniques that analyze all human genes. (MIn 2 and a half pages)
Describe the mechanisms of dynamic evolution that have shaped the human genome? (2 pages )
Describe the mechanisms of dynamic evolution that have shaped the human genome? (2 pages )
Describe the biosynthesis of human papillomavirus (HPV). Discuss the transcription of the virus genome into new...
Describe the biosynthesis of human papillomavirus (HPV). Discuss the transcription of the virus genome into new virion particles. Make sure to include if your virus must package any special enzymes in order to be effective.
Describe telomeres and how they contribute to genome stability in human cells. Discuss their roles in...
Describe telomeres and how they contribute to genome stability in human cells. Discuss their roles in the maintenance of the genetic information. Essay include diagrams
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT