Question

Determine the N50 for fragments of 10 kb, 20 kb, 25 kb, 30 kb, 45 kb, 60 kb and 65 kb, showing your working. Explain your result in terms of genome quality.

Solutions

Expert Solution

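N50 is the length of the shortest contig (or genomic fragment) in the set of longest contigs that together cover at least 50% of the total assembly length. In other words, at least half of the assembled bases lie in contigs of length N50 or longer. The calculation of N50 involves the following steps: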

1. Arrange the genomic fragments by length, from longest to shortest. Here, the order is:

65 kb > 60 kb > 45 kb > 30 kb > 25 kb > 20 kb > 10 kb

2. Sum the fragment lengths and halve the total. Then add up the lengths in sorted order, starting from the longest. The fragment at which the running total first reaches or exceeds half of the total is the N50 value.

Sum of all the fragments = 65 + 60 + 45 + 30 + 25 + 20 + 10 = 255 kb. Half of that is 127.5 kb. Starting from the longest fragment, 65 + 60 = 125 kb, which is still less than 127.5 kb. Adding the next fragment gives 65 + 60 + 45 = 170 kb, which exceeds 127.5 kb. The 45 kb fragment is the one that takes the running total past the halfway point, so the N50 value is 45 kb. The attached diagram illustrates this calculation.
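The same working can be checked with a short script. This is only a minimal sketch of the two steps described above, assuming the lengths are given in one consistent unit (kb here); the function name `n50` and the variable names are chosen for illustration and do not come from any particular assembly tool.

```python
def n50(lengths):
    """Return the N50 of a list of fragment lengths (any consistent unit)."""
    if not lengths:
        raise ValueError("need at least one fragment length")
    half_total = sum(lengths) / 2  # step 2: halve the summed length
    running_total = 0
    # Step 1: walk through the fragments from longest to shortest.
    for length in sorted(lengths, reverse=True):
        running_total += length
        # The first fragment that brings the running total to at least
        # half of the overall total is the N50.
        if running_total >= half_total:
            return length


fragments_kb = [10, 20, 25, 30, 45, 60, 65]
print(sum(fragments_kb))  # 255
print(n50(fragments_kb))  # 45
```

The input order does not matter because the function sorts the lengths itself.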

All else being equal, a larger N50 value is better, because it indicates a more contiguous assembly. Here, an N50 of 45 kb means that at least half of the 255 kb total lies in fragments of 45 kb or longer. If the contigs were larger, the N50 would be higher and would reflect better genome quality. A smaller N50 indicates that the genome is broken into many short contigs, which are less likely to span complete, biologically meaningful regions. Note that N50 measures contiguity only, not the correctness of the assembly. The 45 kb N50 therefore suggests moderate genome quality, since higher values are possible.
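To see how fragmentation lowers N50, the sketch below compares the fragment set from the question with a hypothetical, more fragmented set of the same total length. The second list is invented purely for illustration: it assumes the 65 kb fragment is split into 35 kb + 30 kb and the 60 kb fragment into 30 kb + 30 kb. The helper `n50` is likewise an illustrative name.

```python
def n50(lengths):
    """Return the N50 of a list of fragment lengths (any consistent unit)."""
    half_total = sum(lengths) / 2
    running_total = 0
    for length in sorted(lengths, reverse=True):
        running_total += length
        if running_total >= half_total:
            return length
    raise ValueError("need at least one fragment length")


# Fragment lengths from the question, in kb.
original_kb = [65, 60, 45, 30, 25, 20, 10]

# Hypothetical set (an assumption for illustration only): the same 255 kb,
# but with the two longest fragments each broken into two pieces.
fragmented_kb = [35, 30, 30, 30, 45, 30, 25, 20, 10]

print(sum(original_kb), n50(original_kb))      # 255 45
print(sum(fragmented_kb), n50(fragmented_kb))  # 255 30
```

The total length is unchanged, yet the N50 drops from 45 kb to 30 kb, which is why a lower N50 is read as a sign of a more fragmented assembly.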

