Question

Use PopSet (population study data sets) in ENTREZ to retrieve the coding sequences (CDS) of the amylase-related gene (amyrel) generated for Drosophila yakuba by Cariou et al. (2001). You should retrieve 5 Drosophila yakuba sequences from different population strains. ***Make sure all your sequences are Drosophila yakuba.***

(a) Report the GenBank accession numbers.

(b) Align coding regions of the 5 sequences. Which alignment software did you use?

(c) Look at the first 200 bases of the alignment. Count the number of segregating sites (K), the number of segregating sites per site (k), and the nucleotide diversity (π).

Solutions

Expert Solution

Go to https://www.ncbi.nlm.nih.gov/Web/Search/entrezfs.html and select ENTREZ.

On https://www.ncbi.nlm.nih.gov/gquery/, select PopSet.

Enter amyrel in the search box.

The record https://www.ncbi.nlm.nih.gov/popset/12620152 contains the Cariou et al. (2001) amyrel sequences for different strains of Drosophila yakuba.

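(a) The five accessions below come from three strains (LO4, LBV2 and SA3); LBV2 and SA3 are each represented by two clones.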

1) GenBank: AF280878.1 : Drosophila yakuba strain LO4

2) GenBank: AF280877.1 : Drosophila yakuba strain LBV2 clone 4

3) GenBank: AF280876.1 : Drosophila yakuba strain LBV2 clone 1

4) GenBank: AF280875.1 : Drosophila yakuba strain SA3 clone 8

5) GenBank: AF280874.1 : Drosophila yakuba strain SA3 clone 6

(b) Download the FASTA sequence of each strain and align the five sequences with the multiple sequence alignment tool Clustal Omega:

https://www.ebi.ac.uk/Tools/msa/clustalo/
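The FASTA records can also be downloaded in one request instead of one page at a time. The sketch below is only an illustration, assuming NCBI's public E-utilities `efetch` endpoint with the `nuccore` database; the helper names `efetch_url` and `fetch_fasta` are made up for this example, and the solution above used the web interface rather than a script.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# NCBI E-utilities efetch endpoint (assumed here; not part of the original answer).
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"

# Accession numbers reported in part (a).
ACCESSIONS = ["AF280878.1", "AF280877.1", "AF280876.1", "AF280875.1", "AF280874.1"]


def efetch_url(accessions, rettype="fasta"):
    """Build an efetch URL returning the given records as plain-text FASTA.

    rettype="fasta" returns the whole record; NCBI also documents a
    "fasta_cds_na" rettype for CDS-only output, which is worth checking
    before relying on it here.
    """
    params = {
        "db": "nuccore",
        "id": ",".join(accessions),
        "rettype": rettype,
        "retmode": "text",
    }
    return f"{EFETCH}?{urlencode(params)}"


def fetch_fasta(accessions, rettype="fasta"):
    """Download the records (requires network access) and return the FASTA text."""
    with urlopen(efetch_url(accessions, rettype)) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    # Print the URL only; call fetch_fasta(ACCESSIONS) to actually download.
    print(efetch_url(ACCESSIONS))
```

The text returned by `fetch_fasta(ACCESSIONS)` can be pasted directly into the Clustal Omega input box.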

Results:

CLUSTAL O(1.2.4) multiple sequence alignment


AF280878.1      ATGTTCAAGTTGGCTTTGACCCTGACACTCTGCTTGGCGGGCAGCCTCTCGCTGGCCCAG    60
AF280876.1      ATGTTCAAGTTGGCTTTGACCCTGACACTCTGCTTGGCGGGCAGCCTCTCGCTGGCCCAG    60
AF280877.1      ATGTTCAAGTTGGCTTTGACCCTGACACTCTGCTTGGCGGGCAGCCTCTCGCTGGCCCAG    60
AF280875.1      ATGTTCAAGTTGGCTTTGACCCTGACACTCTGCTTGGCGGGCAGCCTCTCGCTGGCCCAG    60
AF280874.1      ATGTTCAAGTTGGCTTTGACCCTGACACTCTGCTTGGCGGGCAGCCTCTCGCTGGCCCAG    60
                ************************************************************

AF280878.1      CACAATCCCCATTGGTGGGGCAATCGCAACACCATCGTCCACTTGTTCGAGTGGAAGTGG    120
AF280876.1      CACAATCCCCATTGGTGGGGCAATCGCAACACCATCGTCCACTTGTTCGAGTGGAAGTGG    120
AF280877.1      CACAATCCCCATTGGTGGGGCAATCGCAACACCATCGTCCACTTGTTCGAGTGGAAGTGG    120
AF280875.1      CACAATCCCCATTGGTGGGGCAATCGAAACACCATCGTCCACTTGTTCGAGTGGAAGTGG    120
AF280874.1      CACAATCCCCATTGGTGGGGCAATCGAAACACCATCGTCCACTTGTTCGAGTGGAAGTGG    120
                ************************** *********************************

AF280878.1      TCGGACATTGCCCAGGAGTGTGAGAATTTTCTGGGACCCCGAGGATTCGCCGGCGTTCAA    180
AF280876.1      TCGGACATCGCCCAGGAGTGTGAGAATTTTCTGGGCCCACGAGGATTCGGCGGCGTTCAA    180
AF280877.1      TCGGACATTGCCCAGGAGTGTGAGAATTTTCTGGGACCCCGAGGATTCGCCGGCGTTCAA    180
AF280875.1      TCGGACATCGCCCAGGAGTGTGAGAATTTTCTGGGCCCACGAGGATTCGCCGGCGTTCAA    180
AF280874.1      TCGGACATCGCCCAGGAGTGCGAGAATTTTCTGGGCCCACGAGGATTCGCCGGCGTTCAA    180
                ******** *********** ************** ** ********** **********

AF280878.1      GTGAGCCCCGTGAATGAGAACATCATATCGGCGGGTCGTCCTTGGTGGGAGCGATACCAA    240
AF280876.1      GTGAGCCCCGTGAATGAGAACATCATAGCGGCGGGTCGTCCTTGGTGGGAGCGATACCAA    240
AF280877.1      GTGAGCCCCGTGAATGAGAACATCATAGCGGCGGGTCGTCCTTGGTGGGAGCGATACCAA    240
AF280875.1      GTGAGCCCCGTGAATGAGAACATCATAGCGGCGGGTCGTCCTTGGTGGGAGCGATACCAA    240
AF280874.1      GTGAGCCCCGTGAATGAGAACATCATAGCGGCGGGTCGTCCTTGGTGGGAGCGATACCAA    240
                *************************** ********************************
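For part (c), the three statistics can be read off the first 200 columns of this alignment. A site is segregating if more than one base occurs in its column; K is the number of such sites, k = K / 200, and π is taken here as the mean number of pairwise differences per site, averaged over all 10 pairs of sequences. The Python sketch below assumes the alignment printed above is correct and that the first 200 columns contain no gaps (none appear in this output); the function names are illustrative.

```python
from itertools import combinations

WINDOW = 200  # number of alignment columns to analyse

# Columns 1-60 and 181-200 are identical in all five sequences in the
# alignment above, so they are stored once and reused.
HEAD = "ATGTTCAAGTTGGCTTTGACCCTGACACTCTGCTTGGCGGGCAGCCTCTCGCTGGCCCAG"
TAIL = "GTGAGCCCCGTGAATGAGAA"

# Columns 61-120 (two variants) and 121-180, copied from the alignment.
B2_C = "CACAATCCCCATTGGTGGGGCAATCGCAACACCATCGTCCACTTGTTCGAGTGGAAGTGG"
B2_A = "CACAATCCCCATTGGTGGGGCAATCGAAACACCATCGTCCACTTGTTCGAGTGGAAGTGG"

ALIGNMENT = {
    "AF280878.1": HEAD + B2_C
    + "TCGGACATTGCCCAGGAGTGTGAGAATTTTCTGGGACCCCGAGGATTCGCCGGCGTTCAA" + TAIL,
    "AF280876.1": HEAD + B2_C
    + "TCGGACATCGCCCAGGAGTGTGAGAATTTTCTGGGCCCACGAGGATTCGGCGGCGTTCAA" + TAIL,
    "AF280877.1": HEAD + B2_C
    + "TCGGACATTGCCCAGGAGTGTGAGAATTTTCTGGGACCCCGAGGATTCGCCGGCGTTCAA" + TAIL,
    "AF280875.1": HEAD + B2_A
    + "TCGGACATCGCCCAGGAGTGTGAGAATTTTCTGGGCCCACGAGGATTCGCCGGCGTTCAA" + TAIL,
    "AF280874.1": HEAD + B2_A
    + "TCGGACATCGCCCAGGAGTGCGAGAATTTTCTGGGCCCACGAGGATTCGCCGGCGTTCAA" + TAIL,
}


def segregating_sites(seqs):
    """Return the 1-based columns in which more than one base occurs."""
    return [i + 1 for i, column in enumerate(zip(*seqs)) if len(set(column)) > 1]


def nucleotide_diversity(seqs):
    """Return pi: mean pairwise differences per site over all sequence pairs."""
    pairs = list(combinations(seqs, 2))
    total_diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total_diffs / (len(pairs) * len(seqs[0]))


window = [seq[:WINDOW] for seq in ALIGNMENT.values()]
sites = segregating_sites(window)
K = len(sites)
k = K / WINDOW
pi = nucleotide_diversity(window)

print("segregating sites:", sites)
print("K =", K, " k =", round(k, 4), " pi =", round(pi, 4))
```

On the alignment above this gives segregating sites at columns 87, 129, 141, 156, 159 and 170, so K = 6, k = 6/200 = 0.03, and π = 32/(10 × 200) = 0.016, where 32 is the total number of pairwise differences summed over the 10 sequence pairs.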
