Question

In: Math

All work must be done in R programing. Consider this dataset provided to you as prob10.txt...

All work must be done in R programing. Consider this dataset provided to you as prob10.txt c1 t1 c2 t2 c3 t3 c4 t4 2650 3115 2619 2933 2331 2799 2750 3200 1200 1101 1200 1309 1888 1901 1315 980 1541 1358 1401 1499 1256 1238 1625 1421 1545 1910 1652 2028 1449 1901 1399 2002 1956 2999 2066 2880 1777 2898 1999 2798 1599 2710 1754 2765 1434 2689 1702 2402 2430 2589 2789 2899 2332 2300 2250 2741 1902 1910 2028 2100 1888 1901 2000 1899 1530 2329 1660 2332 1501 2298 1478 2287 2008 2485 2104 2871 1987 2650 2100 2520 (2) Read it in and set the row names to “Gene 1” through “Gene 10” It should look like this in R > prob10 c1 t1 c2 t2 c3 t3 c4 t4 Gene 1 2650 3115 2619 2933 2331 2799 2750 3200 Gene 2 1200 1101 1200 1309 1888 1901 1315 980 Gene 3 1541 1358 1401 1499 1256 1238 1625 1421 Gene 4 1545 1910 1652 2028 1449 1901 1399 2002 Gene 5 1956 2999 2066 2880 1777 2898 1999 2798 Gene 6 1599 2710 1754 2765 1434 2689 1702 2402 Gene 7 2430 2589 2789 2899 2332 2300 2250 2741 Gene 8 1902 1910 2028 2100 1888 1901 2000 1899 Gene 9 1530 2329 1660 2332 1501 2298 1478 2287 Gene 10 2008 2485 2104 2871 1987 2650 2100 2520 (3 )Perform a one-sample t-test to compare the hypothesis that the mean of the control expression values is 2000.

Solutions

Expert Solution

R code for this problem

#part a

data=matrix(c(2650, 3115, 2619, 2933, 2331, 2799, 2750 ,3200, 1200,
1101 ,1200, 1309, 1888, 1901, 1315, 980 ,1541 ,1358,
1401, 1499, 1256, 1238, 1625, 1421, 1545, 1910 ,1652,
2028, 1449, 1901, 1399, 2002, 1956, 2999, 2066 ,2880,
1777, 2898, 1999, 2798, 1599, 2710, 1754, 2765, 1434,
2689, 1702, 2402, 2430, 2589, 2789, 2899, 2332, 2300,
2250, 2741, 1902, 1910, 2028, 2100, 1888, 1901, 2000 ,
1899, 1530, 2329, 1660, 2332, 1501, 2298, 1478 ,2287,
2008, 2485, 2104, 2871, 1987, 2650, 2100, 2520),10,8,byrow=F)

#part b
row.names(data)= c("gane1","gane2","gane3",
"gane4","gane5","gane6","gane7",
"gane8","gane9","gane10")
colnames(data)=c("c1","t1","c2","t2","c3","t3","c4","t4")

#part c
test.data=as.vector(data[,c("c1","c2","c3","c4")])
t.test(test.data,mu=2000)
#remark: since p-value of this tet 0.5633 > 0.05
#we can comment that the mean of control expression is 2000 at
#95% confidence.

[please comment below if you have any problem of understanding]


Related Solutions

R work (must be done in R) Before opening the dataset needed for this problem, you’ll...
R work (must be done in R) Before opening the dataset needed for this problem, you’ll need to call the “car” package. Run the following line of code: > library(car)  Now you can import the “ Cowles” dataset and use it to answer the question below. Name the data frame with your EID: > my_eid <- Cowles Remember to include any code you use along with your answers in your submission! 3. Cowles and Davis (1987) collected data on...
Part 2– R work (must be done in R) Copy and paste your R code and...
Part 2– R work (must be done in R) Copy and paste your R code and output into a word document, along with your written answers to the questions, and upload to Canvas.   Follow these instructions to import the necessary dataset: Before opening the dataset needed for this problem, you’ll need to call the “car”package.  Run the following line of code: > library(car) Now you can import the “Prestige” dataset and use it to answer the question below. Name the data...
R Problem Set: #Work with the inbuilt dataset "Cars" View(cars) This will show you the dataset...
R Problem Set: #Work with the inbuilt dataset "Cars" View(cars) This will show you the dataset on 2 variables speed and distance. ?cars This will explain what the variables mean. #Q1) Describe the dataset. What are the main findings? #Q2) Design a relevant question to model using linear regressions #Q3) Run the regression and report the std error, t-stat, p value and f stat. #Q4) Is this a valid regression? Is the normality assumption justified? Show clearly. #Q5) Are there...
Show all work in the space provided. All work must be shown to obtain full credit....
Show all work in the space provided. All work must be shown to obtain full credit. If a calculator function was used, list exactly what was entered into the calculator. All z-score calculations must be shown. 1. According to one study, the average hours of sleep on a weeknight for men was 6.9 hours with a standard deviation of 1.5 hours. a. Find the proportion of men who sleep for more than 8 hours per night. b. Find the proportion...
Use Excel to complete the following question. You must submit/attach all work done in Excel in...
Use Excel to complete the following question. You must submit/attach all work done in Excel in order to receive full credit. 5. A health advocacy group conducted a study to determine if the nicotine content of a particular brand of cigarettes was equal to the advertised amount. The cigarette brand advertised that the average nicotine content per cigarette was 1.4 milligrams. The advocacy group randomly sampled 24 cigarettes. The nicotine level for each of the sampled cigarettes is given below....
Consider the beauty dataset from the wooldridge package in R. Suppose you wish to estimate the...
Consider the beauty dataset from the wooldridge package in R. Suppose you wish to estimate the following equation: lwage=β0+β1educ+u Using heteroscedastic-robust standard errors, conduct the hypothesis test H0:β1=0 H1:β1≠0 What is the t-value associated with this test?
I have an assignment that must be done on R/R studio R/RStudio - Chapter 4- Correlation...
I have an assignment that must be done on R/R studio R/RStudio - Chapter 4- Correlation and Regression Step 1 – Download and Install the R/RStudio software on your computer. The link and instructions to do that are on Blackboard under Course Information. Step 2 – Access the Data Sets in MyLab through StatCrunch. In chapter 4, a data set called “Used BMW prices 2017” is what we’ll use. Step 3 – When you open the data set in StatCrunch,...
***This problem must be done using R so please provide the R code used to find...
***This problem must be done using R so please provide the R code used to find the solution. I have provided the data in data-wtLoss.txt below the question. I will also give "thumbs-up for correct R code" Thanks in advance.*** The file “data-wtLoss.txt” contains data on weight loss and self esteem evaluation at three time points over a period of three months for 34 individuals who are randomly selected from a residential area. These individuals are randomly assigned to one...
***This problem must be done using R so please provide the R code used to find...
***This problem must be done using R so please provide the R code used to find the solution. I have provided the data in data-wtLoss.txt below the question. I will also give "thumbs-up for correct R code" Thanks in advance.*** The file “data-wtLoss.txt” contains data on weight loss and self esteem evaluation at three time points over a period of three months for 34 individuals who are randomly selected from a residential area. These individuals are randomly assigned to one...
Work these in R. Using library(resampledata) and the dataset Spruce to conduct a test to see...
Work these in R. Using library(resampledata) and the dataset Spruce to conduct a test to see if the mean difference in how much the seedling grew (in height) over the course of the study under these two treatments are significantly different from each other. Answer the following: a) Set up a hypothesis using appropriate notation. b)Find the value of the observed test statistic using R. c)Compute the P-value of the observed test statistic using a permutation distribution with N= 10^5-1....
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT