Question

In: Statistics and Probability

Here we are going to test a couple of hypotheses about the Old Faithful data in...

Here we are going to test a couple of hypotheses about the Old Faithful data in R. Remember, this is the faithful data frame that is built in to R. You can use data(faithful) to load data set. First split faithful into two separate data frames: (1) those entries with eruption times less than 3 minutes (eruptions < 3) and (2) those entries with eruption times greater than or equal to 3 minutes (eruptions >= 3). Answer the following about the entry wait time (waiting):

(a) For the entries with short eruption times, you want to test the hypothesis that the associated waiting last on average less than 60 minutes. What is the null hypothesis? What is the alternative hypothesis? (Write your own code) (10 pt)

(b) Give R commands to compute the statistic that you used in (a) and the resulting p-value. What values did you get? Would you reject the null hypothesis at the α = 0.05 level? (15 pt)

(c) For the entries with long eruption times, you want to test the hypothesis that the associated waiting time last on average shorter than 80 minutes. What is the null hypothesis? What is the alternative hypothesis? (Write your own code) (10 pt)

(d) Give R commands to compute the statistic you used in (c) and the resulting p-value to test the hypothesis you came up with in part (c). What values did you get? Would you reject the null hypothesis at the α = 0.05 level? (15 pt)

Solutions

Expert Solution


Related Solutions

The following data represents the heights of the old faithful geyser eruptions, the durations of the...
The following data represents the heights of the old faithful geyser eruptions, the durations of the eruption and the interval between eruptions. The data is attached and an excel file is also included on canvas. The data is arranged in duration, interval and height a) Use the paired data for durations and intervals after eruptions of the geyser. Is there significant linear correlation at the 0.05 significance level suggesting interval after an eruption is related to duration (use the r...
A data set is provided, entitled oldfaithful_asst, on the duration and height of the Old Faithful...
A data set is provided, entitled oldfaithful_asst, on the duration and height of the Old Faithful geyser in the Yellowstone National Park. Construct a scatterplot using Excel or any software (SPSS or Minitab) between the variables “duration” and “height.” Please title the graph “Scatterplot 1 Old Faithful” and create labels for both axes. There seems to be a an outlier in the data set. Although an outlier is not a detriment to the data analysis, as part of an exercise,...
The U.S. Geological Survey compiled historical data about Old Faithful Geyser (Yellowstone National Park) from 1870...
The U.S. Geological Survey compiled historical data about Old Faithful Geyser (Yellowstone National Park) from 1870 to 1987. Let x1 be a random variable that represents the time interval (in minutes) between Old Faithful eruptions for the years 1948 to 1952. Based on 9520 observations, the sample mean interval was x1 = 61.2 minutes. Let x2 be a random variable that represents the time interval in minutes between Old Faithful eruptions for the years 1983 to 1987. Based on 25,340...
The U.S. Geological Survey compiled historical data about Old Faithful Geyser (Yellowstone National Park) from 1870...
The U.S. Geological Survey compiled historical data about Old Faithful Geyser (Yellowstone National Park) from 1870 to 1987. Let x1 be a random variable that represents the time interval (in minutes) between Old Faithful eruptions for the years 1948 to 1952. Based on 9280 observations, the sample mean interval was x1 = 62.0 minutes. Let x2 be a random variable that represents the time interval in minutes between Old Faithful eruptions for the years 1983 to 1987. Based on 24,170...
The U.S. Geological Survey compiled historical data about Old Faithful Geyser (Yellowstone National Park) from 1870...
The U.S. Geological Survey compiled historical data about Old Faithful Geyser (Yellowstone National Park) from 1870 to 1987. Let x1 be a random variable that represents the time interval (in minutes) between Old Faithful eruptions for the years 1948 to 1952. Based on 9580 observations, the sample mean interval was x1 = 61.8 minutes. Let x2 be a random variable that represents the time interval in minutes between Old Faithful eruptions for the years 1983 to 1987. Based on 23,000...
This week, we consider how to conduct hypotheses test on one sample data. Discuss the concepts...
This week, we consider how to conduct hypotheses test on one sample data. Discuss the concepts associated with these tests. Consider the following: The difference between a one tail and a two tailed test. The importance of stating the null and alternative hypotheses before conducting the test. The importance of a type one error (p) in conducting the test   The relationship between the p value and our decision to accept or reject the null hypothesis
Companies often develop and test hypotheses about their products. For example, car manufacturers will test their...
Companies often develop and test hypotheses about their products. For example, car manufacturers will test their cars to determine fuel efficiency and miles per gallon. To ensure that products are safe and that they perform as advertised, regulatory and consumer protection groups also test companies’ claims. For this Assignment, you are working at a firm that conducts independent testing for heavy industry. Recently, an automobile manufacturer has been in the news for complaints about the highway gas mileage of their...
8. Here we are going to consider the differences between spectra and chromatograms. A list is...
8. Here we are going to consider the differences between spectra and chromatograms. A list is given below of characteristics and behaviors of UV spectra and liquid chromatography. Some characteristics listed apply only to spectra. Other characteristics apply only to chromatography. And here’s the tricky bit, some characteristics apply to both. From the list below enter each characteristic that is correct for that instrumental plot on the line associated with that instrumental plot. Be careful, think about your choices, misplaced...
Describe the kind of data that are collected for an independent-measures t test and the hypotheses...
Describe the kind of data that are collected for an independent-measures t test and the hypotheses that the test evaluates. The key to helping formulate your explanation would be to include the assumptions of this statistical model, the type of sample used in this model, and a statement about the null hypothesis.
Describe the kind of data that are collected for an independent-measures t test and the hypotheses...
Describe the kind of data that are collected for an independent-measures t test and the hypotheses that the test evaluates. The key to helping formulate your explanation would be to include the assumptions of this statistical model, the type of sample used in this model, and a statement about the null hypothesis.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT