17) On this worksheet, make an XY scatter plot linked to the following data:
X | Y |
92 | 22 |
87 | 23 |
102 | 23 |
80 | 25 |
91 | 27 |
100 | 20 |
95 | 21 |
109 | 19 |
77 | 28 |
100 | 221 |
98 | 25 |
89 | 27 |
97 | 23 |
93 | 22 |
89 | 27 |
91 | 22 |
97 | 21 |
105 | 21 |
88 | 22 |
83 | 24 |
86 | 27 |
89 | 26 |
79 | 30 |
88 | 22 |
94 | 24 |
18) Add trendline, regression equation and r squared to the plot.
Add this title. ("Scatterplot of X and Y Data")
19)
The scatterplot reveals a point outside the point pattern. Copy the data to a new location in the worksheet. You now have 2 sets of data. | ||||||||||
Data that are more tha 1.5 IQR below Q1 or more than 1.5 IQR above Q3 are considered outliers and must be investigated. | ||||||||||
It was determined that the outlying point resulted from data entry error. Remove the outlier in the copy of the data. | ||||||||||
Make a new scatterplot linked to the cleaned data without the outlier, and add title ("Scatterplot without Outlier,") trendline, and regression equation label. | ||||||||||
In: Statistics and Probability
A manufacturer of automobile batteries claim that at least 80%
of the batteries
that it produces will last 36 months. A consumers’ advocate group
wants to evaluate this
longevity claim and selects a random sample of 28 batteries to
test. The following data indicate
the length of time (in months) that each of these batteries lasted
(i.e., performed properly
before failure): 42.3, 39.6, 25.0, 56.2, 37.2, 47.4, 57.5, 39.3,
39.2, 47.0, 47.4, 39.7, 57.3, 51.8,
31.6, 45.1, 40.8, 42.4, 38.9, 42.9, 34.1, 49.0, 41.5, 60.1, 34.6,
50.4, 30.7, 44.1. Now, we would
like to test, at a significance level of 0.05, if there is a
significant evidence that less than 80%
of the batteries will last at least 36 months? Conduct and conclude
the test.
In: Statistics and Probability
I am very confused about how to find the critical value of the test statistic.
I have found the test statistic of 5.636, with 1% significance level and two degrees of freedom. How do I calculate the critical value?
Geneticists examined the distribution of seed coat color in
cultivated amaranth grains, Amaranthus caudatus. Crossing
black-seeded and pale-seeded A. caudatus populations gave the
following counts of black, brown, and pale seeds in the second
generation.
According to genetics laws, dominant epistasis should lead to
|
In: Statistics and Probability
Urban traffic congestion throughout the world has been increasing in recent years, especially in developing countries. The accompanying table shows the number of minutes that randomly selected drivers spend stuck in traffic in various cities on both weekdays and weekends. Complete parts a through e below.
City_A | City_B | City_C | City_D | |
Weekday | 90 | 42 | 55 | 54 |
79 | 110 | 78 | 68 | |
132 | 62 | 78 | 42 | |
72 | 77 | 96 | 48 | |
97 | 95 | 122 | 53 | |
Weekend | 79 | 83 | 33 | 34 |
91 | 24 | 85 | 44 | |
71 | 106 | 74 | 47 | |
72 | 76 | 62 | 43 | |
63 | 79 | 36 | 43 |
a) Using alpha = 0.05, is there significant interaction between the city and time of the week?
Identify the hypotheses for the interaction between the city and time of the week. Choose the correct answer below.
A. H0: City and time of the week do not interact, H1: City and time of the week do interact
B. H0: μCity ≠ μTime, H1: City=μTime
C. H0: μCity=μTime, H1: μCity≠μTime
D. H0: City and time of the week do interact, H1: City and time of the week do not interact
Find the p-value for the interaction between city and time of the week.
p-value=????
(Round to three decimal places as needed.)
Draw the appropriate conclusion for the interaction between the city and time of the week. Choose the correct answer below.
A. Do not reject the null hypothesis. There is insufficient evidence to conclude that the city and time of the week interact.
B. Do not reject the null hypothesis. There is insufficient evidence to conclude that the means differ.
C. Reject the null hypothesis. There is insufficient evidence to conclude that the means differ.
D. Reject the null hypothesis. There is sufficientevidence to conclude that the city and time of the week interact.
b) Using two-way ANOVA and α=0.05,does the city have an effect on the amount of time stuck in traffic?
Identify the hypotheses to test for the effect of the city. Choose the correct answer below.
A. H0: μCity A=μCity B=μCity C=μCity D, H1: Not all city means are equal
B. H0: μCity=μTime, H1: μCity≠μTime
C. H0: μCity A≠μCity B≠μCity C≠μCity D, H1: μCity A=μCity B=μCity C=μCity D
D. H0: μCity A=μCity B=μCity C=μCity D, H1: μCity A>μCity B>μCity C>μCity D
Find the p-value for the effect of the city.
p-value=???
(Round to three decimal places as needed.)
Draw the appropriate conclusion for the effect of the city. Choose the correct answer below.
A. Reject the null hypothesis. There is sufficient evidence to conclude that not all city means are equal.
B. Do not rejectthe null hypothesis. There is sufficient evidence to conclude that the means differ.
C. Do not rejectthe null hypothesis. There is insufficient evidence to conclude that not all city means are equal.
D. It is inappropriate to analyze because the city and the time of the week interact.
c) Using two-way ANOVA and α=0.05, does the time of the week have an effect on the amount of time stuck in traffic?
Identify the hypotheses to test for the effect of the time of the week. Choose the correct answer below.
A.H0: μWeekday=μWeekend, H1: Not all time of the week means are equal
B. H0: μCity=μTime, H1:μCity≠μTime
C. H0: μCity A=μCity B=μCity C=μCity D, H1: Not all time of the week means are equal
D. H0: μWeekday≠μWeekend, H1:μWeekday=μWeekend
Find the p-value for the effect of the time of the week.
p-value=???
(Round to three decimal places as needed.)
Draw the appropriate conclusion for the effect of the time of the week. Choose the correct answer below.
A. Do not reject the null hypothesis. There is insufficient evidence to conclude that not all time of the week means are equal.
B. Do not reject the null hypothesis. There is sufficient evidence to conclude that the means differ.
C. Reject the null hypothesis. There is sufficient evidence to conclude that not all time of the week means are equal.
D. It is inappropriate to analyze because the city and the time of the week interact.
d) Are the means for weekdays and weekends significantlydifferent?
A. Yes,because there is insufficient evidence to conclude that not all time of the week means are equal.
B. No, because there is insufficient evidence to conclude that not all time of the week means are equal.
C.Yes, because there is sufficient evidence to conclude that not all time of the week means are equal.
D. The comparison is unwarranted because the city and the time of the week interact.
In: Statistics and Probability
Blood cocaine concentration (mg/L) was determined both for a sample of individuals who had died from cocaine-induced excited delirium (ED) and for a sample of those who had died from a cocaine overdose without excited delirium; survival time for people in both groups was at most 6 hours. The accompanying data was read from a comparative boxplot in an article.
ED | 0 | 0 | 0 | 0 | 0.1 | 0.1 | 0.1 | 0.1 | 0.2 |
---|---|---|---|---|---|---|---|---|---|
0.2 | 0.3 | 0.3 | 0.3 | 0.4 | 0.5 | 0.7 | 0.8 | 1.0 | |
1.2 | 2.7 | 2.8 | 3.2 | 4.0 | 8.8 | 9.3 | 11.7 | 22.0 | |
Non-ED | 0 | 0 | 0 | 0 | 0 | 0.1 | 0.1 | 0.1 | 0.1 |
0.2 | 0.2 | 0.2 | 0.3 | 0.3 | 0.3 | 0.4 | 0.5 | 0.5 | |
0.6 | 0.8 | 0.9 | 1.0 | 1.2 | 1.4 | 1.5 | 1.7 | 2.0 | |
3.2 | 3.5 | 4.1 | 4.3 | 4.8 | 5.0 | 5.6 | 5.9 | 6.0 | |
6.4 | 7.5 | 8.3 | 8.6 | 9.2 | 9.6 | 9.9 | 11.0 | 11.5 | |
12.2 | 12.7 | 14.0 | 16.6 | 17.8 |
(a)
Determine the median, fourths, and fourth spread for the ED sample.
medianlower fourthupper fourthfourth spread
Determine the median, fourths, and fourth spread for the Non-ED sample.
medianlower fourthupper fourthfourth spread
In: Statistics and Probability
Simple regression was employed to establish the effects of
childhood exposure to lead. The effective sample size was about 122
subjects. The independent variable was the level of dentin lead
(parts per million). Below are regressions using various dependent
variables.
(a) Calculate the t statistic for each
slope, at significance level = 0.01. (Negative values
should be indicated by a minus sign. Round your answers to 2
decimal places.)
Dependent Variable | R2 | Estimated Slope | Std. Error | tcalculated | p-value | Differ from 0? | ||||||||
Highest grade achieved | .061 | −0.027 | 0.009 | .008 | (Click to select) Yes No | |||||||||
Reading grade equivalent | .121 | −0.070 | 0.018 | .000 | (Click to select) No Yes | |||||||||
Class standing | .039 | −0.006 | 0.003 | .048 | (Click to select) No Yes | |||||||||
Absence from school | .071 | 4.8 | 1.7 | .006 | (Click to select) Yes No | |||||||||
Grammatical reasoning | .051 | 0.159 | 0.062 | .012 | (Click to select) Yes No | |||||||||
Vocabulary | .108 | −0.124 | 0.032 | .000 | (Click to select) No Yes | |||||||||
Hand-eye coordination | .043 | 0.041 | 0.018 | .020 | (Click to select) Yes No | |||||||||
Reaction time | .025 | 11.8 | 6.66 | .080 | (Click to select) Yes No | |||||||||
Minor antisocial behavior | .025 | −0.639 | 0.36 | .082 | (Click to select) Yes No |
In: Statistics and Probability
Activity 3: Which procedure?
For each question, determine which inference procedure is
appropriate (perform hypothesis test or construct a confidence
interval), and identify the parameter of interest (p, p1 – p2, µ,
µ1 – µ2, µd).
What percentage of college students engage in underage drinking in
their freshman year?
Test or Interval Parameter
What is the average change in a person’s heart rate when comparing
measurements from before and after a scary scene in a horror
film?
Test or Interval Parameter
What is the average number of siblings of all Penn State
students?
Test or Interval Parameter
Is there a difference in the percent of college freshman and
college sophomores who engage in underage drinking?
Test or Interval Parameter
On average, are college graduates exiting school with a GPA above 3.0?
Test or Interval Parameter
Is the percent of sophomores living on campus at Penn State
different than 30%?
Test or Interval Parameter
Is there a difference in the percentage of season games of their favorite sport a fan attends based on if their favorite sport is baseball or basketball?
Test or Interval Parameter
In: Statistics and Probability
write three benefits from doing the principle component analysis?
In: Statistics and Probability
You have prepared 10 types of treats for your 5 cats. You don’t know which treat each of your cats will
go for, so you have bought for each type enough treats for all your cats. Assume that each cat is equally
likely to choose any type of treats, and let X be the number of pairs of cats that will choose the same
type of treats. Compute E(X) and Var(X).
(Hint: consider events Ai,j that the ith and jth cats will choose the same type of treats.)
In: Statistics and Probability
Diet A Diet B Diet C Total
3 10 8
6 12 3
7 11 2
4 14 5
8
6
the critical value = 3.98
Conclusion:
Analysis of Variance Summary Table
P-value
Source Sum of Squares D.F. Mean Square F ratio p-value
Between groups
Within
SSB SSW
MSB = SSBk-1 MSW=SSWN-k
F = MSBMSW
ANOVA 1 TUKEY B
ID N
____________________________
3 4 4.50
____________________________
1 4 5.00
____________________________
2 6 10.17
_____________________________
Sig. .954 1.00
In: Statistics and Probability
1. A regular type of laminate is currently being used by a manufacturer of circuit boards. A special laminate has been developed to reduce warpage. The regular laminate will be used on one sample of circuit boards and the special laminate on another independent sample of circuit boards. The amount of warpage will then be determined for each circuit board. The manufacturer will then switch to the special laminate only if it can be demonstrated that the true average amount of warpage for that laminate is less than for the regular laminate. State the relevant hypotheses, and describe the type I and type II errors in the context of this scenario.
2. The measured residual flame time, in seconds, for strips of treated children’s nightwear are given in the following table.
9.85 |
9.93 |
9.75 |
9.77 |
9.67 |
9.87 |
9.67 |
9.94 |
9.85 |
9.75 |
9.83 |
9.92 |
9.74 |
9.99 |
9.88 |
9.95 |
9.95 |
9.92 |
9.93 |
9.89 |
Suppose a true average flame time of at most 9.75 seconds had been mandated. Does the data suggest that this condition has not been met? Carry out an appropriate test after first investigating the plausibility of assumptions that underlie your method of inference. Use the recommended sequence of steps and reach a conclusion using a significance level of 0.01:
a. Identify the parameter of interest and describe it in the context of the problem situation:
b. Determine the null value and state the null hypothesis:
c. State the appropriate alternative hypothesis:
d. Give the formula for the computed value of the test statistic (substituting the null value and the known values of any other parameters, but not those of any sample-based quantities):
e. Compute any necessary sample quantities, substitute into the formula for the test statistic value and compute that value:
f. Determine the P-value:
g. Compare the selected or specified significance level to the P-value to decide whether H0 should be rejected, and state this conclusion in the problem context:
In: Statistics and Probability
1) Of all the trees planted by a landscaping firm, 10% survive.
What is the probability that 7 or more of the 9 trees they just
planted will survive? (Use a table of binomial probabilities. Give
your answer correct to four decimal places.)
2)
Find the following probabilities for X = pulse rates of group of people, for which the mean is 78 and the standard deviation is 4. Assume a normal distribution. (Round all answers to four decimal places.)
(a) P(X ≤ 75).
(b) P(X ≥ 84).
(c) P(68 ≤ X ≤ 86).
In: Statistics and Probability
1. Suppose that you are interested in determining whether the advice given by a physician during a routine physical examination is effective in encouraging patients to stop smoking. In a study of current smokers, one group of patients was given a brief talk about the hazards of smoking and was encouraged to quit. A second group received no talk related to smoking. All patients were given a follow-up exam. In the sample of 150 patients who had received the talk, 23 reported that they had quit smoking. In the sample of 90 patients that hadn’t received a talk 8 had ceased smoking.
a. Create a 2x2 table based on the data provided in the question.
b. Use Chi-square test to determine whether there is an association between receiving talks and quitting smoking. Choose α = 0.05.
In: Statistics and Probability
A sample of 1100 computer chips revealed that 58% of the chips fail in the first 1000 hours of their use. The company's promotional literature claimed that less than 61% fail in the first 1000 hours of their use. Is there sufficient evidence at the 0.01 level to support the company's claim?
State the null and alternative hypotheses for the above scenario.
In: Statistics and Probability
In a study, 1319 schoolchildren at the age of 12 were random selected to be questioned on the prevalence of symptoms of severe cold. Another 1319 schoolchildren at age 14 were random selected to be questioned on the prevalence of symptoms of severe cold. For schoolchildren of age 12, 356 children were reported to have severe colds in the past 12 months, for schoolchildren of age 14, 468 were reported to have severe colds in the past 12 months. Is there an association between the prevalence of severe cold and the age of 12 and 14?
Please conduct a hypothesis testing using the Chi-square test to justify your conclusion.
In: Statistics and Probability