Overview
The SPSS output tables below are all based on a larger study of product assessments for an ecologically friendly engine oil. Participants saw the packaging of an ecologically friendly engine oil on their computer screen and were asked several questions regarding how they perceived this product. The tables below focus on only a few of the variables used in the study.
Coding
Gender was coded as 1=female and 2=male
PWOP_M_S refers to perceived warmth of product. Participants were asked to which extent they perceived the product as “warm.” The assumption is that the perception of the color of the product’s packaging influences this assessment. For this Chapter 11 exercise, I performed a median split on the variable. Thus, the variable is now dichotomous with 1=low perceived warmth and 2=high perceived warmth.
FLUEN_M_S refers to processing fluency. Participants were asked to which extent they perceived the product and its packaging as easy to process, well organized, logical, etc. For this Chapter 11 exercise, I performed a median split on the variable. Thus, the variable is now dichotomous with 1=low fluency and 2=high fluency.
A_PI refers to purchase intentions. This variable is continuous on a scale from 1 to 7. Higher values represent higher purchase intentions.
Note 1: It may be true that you are not fully familiar with the constructs and do not know much about the study context, but you can nevertheless interpret the results provided in a table from a statistical point of view.
Note 2: When responding to the questions, please provide your answers in a way which will encourage the reader to believe that you understood the logic of these statistical tests. For example, it is helpful to point out which numbers in the tables are important, and what the meaning of these numbers is.
Question 11.1
The following two tables are equivalent to Exhibit 11.10 in your textbook (Crosstab Chi-Square example).
Please provide an interpretation of the two tables. What are the insights we can obtain from the SPSS output shown below? What is the meaning of the “count” vs. the “expected count” information in the table? (explain the logic with an example from the table). What is the logic of the Chi Square tests (and specifically, what is the meaning of the numbers shown in the “Asymp. Sig” column?
PWOP_M_S * GENDER Crosstabulation |
|||||
GENDER |
Total |
||||
female |
male |
||||
PWOP_M_S |
1.00 |
Count |
115 |
95 |
210 |
Expected Count |
104.7 |
105.3 |
210.0 |
||
% within PWOP_M_S |
54.8% |
45.2% |
100.0% |
||
% within GENDER |
55.3% |
45.5% |
50.4% |
||
% of Total |
27.6% |
22.8% |
50.4% |
||
2.00 |
Count |
93 |
114 |
207 |
|
Expected Count |
103.3 |
103.7 |
207.0 |
||
% within PWOP_M_S |
44.9% |
55.1% |
100.0% |
||
% within GENDER |
44.7% |
54.5% |
49.6% |
||
% of Total |
22.3% |
27.3% |
49.6% |
||
Total |
Count |
208 |
209 |
417 |
|
Expected Count |
208.0 |
209.0 |
417.0 |
||
% within PWOP_M_S |
49.9% |
50.1% |
100.0% |
||
% within GENDER |
100.0% |
100.0% |
100.0% |
||
% of Total |
49.9% |
50.1% |
100.0% |
Chi-Square Tests |
||||
Value |
df |
Asymptotic Significance (2-sided) |
||
Pearson Chi-Square |
4.033a |
1 |
.045 |
|
Likelihood Ratio |
4.039 |
1 |
.044 |
|
Linear-by-Linear Association |
4.023 |
1 |
.045 |
|
N of Valid Cases |
417 |
|||
a. 0 cells (0.0%) have expected count less than 5. The minimum expected count is 103.25. |
||||
b. Computed only for a 2x2 table |
In: Math
The Cheebles cookie factory changed their recipe. The inspectors took a sample of the new cookies and found that the sample was 42 grams with a standard deviation of 4 grams. the Cheebles CEO specially asked the inspectors to use these statistics to find the lower and upper boundary weighs of 50% of their cookies. What are the Z values of the limits of the limits of the area covering the middle half of the area under the normal curve that the inspectors would use to find this information for the CEO of Cheebles?
In: Math
Suppose that the Gross Domestic Product (GDP) in the US is denoted as Gt. Let the quarterly data of Gt from 1980:1 to 2015:1 is non-stationary but the first difference of Gt, denoted as DGt, is stationary. Assume that a researcher identified the following AR model for DGt:
DGt = Alpha0 + Alpha1DGt-1 + Alpha3DGt-3 + Alpha4DGt-4 + et
Suppose the estimated results of the above AR model are as follows:
DGt = 6.09 + 0.18DGt-1 + 0.12DGt-3 + 0.05DGt-4
(a) Based on what criteria of the coefficients of auto-correlation function (ACF) and the coefficients of partial auto-correlation function (PACF), the researcher identified the above AR model? Explain. Also explain why the researcher chose 1, 3, and 4 lags.
(b) Forecast the GDP for 2015:2, assuming that the GDP in 2015:1, 2014:4, 2014:3, 2014:2, 2014:1, 2013:4, 2013:3, and 2013:2 respectively are: 16264.1, 16294.7, 16205.6, 16010.4, 15831.7, 15916.2,15779.9, and 15606.6.
(c) Explain how to conduct a diagnostic test to check if the researcher has identified the correct AR model. (Hint: Step 3 of Box-Jenkin’s Method)
Please show work.
In: Math
A machine is designed to fill 16-ounce bottles of shampoo. When the machine is working properly, the amount poured into the bottles follows a normal distribution with mean 16.05 ounces with a standard deviation of .1 ounces. If four bottles are randomly selected each hour and the number of ounces in each bottle is measured, then 95% of the means calculated should occur in what interval? Hint: the standard deviation rule says that 95% of the observations are within how many standard deviations away from the mean? Round answers to four decimal places
.
In: Math
1.) Consideration of Causation:
(i) Describe a realistic, but a hypothetical example of a situation in public administration where a novice researcher might incorrectly think that a clear correlation between two variables implies that one caused the other.?
(ii) Then, discuss the correct research-based approach to the analysis of your hypothetical example. In other words, how should the relationship between the two variables be accurately described or approached by a seasoned researcher?
In: Math
2. Earlier in this chapter, we discussed that iron supplements are popular in part because they raise oxygen levels in our bodies, and increased oxygen levels help us feel more energetic. We also said that consuming an iron supplement with a drink high in Vitamin C enhances the effects of the iron supplement. A researcher has a sample of 40 people randomly assigned to two factors: iron supplement and Vitamin C consumption. Specifically, people were randomly assigned either to take an iron supplement or not to take an iron supplement. In addition, people were also randomly assigned either to consume a glass of orange juice (which is high in Vitamin C) or not to consume a glass of orange juice. The researcher uses a pulse oximetry to measure oxygen levels as the dependent variable (to keep the calculations simple, we are using intentionally hypothetical numbers for oxygen levels).The data is presented below. Carry out the analysis of variance using SPSS and determine whether there are any main effects or an interaction. Make a line graph of the results and interpret the pattern of the results.
Iron Supplement |
||
Took Supplement |
Did not take Supplement |
|
Ingested Vitamin C |
9 |
1 |
7 |
3 |
|
8 |
3 |
|
8 |
3 |
|
9 |
2 |
|
8 |
3 |
|
9 |
3 |
|
7 |
2 |
|
6 |
2 |
|
8 |
1 |
|
Did Not Ingest Vitamin C |
4 |
3 |
5 |
2 |
|
5 |
4 |
|
6 |
2 |
|
6 |
2 |
|
7 |
2 |
|
6 |
2 |
|
5 |
3 |
|
5 |
4 |
|
4 |
2 |
In: Math
The time until the light in Bob's office fails is exponentially distributed with mean 2 hours. The time until the computer crashes in Bob's office is exponentially distributed with mean 3 hours. Failure and crash times are independent.
(a) Find the probability that neither the light nor computer fail in the next 2 hours
(b) Find the probability that the computer crashes at least 1 hour after the light fails.
In: Math
8. Adam, Bonnie, Chuck, Dave and Elaine are engineers from different companies attending a professional conference at the University of Arizona in Tucson. There are seven hotels near the campus. Each engineer will stay at a randomly picked hotel. a. What is the probability that they will all stay at the same hotel? b. What is the probability that they will all stay at different hotels? c. Adam has a crush on Bonnie, what is the probability that they will stay at the same hotel? d. What is the probability that exactly two of the five engineers will stay at the same hotel with no one else staying at a same hotel?
In: Math
At a recent halloween party, the women appeared to be consuming more packages of halloween candy than were the men. If the mean number of packages consumed by the 3 men was 4, and that for the 7 women was 6, and the standard deviation for the whole group was 2 packages, what was the correlation between gender and the number of packages consumed?
I had used the the point-biserial correlation coefficient equation and had gotten 0.46. Is this correct? Also does this mean that it is a substantially high correlation or is it quite low?
In: Math
What was the age distribution of nurses in Great Britain at the time of Florence Nightingale? Suppose we have the following information. Note: In 1851 there were 25,466 nurses in Great Britain. Age range (yr) 20-29 30-39 40-49 50-59 60-69 70-79 80+ Midpoint x 24.5 34.5 44.5 54.5 64.5 74.5 84.5 Percent of nurses 5.9% 9.1% 19.1% 29.9% 25.5% 8.8% 1.7% (a) Using the age midpoints x and the percent of nurses, do we have a valid probability distribution? Explain. No. The events are indistinct and the probabilities do not sum to 1. Yes. The events are distinct and the probabilities sum to 1. No. The events are indistinct and the probabilities sum to 1. Yes. The events are distinct and the probabilities do not sum to 1. (b) Use a histogram to graph the probability distribution in part (a). Maple Generated Plot Maple Generated Plot Maple Generated Plot Maple Generated Plot (c) Find the probability that a British nurse selected at random in 1851 would be 60 years of age or older. (Round your answer to three decimal places.) (d) Compute the expected age μ of a British nurse contemporary to Florence Nightingale. (Round your answer to two decimal places.) yr (e) Compute the standard deviation σ for ages of nurses shown in the distribution. (Round your answer to two decimal places.) yr
In: Math
Produce a Pareto diagram that shows total spending in a
descending order and at the same time the cumulative percentage
curve.
Based on the above, can you estimate which part of visitors
accounts for 75% of total revenues in the island.
The data used are:
NATIONALITY | TOT_SPEND |
GERMAN | 1988 |
GERMAN | 1988 |
GERMAN | 1834 |
GERMAN | 1823 |
GERMAN | 1741 |
GERMAN | 1708 |
GERMAN | 1707 |
GERMAN | 1701 |
FRENCH | 1598 |
GERMAN | 1531 |
FRENCH | 1494 |
GERMAN | 1437 |
GERMAN | 1420 |
ITALIAN | 1416 |
GERMAN | 1414 |
GERMAN | 1391 |
GERMAN | 1358 |
ITALIAN | 1350 |
GERMAN | 1298 |
GERMAN | 1296 |
GERMAN | 1295 |
GERMAN | 1295 |
FRENCH | 1280 |
FRENCH | 1245 |
BRITISH | 1217 |
FRENCH | 1204 |
GERMAN | 1184 |
ITALIAN | 1166 |
FRENCH | 1140 |
GERMAN | 1112 |
GERMAN | 1095 |
ITALIAN | 1088 |
ITALIAN | 1079 |
FRENCH | 1057 |
BRITISH | 1046 |
FRENCH | 1041 |
GERMAN | 1026 |
GERMAN | 1013 |
GERMAN | 1011 |
In: Math
1. The provost at the University of Chicago claimed that the entering class this year is larger than the entering class from previous years but their mean SAT score is lower than previous years. He took a sample of 20 of this year’s entering students and found that their mean SAT score is 1,501 with a standard deviation of 53. The University’s record indicates that the mean SAT score for entering students from previous years is 1,520. He wants to find out if his claim is supported by the evidence at a 5% level of significance. Round final answers to two decimal places. Solutions only.
(A) The parameter the president is interested in is:
(a) the mean number of entering students to his university this
year.
(b) the mean number of entering students to all U.S. universities
this year.
(c) the mean SAT score of the entering students to his university
this year.
(d) the mean SAT score of the entering students to all U.S.
universities this year.
(e) None of the above.
(B) The population the president is interested in is:
(a) all entering students to all universities in the U.S this
year.
(b) all entering students to his university this year.
(c) all SAT test centers in the U.S. this year.
(d) the SAT scores of all students entering universities in the
U.S. this year.
(e) None of the above.
(F) True, False, or Uncertain: The null hypothesis would be rejected.
(G) True, False, or Uncertain: The null hypothesis would be rejected if a 10% probability of committing a Type I error is allowed.
(I) True, False, or Uncertain: The evidence proves beyond a doubt that the mean SAT score of the entering class this year is lower than previous years.
(J) True, False, or Uncertain: If these data were used to perform a two-tail test, the p-value would be 0.1254.
In: Math
1.A group of psychologists is interested in determining if private practice doctors and hospital doctors have the same distribution of working hours. They survey 150 private practice doctors and 150 hospital doctors (selected at random) and asked about the number of hours per week they worked Determine whether there is a difference in hours worked per week for private practice and hospital doctors.
20-30 |
30-40 |
40-50 |
50-60 |
|
Private Practice |
62 |
44 |
38 |
6 |
Hospital |
22 |
44 |
59 |
25 |
H0:
HA:
In: Math
Suppose the length of time a person takes to use an ATM at the bank is normally distributed with mean of 110 seconds and standard deviation of 10 seconds. There are 4 people ahead of you in the queue waiting to use the machine. You are concerned about the total time (T) the 4 people ahead of you will take to use the machine.
(i) What is the mean value of T, the total time (in seconds) for the 4 people ahead of you to use the machine?
(ii) Assuming that the times for the 4 people are independent of each other, determine the standard deviation of T. (remember 20 x 20 = 400) (iii) Sketch the distribution for T, clearly labelling the important features on your sketch.
(iv) Use your sketch and the Empirical rule (0.68 within 1 standard deviation of the mean, 0.95 within 2 standard deviations of the mean, 0.997 within 3 standard deviations of the mean in a normal population) to find the probability that the total time T is less than 400 seconds.
In: Math