For university, I have to write a Research paper and i want to examine whether there is a relationship (correlation)between hours worked in a Country per week and the Happiness Index points as the dependent variable. However, it has to be a mutiple linear regression model but I struggle with finding more independent variables. Can somebody help me here?
In: Statistics and Probability
Discuss a real life problem that is solved with the Sufficiency Principle of data reduction by elaborating on the following:.
1. State the real-life problem.
2. Formulate the problem by giving a distribution for a sample in a real-life situation.
3. Find a sufficient statistic based on the sample.
4. Discuss how the statistic is used as a data reduction device in the rea life problem.
In: Statistics and Probability
TWO MEANS – INDEPENDENT SAMPLES
Choose a variable from the advising.sav data set to compare group means. While the choice of which variable to test is up to you, you must remember that it must be a metric variable. The grouping variable, which is used to define the two groups to be compared, must be categorical. You can look in the “Measure” column of the “Variable View” in the data file for help in determining which is which. The managerial question is whether or not there is a significant difference between the groups for the metric variable you have chosen.
Once you have the results, report your findings using the five step hypothesis testing procedure outlined in class. (See below.) For Step 4, simply cut and paste the SPSS output into the report. This can be done by clicking on the desired portion of the output which will then be highlighted, and then right clicking on the highlighted portion and copying it to your flash drive. (Note that you may want to drop the results into a word document immediately since if you do not have SPSS on your personal laptop, you will not be able to open any SPSS output.) Then state the answer to the managerial question that was initially posed. For example, is there a significant difference between the two groups defined by the grouping variable (which you must identify in your report) for the metric variable tested? Also, interpret the confidence interval provided for the test. Does it indicate a significant difference or not?
PAIRED SAMPLE T-TEST
Choose a pair of metric variables and run a paired sample t-test on the pair. Again, these must be metric variables. The managerial question will be “Is there a significant difference between the two variables?” for the pair. Report your findings using the same procedure described above, including an interpretation of the confidence interval.
REPORT(SAMPLE)
Your report will consist of two hypotheses tests, (one for the independent sample test and one for the paired sample test). It will look something like this (for the independent sample test):
1: H0: μ1= μ2
Ha: μ1 ≠ μ2
2: Two group independent sample t-test (note that SPSS does everything as a t-test regardless of sample size).
3: α=.05 → tcrit = ±whatever the appropriate value is
4
Group Statistics |
|||||
status |
N |
Mean |
Std. Deviation |
Std. Error Mean |
|
dotest |
0 |
185 |
1494.071 |
2249.4948 |
165.3861 |
1 |
50 |
803.280 |
1080.0304 |
152.7394 |
Independent Samples Test |
||||||||||
Levene's Test for Equality of Variances |
t-test for Equality of Means |
|||||||||
F |
Sig. |
t |
df |
Sig. (2-tailed) |
Mean Difference |
Std. Error Difference |
95% Confidence Interval of the Difference |
|||
Lower |
Upper |
|||||||||
dotest |
Equal variances assumed |
13.465 |
.000 |
2.104 |
233 |
.036 |
690.7914 |
328.2585 |
44.0572 |
1337.5255 |
Equal variances not assumed |
3.068 |
169.287 |
.003 |
690.7914 |
225.1264 |
246.3747 |
1135.2080 |
5: Make a decision regarding the null hypothesis and interpret the confidence interval.
6: Answer the managerial question.
RESULTS AFTER RUNNING (DATA )
INDEPENDENT
Group Statistics |
|||||
Gender |
N |
Mean |
Std. Deviation |
Std. Error Mean |
|
OverallSatisfaction |
Female |
131 |
4.97 |
1.771 |
.155 |
Male |
145 |
4.99 |
1.488 |
.124 |
Independent Samples Test |
||||||||||
Levene's Test for Equality of Variances |
t-test for Equality of Means |
|||||||||
F |
Sig. |
t |
df |
Sig. (2-tailed) |
Mean Difference |
Std. Error Difference |
95% Confidence Interval of the Difference |
|||
Lower |
Upper |
|||||||||
OverallSatisfaction |
Equal variances assumed |
5.905 |
.016 |
-.120 |
274 |
.904 |
-.024 |
.196 |
-.410 |
.363 |
Equal variances not assumed |
-.119 |
255.054 |
.905 |
-.024 |
.198 |
-.414 |
.366 |
PAIRED
Paired Samples Statistics |
|||||
Mean |
N |
Std. Deviation |
Std. Error Mean |
||
Pair 1 |
DesiredConvenience |
6.20 |
273 |
1.175 |
.071 |
ActualConvenience |
4.55 |
273 |
1.636 |
.099 |
Paired Samples Correlations |
||||
N |
Correlation |
Sig. |
||
Pair 1 |
DesiredConvenience & ActualConvenience |
273 |
.213 |
.000 |
Paired Samples Test |
|||||||||
Paired Differences |
t |
df |
Sig. (2-tailed) |
||||||
Mean |
Std. Deviation |
Std. Error Mean |
95% Confidence Interval of the Difference |
||||||
Lower |
Upper |
||||||||
Pair 1 |
DesiredConvenience - ActualConvenience |
1.648 |
1.799 |
.109 |
1.434 |
1.863 |
15.140 |
272 |
.000 |
PLEASE ANSWER INDEPENDENT AND PAIRED PARTS REPORT INDEPENDENTLY AND FOLLOW THE SAMPLE REPORT'S FORMAT TO ANSWER THE 6 QUESTIONS
PLEASE USE TWO REPORTS FOR INDEPENDENT AND PAIRED PARTS
In: Statistics and Probability
The scores on a standardized math test for 8th grade children form a normal distribution with a mean of 80 and a standard deviation of 12. (8 points)
a) What is the probability of obtaining a sample mean greater than 82 for a sample of n = 36?
b) What is the probability of obtaining a sample mean less than 78 for a sample of n = 9?
In: Statistics and Probability
The data in the table below (and available online) represent monthly gross sales for Ska Brewing Company from 2009 to 2012. This is real data, let’s help Ska forecast 2013…
Year |
January |
February |
March |
April |
May |
June |
July |
August |
September |
October |
November |
December |
Total |
9 9 |
$193,481 |
$170,674 |
$228,095 |
$232,372 |
$288,188 |
$304,763 |
$337,825 |
$342,121 |
$320,011 |
$304,756 |
$221,514 |
$235,591 |
|
10 |
$267,782 |
$225,592 |
$356,604 |
$274,723 |
$377,369 |
$439,907 |
$430,999 |
$485,822 |
$407,577 |
$450,234 |
$315,238 |
$345,135 |
|
11 |
$319,313 |
$323,726 |
$342,353 |
$361,315 |
$612,500 |
$564,599 |
$518,422 |
$623,860 |
$412,091 |
$530,636 |
$313,034 |
$395,686 |
|
12 |
$375,117 |
$391,677 |
$426,746 |
$535,876 |
$659,204 |
$582,670 |
$663,534 |
$564,901 |
$636,399 |
$727,822 |
$539,011 |
$450,188 |
|
13 |
|||||||||||||
Slope |
a)Sum up the totals for each year and fill in the blanks above.
b)Forecast demand for each month and the total for 2013 using linear regression.
c)Graph the data, including the forecasts from part b)in chronological orderto display the seasonal pattern. Printthis out and attach it.
d)Determine the slope for each month and fill in the blanks above. According to the slopes, which twomonths are growing the fastest (at about the same rate)?
e) Interpret i) the slope for the two fastest growing month(s) and ii) the annual slope.
Predict the twelve months of 2013 using all data at once with the new =forecast.ets() function in Excel.
In: Statistics and Probability
43% of adults say cashews are their favorite kind of nut. You randomly select 12 adults and ask each to name his or her favorite nut. Find the probability that the number who say cashews are their favorite nut is (a) exactly three, (b) at least four, and (c) at most two. If convenient, use technology to find the probabilities.
P(3)=
In: Statistics and Probability
Claim: there is no difference between the mean daily driving time for the popuation of 16-20 year old men and the mean daily driving time for the population of men 60 years and older. In a study of times that people drive each day, it was found that 50 men aged 16-20 years drove a daily average of 49.3 minutes with a standard deviation of 21.4 minutes, and 55 men aged 60 years and older drove a daily average of 57.1 minutes with a standard deviation of 35.6 minutes.Test the claim at the 0.05 significance level
In: Statistics and Probability
There are 20 total socks, 10 white and 10 black. This makes 10 total matching pairs of 5 pair of white and 5 pair of black.
6. What is the total probability of picking a white sock and then another white sock (one pair of white socks)?
7. What is the probability of picking either a pair of white socks or a pair of black socks?
8. If each time you pick a sock from the drawer a sock just like it magically replaces it, what is the probability of picking either a pair of white socks or a pair of black socks?
9. How can you guarantee success of picking a matching pair? In other words, what is the minimum number of socks needing to be picked to guarantee a matching pair? (Hint: There is a right answer to this question!)
10. Explain dependent and independent trials and then further describe the difference between Question 7 and Question 8 as it relates to dependent and independent trials.
In: Statistics and Probability
Discuss the three measures of central tendency. Give an example for each that applies to the measure to a Radar / Speed Enforcement and discusses how the measure is used. What are the advantages and disadvantages for each of the three measures? How do outliers affect each of these three measures? What are some options for handling outliers?
In: Statistics and Probability
Two coins are tossed. A die is thrown for each coin that falls heads. What is the expected number of spots shown on the dice?
In: Statistics and Probability
Please find the mean, mode, medium as applicable. This is data collected based off an observational study I conducted. The data was collected to see if male or females actively or passively interacted with the the barista while placing their order. This first table is the study of both locations compressed in to one table. The second table is data that is specific to the location. Please include/ find P value and stats!
Gender | Observed | Active | Passive |
Male | 26 | 18 | 8 |
Female | 21 | 15 | 6 |
Total | 47 | 33 | 14 |
Location | Observed | Active | Passive |
Panera (PM) | 27 | 21 | 6 |
Dunkin (AM) | 20 | 10 | 10 |
Total | 47 | 21 | 16 |
In: Statistics and Probability
10. Ms. McNicholas wants to see if there is any difference in the Final Exam scores of her two Statistics classes. Class I 81 73 86 90 75 80 75 80 75 81 85 87 83 75 70 65 80 76 64 74 86 80 83 67 82 78 76 83 71 90 77 81 82 Class II 87 77 66 75 78 82 82 71 79 91 97 89 92 75 89 75 95 84 75 82 74 77 87 69 96 65 a) Find the five-number summary for each class. b) Construct a boxplot for each class. c) Determine the range for outliers on each graph. d) Is there a difference in the performance of the two classes?
In: Statistics and Probability
b) Interpret the coefficients in the estimated regression model of Sales on Csales. c) Before estimating the model, the manager claimed that for every 1 million increase in Csales, Sales go down by 2 million. Is there evidence from the estimated model that she was not correct? Answer by constructing an appropriate 95% confidence interval. d) What is the correlation between Sales and Csales?
Region |
Sales |
Advertising |
Promotions |
Csales |
||
Selkirk |
101.8 |
1.3 |
0.2 |
20.4 |
Csales=main competitor's sales |
|
Susquehanna |
44.4 |
0.7 |
0.2 |
30.5 |
Sales=sales of company's Nature -Bar |
|
Kittery |
108.3 |
1.4 |
0.3 |
24.6 |
||
Acton |
85.1 |
0.5 |
0.4 |
19.6 |
||
Finger Lakes |
77.1 |
0.5 |
0.6 |
25.5 |
||
Berkshires |
158.7 |
1.9 |
0.4 |
21.7 |
||
Central |
180.4 |
1.2 |
1 |
6.8 |
all variables are in millions of dollars |
|
Providence |
64.2 |
0.4 |
0.4 |
12.6 |
||
Nashua |
74.6 |
0.6 |
0.5 |
31.3 |
||
Dunster |
143.4 |
1.3 |
0.6 |
18.6 |
||
Endicott |
120.6 |
1.6 |
0.8 |
19.9 |
||
Five-Towns |
69.7 |
1 |
0.3 |
25.6 |
||
Waldeboro |
67.8 |
0.8 |
0.2 |
27.4 |
||
Jackson |
106.7 |
0.6 |
0.5 |
24.3 |
||
Stowe |
119.6 |
1.1 |
0.3 |
13.7 |
In: Statistics and Probability
A marketing manager for Country Kitchen Corporation (CKC), which sells snack food product "Nature -Bar" in dif- ferent regions, was interested in how its sales (denoted Sales) are influenced by the sales of its main competitor (denoted Csales.) She gathered last year data on Sales and Csales for a 15 randomly selected sales regions and ob- tained the regression model of Sales on Csales. Both Sales and Csales are measured in millions of dollars. To answer questions below use your own Minitab output.
The data are in Excel file Nature-Bar.
a) Is the regression of Sales on Csales statistically significant at a= 0.02? (State the hypothesis test, rejection rule and your conclusion)
Region |
Sales |
Advertising |
Promotions |
Csales |
||
Selkirk |
101.8 |
1.3 |
0.2 |
20.4 |
Csales=main competitor's sales |
|
Susquehanna |
44.4 |
0.7 |
0.2 |
30.5 |
Sales=sales of company's Nature -Bar |
|
Kittery |
108.3 |
1.4 |
0.3 |
24.6 |
||
Acton |
85.1 |
0.5 |
0.4 |
19.6 |
||
Finger Lakes |
77.1 |
0.5 |
0.6 |
25.5 |
||
Berkshires |
158.7 |
1.9 |
0.4 |
21.7 |
||
Central |
180.4 |
1.2 |
1 |
6.8 |
all variables are in millions of dollars |
|
Providence |
64.2 |
0.4 |
0.4 |
12.6 |
||
Nashua |
74.6 |
0.6 |
0.5 |
31.3 |
||
Dunster |
143.4 |
1.3 |
0.6 |
18.6 |
||
Endicott |
120.6 |
1.6 |
0.8 |
19.9 |
||
Five-Towns |
69.7 |
1 |
0.3 |
25.6 |
||
Waldeboro |
67.8 |
0.8 |
0.2 |
27.4 |
||
Jackson |
106.7 |
0.6 |
0.5 |
24.3 |
||
Stowe |
119.6 |
1.1 |
0.3 |
13.7 |
In: Statistics and Probability
8. Rollflex and Morgan Trust are two stocks traded on the New York Stock Exchange. For the past ten weeks, you recorded the Friday closing price (dollars per share): Rollflex: 21 20 20 23 24 20 25 23 28 21 Morgan Trust: 53 51 50 50 50 55 54 52 50 57 a) Compute the mean, median, mode, standard deviation, and variance for each stock. b) Create a boxplot for each stock. c) Which do you consider a better investment and why?
In: Statistics and Probability