In: Statistics and Probability
Question 5
Judging independence of two categorical datas.
Use the information below to perform a statistical test on whether a customer would recommend the bank is dependent on the gender of the customer. (Hint: use the chi square test for association)
1. There are 152 customers. Of which 77/152 are female and 75/152 are male
2. Of the female customers, 52/77 recommended the bank whilst 25/77 did not recommend the bank.
3. Of the male customers. 45/75 recommended the bank whilst 30/75 did not recommend the bank.
Is the result of the bank recommendation dependent on gender?
In: Statistics and Probability
Important Note
For the remaining items you are to create an APA styled results
presentation. This will include a table of results, and written
inference and interpretation. If you are unclear about APA styled
tables, review carefully the examples provided in the linked Word
document on the Course Index. Each item is worth 10 points (unless
specified otherwise) and will be graded according to the following
rubric: Calculation Numeric Results 5 points, Written Presentation
2 points, and APA style 3 point.
6. Does climate predict suicide rates across the USA? According
to Dixon and Kalkstein (2009) climate may be a predictor of suicide
incidents. To test this possible link, data were collected on
suicide rate per 100,000 individuals in each state and several
indicators of climate. More specifically, the data include the
following:
• Suicide rate per 100,000 individuals (i.e., 15 = 15 suicides out
of 100,000 individuals)
• Average percentage of time the sun shines during the day (i.e.,
58 = sun shines 58% of time during day on average)
• Average number of clear days (i.e., 99 = clear skies 99 out of
365 days a year)
• Average afternoon humidity (i.e., 52 = humidity is 52% on average
at 4pm in the afternoon)
• Average temperature in Fahrenheit (i.e., 62.8 = average
temperature throughout year is 62.8)
The data can be downloaded as either an SPSS file or Excel file.
The variable names in each file are
• Suicide_Rate = Suicide rate per 100,000 individuals
• Percent_sun_days = Average percentage of time the sun shines
during the day
• Number_clear_days = Average number of clear days
• Afternoon_humidity = Average afternoon humidity
• Average_Temp_F = Average temperature in Fahrenheit
SPSS data file:
http://www.bwgriffin.com/gsu/courses/edur8131/data/test3_suicide_data.sav
Excel data file:
http://www.bwgriffin.com/gsu/courses/edur8131/data/test3_suicide_data.xlsx
Is there any evidence that these four climate indicators
statistically predict suicide rates? Use Regression for this
analysis. (10 points)
Data sources for this item:
• Suicide data (2005 data):
http://www.suicide.org/suicide-statistics.html#death-rates
• Dixon and Kalkstein (2009):
http://geosciences.msstate.edu/faculty/dixon/reprints/2009compass.pdf
• Average humidity (14 June 2013):
http://www.currentresults.com/Weather/US/annual-average-humidity-by-state.php
• Number of sunny days (14 June 2013):
http://www.currentresults.com/Weather/US/average-annual-state-sunshine.php
In: Statistics and Probability
Question 10
The data set below contains total Medicare enrollment data for the state of California for the years 2013 - 2017. Estimate a linear trend regression model and forecast Medicare enrollment in California for the year 2020.
Year | Enrollment |
2013 | 5,300,177 |
2014 | 5,481,592 |
2015 | 5,653,896 |
2016 | 5,814,275 |
2017 | 5,965,489 |
Please round intermediate results to one decimal and your final answer to the closest integer.
(To facilitate grading, please enter only the number without commas separating thousands and millions. For example, if your final answer is 1 million, enter 1000000.)
In: Statistics and Probability
Company | Segment | 2012 U.S. Average Sales per Store ($1,000s) |
---|---|---|
Arby's | Sandwich | 993 |
Baskin-Robbins | Snacks | 207 |
Bojangles' | Chicken | 1,718 |
Boston Market | Chicken | 1,184 |
Burger King | Burger | 1,195 |
Carl's Jr. | Burger | 1,470 |
Checkers/Rally's | Burger | 903 |
Chick-fil-A | Chicken | 3,158 |
Chipotle Mexican Grill |
Mexican | 2,113 |
Church's Chicken | Chicken | 707 |
CiCi's Pizza | Pizza/Pasta | 915 |
Culver's | Burger | 1,838 |
Dairy Queen | Burger | 545 |
Del Taco | Mexican | 1,100 |
Domino's Pizza | Pizza/Pasta | 710 |
Dunkin' Donuts | Snacks | 857 |
El Pollo Loco | Chicken | 1,475 |
Five Guys Burgers & Fries |
Burger | 1,049 |
Hardee's | Burger | 1,145 |
In-N-Out Burger | Burger | 1,935 |
Jack in the Box | Burger | 1,379 |
Jamba Juice | Snacks | 715 |
Jason's Deli | Sandwich | 2,556 |
Jimmy John's | Sandwich | 879 |
Company | Segment | 2012 U.S. Average Sales per Store ($1,000s) |
---|---|---|
KFC | Chicken | 957 |
Krispy Kreme | Snacks | 2,428 |
Little Caesars | Pizza/Pasta | 465 |
McDonald's | Burger | 2,600 |
Moe's Southwest Grill |
Mexican | 1,103 |
Panera Bread | Sandwich | 2,427 |
Papa John's | Pizza/Pasta | 829 |
Papa Murphy's | Pizza/Pasta | 575 |
Pizza Hut | Pizza/Pasta | 883 |
Popeyes Louisiana Kitchen |
Chicken | 1,242 |
Qdoba Mexican Grill |
Mexican | 966 |
Quiznos | Sandwich | 345 |
Sonic Drive-In | Burger | 1,074 |
Starbucks | Snacks | 1,223 |
Steak 'n Shake | Burger | 1,700 |
Subway | Sandwich | 481 |
Taco Bell | Mexican | 1,363 |
Tim Hortons | Snacks | 1,095 |
Wendy's | Burger | 1,484 |
Whataburger | Burger | 1,996 |
White Castle | Burger | 1,284 |
Wingstop | Chicken | 902 |
Zaxby's | Chicken | 1,766 |
What was the mean U.S. sales (in thousand dollars) per store for the 47 restaurant chains? (Round your answer to one decimal place.)
$ thousand
(b)
What are the first and third quartiles (in thousand dollars)?
first quartile $ thousand
third quartile $ thousand
What is your interpretation of the quartiles?
Twenty-five Seventy-five One hundred percent of the restaurants have an average sales per unit less than or equal to the first quartile and ______ percent of the restaurants have an average sales per unit greater than or equal to the third quartile.
(c) Show a boxplot for the level of sales.
Are there are any outliers in terms of sales that would skew the results? Yes/No
(d) Develop a frequency distribution showing the average sales (in thousand dollars) per store for each segment. (Round your answers to the nearest thousand dollars.)
Segment | Average Sales per unit ($1,000s) |
---|---|
Burger | $ thousand |
Chicken | $ thousand |
Mexican | $ thousand |
Pizza/Pasta | $ thousand |
Sandwich | $ thousand |
Snacks | $ thousand |
Comment on the results obtained: The segment with lowest average sales per unit is _____
In: Statistics and Probability
According to an English professor, the following information represents the percent of times that each letter is used in the English language:
E = 11.16%, A = 8.5%, I = 7.54%, O = 7.16%, U = 3.63% (E.g. 11.16% of letters found in any writing sample will be letter E's)
Choose a paragraph of at least 30 words from any written source and count the number of times each vowel shows up in your paragraph. Then, perform a goodness-of-fit test to see if the above percents are a good fit for the population based on your sample paragraph.
Please make sure you include your paragraph when you submit your assignment. Note spaces, apostrophe's, etc. do not count towards the total number of letters in your paragraph.
Paragraph:
The Centers for Disease Control and Prevention have advised people to “put distance” between themselves and others if the virus is spreading in their community. Thousands of Americans are following this directive, and numerous workplaces have directed their employees to work from home.
In: Statistics and Probability
what does a 9 cell industry attractiveness/business strength matrix displaying PepsiCo's business look like?
In: Statistics and Probability
A researcher was curious about whether people’s attitudes are related to their behaviors. As a simple experiment, the researcher measured people’s attitudes and behaviors regarding housework.
On the attitude side, the researcher measured the level of agreement on the statement “It is much better for everyone involved in housework: strongly agree, agree, disagree, and strongly disagree” On the behavioral side, the respondents are asked to estimate how many hours of housework they do per week.
Based on your analysis of the data below, please come up with a conclusion whether people who respond differently to the attitudinal question actually behave differently?
Once you have arrived at an answer to each question, please write a sentence or two interpreting the result (100 pts).
Attitude |
Strongly Agree |
Agree |
Disagree |
Strongly Disagree |
34 |
28 |
40 |
43 |
|
31 |
26 |
42 |
42 |
|
33 |
24 |
39 |
38 |
|
32 |
26 |
32 |
39 |
|
28 |
32 |
41 |
41 |
|
27 |
34 |
39 |
44 |
|
43 |
27 |
43 |
48 |
|
38 |
26 |
44 |
36 |
|
27 |
31 |
41 |
||
31 |
39 |
|||
Sample N |
10 |
9 |
10 |
8 |
Sample Mean |
32.40 |
28.22 |
40.00 |
41.38 |
Sample St.Dev |
5.03 |
3.35 |
3.30 |
3.78 |
c.1. Find the grand mean, the mean of all values in the samples. (10 pts)
c.2. Find the between-group variance. (15 pts)
c3. Find the within group variance.
c.4. Compute the F-value. (15 pts)
In: Statistics and Probability
22.HE.A: It has been assumed that children are born with equal frequency with respect to days of the week. Randomly selected birth records are obtained and results are listed in the table below.
Sunday |
Monday |
Tuesday |
Wednesday |
Thursday |
Friday |
Saturday |
36 |
55 |
62 |
60 |
60 |
58 |
48 |
In: Statistics and Probability
X= weight of copper
Y=Actual Price of Copper in (USD)
x : 0.3 0.4 0.5 0.5 1.0 0.7
y : 510 1151 1343 1410 5669 2277
(a) Find the regression equation for the data points given.
(b) Determine the percentage of variation in price of Copper y that is explained by the weight x.
(c) Is it reasonable to predict the price of a 0.8 grams of copper using this model?
(d) Is it reasonable to predict the price of a 1.5 grams of copper using this model?
This is strictly to check answers.
In: Statistics and Probability
Camellia plants were grown in two different settings - low light and high light - and the leaf widths of the plants were measured (in millimeters). For the plants growing in low light areas, 25 leaves sampled returned a mean of 47.7 mm and a standard deviation of 3.96 mm. The sample of 25 plants in high-light areas had a mean leaf width of 55.8 mm with standard deviation of 7.26 mm. At the 5% significance level, do camellia plants in high light areas have significantly wider leaves?
(a) State the null and alternate hypothesis (symbolic or sentence form).
(b) State the p-value.
(c) Draw a conclusion about the claim.
In: Statistics and Probability
An entrepreneur opened a small hardware store in a strip mall. During the first few weeks, business was slow, with the store averaging only one customer every 20 minutes in the morning. Assume that the random arrival of customers is Poisson distributed. (a) What is the probability that at least 1 hour would elapse between customers? (b) What is the probability that 10 to 30 minutes would elapse between customers? (c) What is the probability that less than 5 minutes would elapse between customers?
In: Statistics and Probability
Use maximum likelihood to find the parameters in logistic regression, where the domain is x and the sigmoid is used for the ’activation’.
In: Statistics and Probability
Examine the statistical probability concept with making ethical decisions.
In: Statistics and Probability
A researcher surveyed randomly selected Democrats and Republicans asking them what the number one concern should be for the president of the United States. The results of the survey are shown below. Is there evidence to conclude that there is a difference in what Democrats and Republicans think is the most important?
Economy | Foreign Affairs | Family Values | Environment | Other | |
---|---|---|---|---|---|
Democrats | 88 | 55 | 83 | 70 | 32 |
Republicans | 114 | 71 | 120 | 40 | 36 |
What can be concluded at the αα = 0.10 significance level?
In: Statistics and Probability