Questions
Suppose Y is an random variable. If P(a<Y<2a)=0.16 and the median of Y is 5, what...

Suppose Y is an random variable. If P(a<Y<2a)=0.16 and the median of Y is 5, what is a? Note: There may be more than one solution. Report all.

In: Math

1. The Coefficient of Determination is * a. the percent of variance in the dependent variable...

1. The Coefficient of Determination is *

a. the percent of variance in the dependent variable that can be explained by the independent variable

b. the ratio of the variance of Y to the variance of Y for a specific X

c. a measure of how strong the linear relationship is between the explanatory and response variables

2.

The null hypothesis for a regression model is state as *

a. beta_1=0: there is no relationship

b. beta_1 > 0: there is a positive relationship

c. rho=0: there is no relationship

d. rho < 1: there is a negative relationship

3.Choose the best interpretation of \beta_{0} *

a. the sample correlation between x and y

b. the change in y as x increases by 1 unit

c. the amount of uncertainty remaining after fitting the model

d. the value of y when all x's are zero

4.Linear regression analysis is used to assess the relationship between what two types of measurements? *

a. quantiative; quantitative

b. categorical; categorical

c. quantitative; categorical

In: Math

1. Explain in words what a confidence interval means to someone who has never taken statistics....

1. Explain in words what a confidence interval means to someone who has never taken statistics.

2. There is concern that rural Minnesota is aging at a different rate than urban Minnesota. We want to test if the average age in rural Minnesota is different from the average age in urban Minnesota. Write out the null and alternative hypotheses.

3. At the 5% significance level, do you reject the null hypothesis? Why? Explain this to someone who has never taken statistics.

In: Math

A random sample is drawn from a population with mean μ = 74 and standard deviation...

A random sample is drawn from a population with mean μ = 74 and standard deviation σ = 6.2. [You may find it useful to reference the z table.]

a. Is the sampling distribution of the sample mean with n = 18 and n = 47 normally distributed?

Yes, both the sample means will have a normal distribution.

No, both the sample means will not have a normal distribution.

No, only the sample mean with n = 18 will have a normal distribution.

No, only the sample mean with n = 47 will have a normal distribution.

b. Calculate the probability that the sample mean falls between 74 and 77 for n = 47. (Round intermediate calculations to at least 4 decimal places, “z” value to 2 decimal places, and final answer to 4 decimal places.)

In: Math

So this week we learned about the four V's of Big data - Velocity, Volume, Veracity...

So this week we learned about the four V's of Big data - Velocity, Volume, Veracity and Variety. As we know the velocity of data can vary so does it affect the other three V's? How are they affected? Also same if there are changes in other or one of the V then how are the rest affected?

In: Math

This assignment is a series of short answer, multiple choice, and fill-in-the-blank questions based on the...

This assignment is a series of short answer, multiple choice, and fill-in-the-blank questions based on the article, “The Effects of Hospital-Level Factors on Patients' Ratings of Physician Communication” (Al-Amin & Makaremet, 2016).

What statistical techniques were used for the results presented in Table 2?  Summarize the findings from this statistical analysis (20 points).

Statistical Technique:

Findings based on statistical technique:

            Overall model:

            Individual factors:

In: Math

Assume that z is the test statistic. (Give your answers correct to two decimal places.) (a)...

Assume that z is the test statistic. (Give your answers correct to two decimal places.)

(a) Calculate the value of z for Ho: μ = 10, σ = 2.8, n = 36, x = 11.4.

_________

(b) Calculate the value of z for Ho: μ = 120, σ = 26, n = 26, x = 125.9.

_________

(c) Calculate the value of z for Ho: μ = 18.2, σ = 4.4, n = 140, x = 18.88.

__________

(d) Calculate the value of z for Ho: μ = 81, σ = 13.5, n = 52, x = 78.6.

___________

In: Math

Conduct a hypothesis test to determine if  companies with negative revenue change tend to be on the...

Conduct a hypothesis test to determine if  companies with negative revenue change tend to be on the (500) market less time (and/or how much less time)?

The data is below. Please show all work in excel.

Company Name Revenue Change # Years on Fortune 500 List
Walmart 3.00% 24
Exxon Mobil 17.40% 24
Berkshire Hathaway 8.30% 24
Apple 6.30% 24
UnitedHealth Group 8.80% 24
McKesson 3.10% 24
CVS Health 4.10% 24
Amazon.com 27.10% 17
AT&T -2.00% 24
General Motors -5.50% 24
Ford Motor 3.30% 24
AmerisourceBergen 4.30% 24
Chevron 25.10% 24
Cardinal Health 6.90% 24
Costco 8.70% 24
Verizon 0.00% 24
Kroger 6.40% 24
General Electric -3.50% 24
Walgreens Boots Alliance 0.70% 24
JPMorgan Chase 8.00% 24
Fannie Mae 4.90% 21
Alphabet 22.80% 13
Home Depot 6.70% 24
Bank of America Corp. 7.00% 24
Express Scripts Holding -0.20% 19
Wells Fargo 3.80% 24
Boeing -1.20% 24
Phillips 66 26.50% 6
Anthem 6.10% 24
Microsoft 5.40% 24
Valero Energy 26.00% 20
Citigroup 6.80% 24
Comcast 5.10% 23
IBM -1.00% 24
Dell Technologies 21.40% 21
State Farm Insurance Cos. 2.90% 24
Johnson & Johnson 6.30% 24
Freddie Mac 13.70% 21
Target 3.40% 24
Lowes 5.50% 24
Marathon Petroleum 21.00% 7
Procter & Gamble -7.70% 24
MetLife 4.20% 24
UPS 8.20% 24
PepsiCo 1.20% 24
Intel 5.70% 24
DowDuPont 30.20% 24
Archer Daniels Midland -2.40% 24
Aetna -4.10% 18
FedEx 19.80% 24
United Technologies 4.50% 24
Prudential Financial 1.50% 24
Albertsons Cos. 1.60% 14
Sysco 9.90% 24
Disney -0.90% 24
Humana -1.10% 24
Pfizer -0.50% 24
HP 7.90% 24
Lockheed Martin 0.80% 24
AIG -5.40% 24
Centene 19.30% 9
Cisco Systems -2.50% 22
HCA Healthcare 6.50% 24
Energy Transfer Equity 26.60% 12
Caterpillar 18.00% 24
Nationwide 9.60% 24
Morgan Stanley 15.00% 24
Liberty Mutual Insurance Group 11.40% 24
New York Life Insurance 3.70% 24
Goldman Sachs Group 12.00% 19
American Airlines Group 5.00% 24
Best Buy 7.00% 24
Cigna 4.90% 24
Charter Communications 43.40% 18
Delta Air Lines 4.00% 24
Facebook 47.10% 6
Honeywell International 3.10% 24
Merck 0.80% 24
Allstate 5.40% 23
Tyson Foods 3.70% 24
United Continental Holdings 3.20% 24
Oracle 1.80% 23
Tech Data 40.20% 24
TIAA -2.90% 21
TJX 8.10% 24
American Express 5.20% 24
Coca-Cola -15.40% 24
Publix Super Markets 1.60% 24
Nike 6.10% 24
Andeavor 42.50% 19
World Fuel Services 24.70% 14
Exelon 6.90% 24
Massachusetts Mutual Life Insurance -11.40% 24
Rite Aid 6.90% 24
ConocoPhillips 33.80% 24
CHS 5.20% 19
3M 5.10% 24
Time Warner 6.70% 19
General Dynamics -1.20% 24
USAA 10.60% 24

In: Math

The Toyota Camry is one of the best-selling cars in North America. The cost of a...

The Toyota Camry is one of the best-selling cars in North America. The cost of a previously owned Camry depends on many factors, including the model year, mileage, and condition. To investigate the relationship between the car’s mileage and the sales price for Camrys, the following data show the mileage and sale price for 19 sales (PriceHub web site, February 24, 2012).

Miles (1,000s) Price ($1,000s)
22 16.2
29 16.0
36 13.8
47 11.5
63 12.5
77 12.9
73 11.2
87 13.0
92 11.8
101 10.8
110 8.3
28 12.5
59 11.1
68 15.0
68 12.2
91 13.0
42 15.6
65 12.7
110 8.3
(d) How much of the variation in the sample values of price does the model estimated in part (b) explain?
If required, round your answer to two decimal places.
%
(e) For the model estimated in part (b), calculate the predicted price and residual for each automobile in the data. Identify the two automobiles that were the biggest bargains.
If required, round your answer to the nearest whole number.

The best bargain is the Camry # in the data set, which has  miles, and sells for $ less than its predicted price.

The second best bargain is the Camry # in the data set, which has  miles, and sells for $ less than its predicted price.

(f) Suppose that you are considering purchasing a previously owned Camry that has been driven 30,000 miles. Use the estimated regression equation developed in part (b) to predict the price for this car.
If required, round your answer to one decimal place. Do not round intermediate calculations.
Predicted price: $
Is this the price you would offer the seller?
- Select answer -Yes or No?
Explain.

In: Math

What are the issues with large significance values?

What are the issues with large significance values?

In: Math

You discover an isolated population of island squirrels and collect 200 of them, finding leucism 12....

You discover an isolated population of island squirrels and collect 200 of them, finding leucism 12. Perform a hypothesis test for a difference in the proportions of leucism among this island population and the previously considered population. Report your conclusion at both the a=.01 and a=.05 level.

In: Math

Dry Goods Sales The data is for weekly sales in the dry goods department at a...

Dry Goods Sales

The data is for weekly sales in the dry goods department at a Wal*Mart store in the Northeast.  Peak values, I.e. spikes, usually occur at holiday periods.  Week 1 is the first week of February 2002.  To show continuity, week 1 of 2003 is represented as week 54 since week 53 represents the end of fiscal 2002 and start of the 2003 fiscal year. Dollar values are adjusted in order to disguise true sales figures, but trends in the data are retained for analysis puposes.

Week Sales in $
26 15200
27 15600
28 16400
29 15600
30 14200
31 14400
32 16400
33 15200
34 14400
35 13800
36 15000
37 14100
38 14400
39 14000
40 15600
41 15000
42 14400
43 17800
44 15000
45 15200
46 15800
47 18600
48 15400
49 15500
50 16800
51 18700
52 21400
53 20900
54 18800
55 22400
56 19400
57 20000
58 18100
59 18000
60 19600
61 19000
62 19200
63 18000
64 17600
65 17200
66 19800
67 19600
68 19600
69 20000
70 20800
71 22800
72 23000
73 20800
74 25000
75 30600
76 24000

77

21200

1.) Can you identify at least 6 holiday periods or special events that cause the spikes in the data?

a.) In each case give the week number, date, and what holiday or special event it represents

b.) Which holiday results in the maximum sales for this department and how much are the sales?

2.) Generate three linear models for this data. Each linear model should be generated from a pair of data points.

a.) For each linear model, find the equation of the line. Show your work. Write the equation in slope intercept form.

b.) For each linear model discuss the meaning of the slope and y-intercept. Also provide an analysis as to why you like or dislike that particular model

c.) Discuss the rationale behind the model that you believe best predicts future results.

3.) Predict and analyze sales for the next four weeks

a.) Using your most preferred linear model, predict sales for the next four weeks and show calculations

b.) Based on your preferred linear model, compute the percent rate of increase (y2-y1)/y1 for the next four weeks

4.) If you were a manager of this department store, what recommendation would you make to the person in charge of inventory?

In: Math

The mean systolic blood pressure for people in the United States is reported to be 122...

The mean systolic blood pressure for people in the United States is reported to be 122 millimeters of mercury (mmHg) with a standard deviation of 22.8 millimeters of mercury. The wellness department of a large corporation is investigating if the mean systolic blood pressure is different from the national mean. A random sample of 200 employees in a company were selected and found to have an average systolic blood pressure of 124.7 mmHg.

a) What is the probability a random employees blood pressure is higher than 135 mmHg.

b) What is the probability that 200 randomly selected employees mean blood pressure is greater than 127 mmHg.

c) The wellness department is providing a new health program to their employees. Past studies have shown 9.9 % of their employees have high blood pressure. Find the probability that if the wellness department examines 200 randomly selected employees, less than 12 employees will have high systolic blood pressure. Do you think the new program significantly lowers the number of employees with a high blood pressure.

In: Math

Age HRS1 58 32 24 46 32 40 29 40 34 86 49 40 60 40...

Age HRS1
58 32
24 46
32 40
29 40
34 86
49 40
60 40
78 25
39 5
67 15
22 40
  1. Develop a scatter plot with HRS1 (how many hours per week one works) as the dependent variable and age as the independent variable. Include the estimated regression equation and the coefficient of determination on your scatter plot.
  2. Does there appear to be a relationship between these variables (HRS1 and age)? Briefly explain and justify your answer.
  3. Calculate the slope (b1) and intercept (b0) coefficients and use them to develop an estimated regression equation that can be used to predict HRS1 given age. Conduct your analysis using Alpha (α ) of 0.05. Submit your Excel output or workings to receive full points. Hint: Use formulas and Excel, or Excel regression Tool (run a regression with HRS1 as dependent or y variable and age as independent or x variable).
  4. Interpret the slope coefficient b1 (the coefficient for the independent variable, age).
  5. Use t and F to test for a significant relationship between HRS1 and age. Use α = 0.05 and make sure you know what hypotheses you are using to conduct the significance tests.
  6. Calculate and interpret the coefficient of determination R2. Based on this R2, did the estimated regression equation provide a good fit? Briefly justify your answer. Hint: If you used Excel Regression Tool to answer part c, R2 was reported with your output.
  7. Use the estimated regression equation to predict the HRS1 for a 60 year old individual.

In: Math

The weight of navel oranges of a domestic farm is normally distributed with a mean of...

The weight of navel oranges of a domestic farm is normally distributed with a mean of 8.0oz and a standard deviation of 1.5oz. Suppose that you bought 10 oranges randomly sampled.

a) what are the mean of the sampling distribution and the standard error of the mean?

b) what is the probability that the sample mean is between 8.5 and 10.0 oz?

c) The probability is 90% that the sample mean will be between what two values symmetrically distributed around the population mean?

In: Math