Question

In: Statistics and Probability

Using the unemployment data​ provided, investigate the association between the male unemployment rate in 2007 and...

Using the unemployment data​ provided, investigate the association between the male unemployment rate in 2007 and 2010 for a sample of 52 countries. Complete parts a through d.

​a) Find a regression model predicting the 2010 rate from the 2007 rate for the sample of 52 countries. State in simple language what the model says.

2010 Index=_________+____________x 2007 index ​(Round to two decimal places as​ needed.)

State in simple language what the model says. Select the correct choice below and fill in the answer box to complete your choice. ​(Round to two decimal places as​ needed.)

  1. The 2010 rate increased by _________ per year.
  2. For each​ 1% increase in the 2010​ rate, the 2007 rate increased by __________
  3. For each​ 1% increase in the 2007​ rate, the 2010 rate increased by ___________
  4. The 2007 rate increased by __________ per year.

b) Determine the test statistic.

T=_________ ​(Round to two decimal places as​ needed.)

Determine the​ P-value.

P=_______________​(Round to three decimal places as​ needed.)

Make a conclusion.

Since the​ P-value is (1) the significance​ level, α​, (2) the null hypothesis. The association (3) significant.

​c) What percentage of the variability in the 2010 Index is accounted for by the regression​ model?

The regression model accounts for nothing ​% of the variability in the 2010 Index.

​(Round to one decimal place as​ needed.)

Country Male 2007 Male 2010
1 24.1 22.4
2 11.8 10.7
3 8.7 11.7
4 19.6 17.7
5 26.8 22.2
6 13.9 13.2
7 16.4 14.3
8 13.5 10.7
9 12.1 11.8
10 30.5 29.5
11 9.5 13.3
12 22.2 18.8
13 8.5 8.2
14 14.9 11.3
15 22.8 15.1
16 28.4 16.6
17 23.8 4.2
18 25.3 26.4
19 13.8 26.1
20 18.6 18.8
21 8.3 4.3
22 15.2 20.8
23 8.8 8.2
24 19.1 8.2
25 21.3 16.8
26 11.2 9.7
27 11.3 12.3
28 15.2 11.5
29 22.2 16.3
30 21.8 20.9
31 6.8 6.2
32 9.9 8.5
33 12.3 12.1
34 22.8 17.2
35 19.7 21.1
36 18.1 15.3
37 38.3 35.4
38 12.1 14.3
39 22.6 24.8
40 22.1 18.5
41 22.1 21.6
42 34.6 30.1
43 12.8 10.4
44 19.7 17.2
45 22.7 20.8
46 16.3 15.7
47 7.7 7.6
48 5.1 5.7
49 65.3 63.8
50 19.8 19.7
51 11.8 12.4
52 12.3 12.8

Solutions

Expert Solution

I have used R code to build simple linear regression model to the given data set.

(a) SIMPLE LINEAR REGRESSION R OUTPUT:

ESTIMATED SIMPLE LINEAR REGRESSION EQUATION:

The estimated simple linear regression equation is,

where

is the predicted dependent variable "Male unemployment rate in 2010"

is the intercept

is the slope coefficient of independent variable "Male unemployment rate in 2007"

is the independent variable "Male unemployment rate in 2007"

The model says that Option (C): For each​ 1% increase in the 2007​ rate, the 2010 rate increased by 87% since the slope coefficient of independent variable "Male unemployment rate in 2007" is .

(b) T TEST STATISTIC:

From the given R output,

The value of t test statistic is and the p value for t test statistic is .

T TEST FOR INDIVIDUAL SIGNIFICANCE OF PREDICTOR:

HYPOTHESIS:

The t test is used to test the significance of individual predictor "Male unemployment rate in 2007"

The hypothesis is given by,

(That is, the slope coefficient of predictor "Male unemployment rate in 2007" is not statistically significant. In other words, there is no significant linear relationship between independent variable "Male unemployment rate in 2007" and dependent variable "Male unemployment rate in 2010")

(That is, the slope coefficient of predictor "Male unemployment rate in 2007" is statistically significant. In other words, there is significant linear relationship between independent variable "Male unemployment rate in 2007" and dependent variable "Male unemployment rate in 2010")

CONCLUSION:

Since the calculated p value is less than the significance level , we reject null hypothesis and conclude that there is significant linear relationship between independent variable "Male unemployment rate in 2007" and dependent variable "Male unemployment rate in 2010".

(c) COEFFICIENT OF DETERMINATION:

From the R output, the coefficient of determination is . Thus the regression model accounts for % of the total variability in the dependent variable "Male unemployment rate in 2010".


Related Solutions

Using the unemployment data provided, investigate the association between the male unemployment rate in 2007 and...
Using the unemployment data provided, investigate the association between the male unemployment rate in 2007 and 2010 for a sample of 52 countries. (If you use EXCEL please show me how you solved this please - it would be great help to see the steps) Thank you! Find a regression model predicting the 2010 rate from the 2007 rate for the sample of 52 countries . State in simple language what the model says. 2010 Index = ___________ + _____________________...
1. correlation It is interested to investigate if there is an association between body weight and...
1. correlation It is interested to investigate if there is an association between body weight and the concentration of cholesterol in the blood. Subjects were randomly selected from a population of adult males between the ages of 50 and 55 with a height of between 60 and 69 inches in height. the data appears in the following table weight (pounds) X cholesterol (mg/100 ml) Y 146 181 205 228 157 182 165 249 184 259 153 201 220 339 181...
In this case study you investigate how good the official ‘unemployment rate’ captures the unemployment in...
In this case study you investigate how good the official ‘unemployment rate’ captures the unemployment in the economy. You should reference the two articles provided on this subject above, the material in your textbook related to this topic and one more article of your choosing. Address the following questions in your paper: 1. Does the official unemployment rate capture the true unemployment in the economy? 2. What are the ways to address any flaws in the report?
Jane wishes to investigate the association between the average length of sleep in hours and an...
Jane wishes to investigate the association between the average length of sleep in hours and an NUS student’s Cumulative Average Point (CAP). She collated responses from 250 undergraduates, found the correlation to be 0.87, and established a linear regression equation of CAP (y) against the length of sleep (x): y=0.5x+0.5 Which of the following is true? A) If we group the data into different faculties and we take the average C.A.P and average length of sleep per faculty, we will...
Explain the relationship between the unemployment rate and economic growth, and why the unemployment rate is...
Explain the relationship between the unemployment rate and economic growth, and why the unemployment rate is or is not significant to the economy.
A building society manager wishes to investigate the degree of association between the level of interest...
A building society manager wishes to investigate the degree of association between the level of interest rates and the flow of building society deposits. The following historical data is available: Date Interest rates Building of society deposits(inflows) £ m =============================================================== 2006Jan 11 0.8 2006July 9.5 1.1 2007Jan 9 1.4 2007July 12 1.2 2008Jan 12 1.5 2008July 12 1.3 2009Jan 12.5 1.7 2009July 10 1.1 2010Jan 11 2.0 2010July 10 1.1 2011Jan 8.5 1.2 2011July 10.5 1.9 2012Jan 13 2.5 2012July...
: An observational study is conducted to investigate the association between age and total serum cholesterol....
: An observational study is conducted to investigate the association between age and total serum cholesterol. The correlation is estimated at r = 0.35. The study involves n=125 participants and the mean (std dev) age is 44.3 (10.0) years with an age range of 35 to 55 years, and mean (std dev) total cholesterol is 202.8 (38.4). a. Estimate the equation of the line that best describes the association between age (as the independent variable) and total serum cholesterol. b....
In early 2007 the Mortgage Lenders Association reported that homeowners were defaulting on their adjustable-rate mortgages...
In early 2007 the Mortgage Lenders Association reported that homeowners were defaulting on their adjustable-rate mortgages in record numbers. The foreclosure rate for that particular year was 1.45%. To see if there has been any change in the foreclosure rate, the association plans to sample 237 homeowners to see if they are in danger of defaulting on their loan. Describe the sampling distribution of the sample proportion of homeowners with adjustable-rate mortgages who are in danger of defaulting on their...
A small study is conducted involving 10 infants to investigate the association between gestational age at...
A small study is conducted involving 10 infants to investigate the association between gestational age at birth, measured in weeks, and birth weight, measured in grams.: Gestational age: 34.7, 36, 29.3, 40.1, 35.9, 40.8, 38.3, 37, 41.2, 39.8. Birth weight:1895, 2030, 1440, 2835, 3115, 4013, 3174, 3625, 2289, 2845. a) Find the correlation coefficient of the gestational age at birth and birth weight. b) Test H0:ρ= 0 vs.H1:ρ does not equal 0. at 5% level of significance. Here ρ is...
A ["prospective", OR "retrospective"] cohort study is carried out to investigate the association between occupational arsenic...
A ["prospective", OR "retrospective"] cohort study is carried out to investigate the association between occupational arsenic inhalation and neurological exposure and neurological effects among workers in a copper smelter. For the sake of simplicity, let’s assume there are two possible exposure categories: high and low (for example, those working in the smelting process and those working in administration). The exposure was carefully assessed by review of company records which reflected very good exposure monitoring (both air sampling and urine testing)....
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT