Question

In: Statistics and Probability

Census data was collected on the 50 states and Washington, D.C. We are interested in determining...

Census data was collected on the 50 states and Washington, D.C. We are interested in determining whether average lifespan (LIFE) is related to the ratio of males to females in percent (MALE), birth rate per 1,000 people (BIRTH), divorce rate per 1,000 people (DIVO), number of hospital beds per 100,000 people (BEDS), percentage of population 25 years or older having completed 16 years of school (EDUC) and per capita income (INCO).

(a) Fit the MLR model with LIFE (y) as the response variable, and MALE (x1), BIRTH (x2), DIVO (x3), BEDS (x4), EDUC (x5), and INCO (x6), as predictors.
(b) At level α =5%, conduct the F-test for the overall fit of the regression. Comment on the results.

Concerning (b), what is the H0 and H1 for the F-test in THIS case?(please use the above case to explain). What command in R can we use to find the value that 0.05 has to compare with?What can we conclude if that value < 0.05? (I didn't provide the data as I only want to know how to do it instead of the exact answer, you can just assume that is a variable A here)

Solutions

Expert Solution

I assume that you have already fitted the MLR model and it will look like as,

LIFE (y) = Intercet + b0* MALE (x1)+ b1* BIRTH (x2)+ b2*DIVO (x3)+b3* BEDS (x4)+b4*EDUC (x5)+b5*INCO (x6)

concerning B)

Following is the set of hypothesis you would need for F-test

H0: The fitted Model is good that is, model with no independent variables fits the data as well as your model.

That is all the coefficients are equal to 0

H1: The fitted Model is not good that is, your model fits the data better than the intercept-only model.

That is all the coefficients are not equal to 0

What command in R ?

lm(LIFE (y) ~ MALE + BIRTH+DIVO+ BEDS+EDUC+INCO, data = data)

gives you the complete output and there only you can get p-value,

Following is the screenshot for your reference,

Last line gives you p-value

If -p value is less than 0.05 then we can conclude that, We Reject H0 and conclude that

The fitted Model is not good that is, your model fits the data better than the intercept-only model.

That is all the coefficients are not equal to 0


Related Solutions

Census data was collected on the 50 states and Washington, D.C. We are interested in determining...
Census data was collected on the 50 states and Washington, D.C. We are interested in determining whether average lifespan (LIFE) is related to the ratio of males to females in percent (MALE), birth rate per 1,000 people (BIRTH), divorce rate per 1,000 people (DIVO), number of hospital beds per 100,000 people (BEDS), percentage of population 25 years or older having completed 16 years of school (EDUC) and per capita income (INCO). "STATE" "MALE" "BIRTH" "DIVO" "BEDS" "EDUC" "INCO" "LIFE" AK...
Census data was collected on the 50 states and Washington, D.C. We are interested in determining...
Census data was collected on the 50 states and Washington, D.C. We are interested in determining whether average lifespan (LIFE) is related to the ratio of males to females in percent (MALE), birth rate per 1,000 people (BIRTH), divorce rate per 1,000 people (DIVO), number of hospital beds per 100,000 people (BEDS), percentage of population 25 years or older having completed 16 years of school (EDUC) and per capita income (INCO). "STATE" "MALE" "BIRTH" "DIVO" "BEDS" "EDUC" "INCO" "LIFE" AK...
Census data was collected on the 50 states and Washington, D.C. We are interested in determining...
Census data was collected on the 50 states and Washington, D.C. We are interested in determining whether average lifespan (LIFE) is related to the ratio of males to females in percent (MALE), birth rate per 1,000 people (BIRTH), divorce rate per 1,000 people (DIVO), number of hospital beds per 100,000 people (BEDS), percentage of population 25 years or older having completed 16 years of school (EDUC) and per capita income (INCO). "STATE" "MALE" "BIRTH" "DIVO" "BEDS" "EDUC" "INCO" "LIFE" AK...
A friend who lives in Los Angeles makes frequent consulting trips to Washington, D.C.; 50% of...
A friend who lives in Los Angeles makes frequent consulting trips to Washington, D.C.; 50% of the time she travels on airline #1, 20% of the time on airline #2, and the remaining 30%of the time on airline #3. For airline #1, flights are late into D.C. 15% of the time and late into L.A. 10% of the time. For airline #2, these percentages are 40% and 30%, whereas for airline #3 the percentages are 35% and 20%. If we...
Suppose that we collected annual data to identify factors determining technological innovation of firms (Data Obtained)...
Suppose that we collected annual data to identify factors determining technological innovation of firms (Data Obtained) a. Technological Innovation (number of new patents) of firms b. No of employees / c. R&D expenditures /d. Appropriability /e. No of technological alliances with another firms. We want to check if 1) there is inverted U shape relation between the no of alliances and the technological innovation 2) the impact of the R&D expenditure on the technological innovation are different by appropriability level...
The following data was collected by a student performing the DETERMINING THE MOLAR ENTHALPY OF NEUTRALIZATION...
The following data was collected by a student performing the DETERMINING THE MOLAR ENTHALPY OF NEUTRALIZATION portion of the experiment. 50.00 mL of a 0.250 M acid is combined with 50.00 mL of 0.255 M NaOH. Before the reaction, the acid and base are at a temperature of 24.92 °C. After mixing, the neutralized solution reaches a maximum temperature of 26.50 °C in a calorimeter (Ccal=58.4 J/°C). The neutralized solution has a specific heat of 3.89 J/g°C and a density...
Use the data and Excel to answer this question. It contains the United States Census Bureau’s...
Use the data and Excel to answer this question. It contains the United States Census Bureau’s estimates for World Population from 1950 to 2014. You will find a column of dates and a column of data on the World Population for these years. Generate the time variable t. Then run a regression with the Population data as a dependent variable and time as the dependent variable. Have Excel report the residuals. (a) Based on the ANOVA table and t-statistics, does...
Use the data and Excel to answer this question. It contains the United States Census Bureau’s...
Use the data and Excel to answer this question. It contains the United States Census Bureau’s estimates for World Population from 1950 to 2014. You will find a column of dates and a column of data on the World Population for these years. Generate the time variable t. Then run a regression with the Population data as a dependent variable and time as the dependent variable. Have Excel report the residuals. (a) Based on the ANOVA table and t-statistics, does...
Logistic Regression In logistic regression we are interested in determining the outcome of a categorical variable....
Logistic Regression In logistic regression we are interested in determining the outcome of a categorical variable. In most cases, we deal with binomial logistic regression with the binary response variable, for example yes/no, passed/failed, true/false, and others. Recall that logistic regression can be applied to classification problems when we want to determine a class of an event based on the values of its features.    In this assignment we will use the heart data located at   http://archive.ics.uci.edu/ml/datasets/Statlog+%28Heart%29 Here is the...
We are interested in determining if the amount of weight a woman gains during pregnancy affects...
We are interested in determining if the amount of weight a woman gains during pregnancy affects the babies birth weight. The following data are provided: Moms Weight Gained Baby's Weight in Pounds 32 6.88 23 6.81 50 7.25 15 8.81 12 5.81 60 6.56 20 7.38 45 7.19 22 8.69 20 5.88 Which one is the independent variable and which one is the dependent variable? Find b1 Find b0. Find SST. Find SSR. Find SSE. Find the coefficient of determination....
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT