Question

Question 2 (1 pt)

Which of these statements about multicollinearity is FALSE?

A. If the average variance inflation factor is greater than 1 then the regression model might be biased.
B. Multicollinearity in the data is shown by a VIF (variance inflation factor) greater than 10.
C. Tolerance values above 0.2 may indicate multicollinearity in the data.
D. The tolerance is 1 divided by the VIF (variance inflation factor).

Question 3 (1 pt)

Recent research has shown that professors are among the most stressed workers. The output below shows the results of a regression using several variables to predict stress among professors. (Data from Cooper, 1988).

Based on the output above, which of the predictors is significantly related to burnout? Check all that apply.

A. Stress from research
B. Perceived control
C. Stress from teaching
D. Stress from providing pastoral care

Question 4 (1 pt)

Using the same output from Question 3, how would we interpret the b-value of perceived control? Please use Model 3 to answer this question.

A. As perceived control increases by .675 units, burnout increases by one unit controlling for the other variables.
B. As perceived control increases by 8.271 units, burnout increases by one unit.
C. As perceived control increases by one unit, burnout increases by .675 units.
D. As perceived control increases by one unit, burnout increases by .675 units, controlling for the other variables.

Question 5 (1 pt)

Again using the output from Question 3, which variables would we consider eliminating from our model due to concerns of multicollinearity?

A. Stress from research
B. Perceived control
C. Stress from teaching
D. Stress from providing pastoral care

Question 6 (1 pt)

Which statistic is useful for assessing the influence of a single predictor in a linear regression? Check all that apply.

A. R² change
B. t-statistic
C. Unstandardized B
D. Chi-square

Question 7 (1 pt)

Which of the following are potential sources of bias in a linear model?

A. Z-scores and influential cases
B. Coefficients and outliers
C. T-statistics and influential cases
D. Outliers and influential cases

Question 8 (1 pt)

The head of retail sales at a large cosmetic company was interested in determining what the best marketing model was for launching a forthcoming new product to ensure high sales. She ran two separate simple linear regressions; the first used money spent on social media marketing as a predictor and the second had money spent on print media as a predictor. The model featuring social media marketing as a predictor had an R² of .665, an adjusted R² of .661, and an F-statistic of 112.56 (p < .001). The model featuring print media marketing as a predictor had an R² of .705, an adjusted R² of .15, and an F-statistic of 34 (p < .001). Which marketing model should she invest in, based on these findings, to generate higher predicted sales?

A. The model featuring print media marketing as a predictor is the better of the two models.
B. The model featuring social media marketing as a predictor is the better of the two models.
C. Neither model is effective.
D. The model featuring social media as a predictor is better but biased.

Question 9 (1 pt)

A researcher was interested in examining what factors influenced children’s scores in a fitness test. He ran a multiple linear regression, which included four predictors (‘hours spent taking part in physical activity per day’, ‘calories consumed per day’, ‘BMI’, and ‘hours spent watching TV per day’). His model had an R² of .739, an adjusted R² of .742, and an F-statistic of 109.46 (p < .001). How would you interpret his findings?

A. It is not a significant model.
B. It is a significant model where the four predictors account for 74% of the variance in the children’s scores in the fitness test.
C. It is a significant model where the four predictors account for 109% of the variance in the children’s scores in the fitness test.

Question 10 (1 pt)

The same researcher noticed that his residual scatterplot seemed to violate homoscedasticity. What should he do in order to ensure that his model is robust?

A. Throw out any outliers and re-run the model.
B. Run the model as a stepwise regression so he can manually throw out any bad predictors.
C. Perform bootstrapping.
D. Without seeing the scatterplot, we can’t tell what he should do.

Solutions

Expert Solution

Note: Questions that need additional data (the regression output) are skipped. Questions that can be solved with the information provided are answered below with explanations.

Question 2

Which of these statements about multicollinearity is FALSE?

Correct option: C. Tolerance values above 0.2 may indicate multicollinearity in the data.
This statement is false because the direction is reversed: tolerance values below 0.2 may indicate multicollinearity, as does a VIF above 10.

The remaining options are correct.
A. If the average variance inflation factor is greater than 1 then the regression model might be biased.
B. Multicollinearity in the data is shown by a VIF (variance inflation factor) greater than 10.
D. The tolerance is 1 divided by the VIF (variance inflation factor).
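
The relationship between VIF and tolerance can be sketched in a few lines of Python. This is a minimal illustration, not output from the quiz data: the function names are hypothetical, and `r2_j` is assumed to be the R² obtained when predictor j is regressed on the other predictors. The cut-offs (tolerance below 0.2, VIF above 10) are the rules of thumb quoted in the answer.

```python
def vif_from_r2(r2_j):
    """VIF for predictor j, given the R^2 from regressing j on the other predictors."""
    return 1.0 / (1.0 - r2_j)


def tolerance_from_vif(vif):
    """Tolerance is the reciprocal of the VIF (statement D)."""
    return 1.0 / vif


def may_be_multicollinear(vif):
    """Apply the rule-of-thumb cut-offs: VIF above 10 or tolerance below 0.2."""
    return vif > 10 or tolerance_from_vif(vif) < 0.2


# Hypothetical R^2 values for three predictors (assumed for illustration only).
for r2_j in (0.50, 0.85, 0.95):
    vif = vif_from_r2(r2_j)
    print(f"R2_j={r2_j:.2f}  VIF={vif:.2f}  tolerance={tolerance_from_vif(vif):.3f}  "
          f"flag={may_be_multicollinear(vif)}")
```

Because tolerance is 1/VIF, a large VIF always corresponds to a small tolerance, which is why a tolerance "above 0.2" cannot be the warning sign.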


Question 6

Which statistic is useful for assessing the influence of a single predictor in a linear regression? Check all that apply.
Correct option: B. t-statistic.

For each beta coefficient we test the null hypothesis that the coefficient is zero (H0: β = 0) against the alternative that it is not (H1: β ≠ 0).

Next we check the p-value for the variable in the regression output. If it is less than 0.05, we reject the null hypothesis and conclude that the variable is a significant predictor. (Note: the p-value is calculated from the t-statistic.)
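
As a rough sketch of where the t-statistic comes from, the Python below fits a simple (one-predictor) regression by hand and divides the slope by its standard error. The data are made up for illustration and the function name is hypothetical; it is not the professors' burnout data from Question 3.

```python
import math


def slope_t_statistic(x, y):
    """Return (b, se_b, t) for the slope of a simple linear regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx                      # unstandardized slope
    a = mean_y - b * mean_x            # intercept
    sse = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    se_b = math.sqrt(sse / (n - 2) / sxx)  # standard error of the slope
    return b, se_b, b / se_b           # t = b / SE(b)


# Made-up illustrative data (an assumption, not from the source).
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
b, se_b, t = slope_t_statistic(x, y)
print(f"b = {b:.3f}, SE(b) = {se_b:.4f}, t = {t:.2f}")
```

The t value is then compared with a t distribution on n - 2 degrees of freedom to get the p-value that statistical software reports for the predictor.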


Question 7

Which of the following are potential sources of bias in a linear model?
Correct option: D. Outliers and influential cases

Bias in the model is created by extreme values, which pull the fitted line toward themselves and unbalance the model.
Z-scores, t-statistics, and coefficients are calculated from the observations; they are outputs of the analysis, so they cannot introduce bias into the model.
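
A small sketch of how one extreme value can swing a fitted line, using made-up numbers and a hypothetical helper `ols_slope`: five points lie exactly on y = x, then a single outlying case is appended.

```python
def ols_slope(x, y):
    """Least-squares slope for a simple regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy / sxx


# Made-up data (an assumption for illustration): a perfect line y = x.
clean_x = [1, 2, 3, 4, 5]
clean_y = [1, 2, 3, 4, 5]
slope_clean = ols_slope(clean_x, clean_y)

# Add one extreme case at x = 6 with a y value far above the line.
slope_outlier = ols_slope(clean_x + [6], clean_y + [30])

print(f"slope without the outlier: {slope_clean:.2f}")
print(f"slope with the outlier:    {slope_outlier:.2f}")
```

A single case changes the slope several-fold here, which is the sense in which outliers and influential cases bias the model.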


Question 8

The head of retail sales at a large cosmetic company was interested in determining what the best marketing model was for launching a forthcoming new product to ensure high sales. She ran two separate simple linear regressions; the first used money spent on social media marketing as a predictor and the second had money spent on print media as a predictor. The model featuring social media marketing as a predictor had an R² of .665, an adjusted R² of .661, and an F-statistic of 112.56 (p < .001). The model featuring print media marketing as a predictor had an R² of .705, an adjusted R² of .15, and an F-statistic of 34 (p < .001). Which marketing model should she invest in, based on these findings, to generate higher predicted sales?

Correct answer: B. The model featuring social media marketing as a predictor is the better of the two models.

For social media, R² (.665) and adjusted R² (.661) are very close, indicating that no junk variables are present in the model.
For print media, adjusted R² (.15) is much lower than R² (.705), indicating the presence of junk variables.
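
For reference, the usual adjusted R² formula is 1 - (1 - R²)(n - 1)/(n - k - 1), where n is the sample size and k the number of predictors. The sketch below applies it with hypothetical sample sizes, since the question does not report n; the function name is also an assumption.

```python
def adjusted_r2(r2, n, k):
    """Adjusted R^2 for a model with k predictors fitted to n observations."""
    if n - k - 1 <= 0:
        raise ValueError("need more observations than predictors plus one")
    return 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)


# Hypothetical sample sizes (the question does not give n).
print(round(adjusted_r2(0.665, n=100, k=1), 3))  # large sample: little shrinkage
print(round(adjusted_r2(0.705, n=4, k=1), 3))    # tiny sample: heavy shrinkage
```

The gap between R² and adjusted R² grows as the number of predictors approaches the number of observations, so a large gap is a warning that the raw R² overstates how well the model would generalize.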

Question 9

A researcher was interested in examining what factors influenced children’s scores in a fitness test. He ran a multiple linear regression, which included four predictors (‘hours spent taking part in physical activity per day’, ‘calories consumed per day’, ‘BMI’, and ‘hours spent watching TV per day’). His model had an R² of .739, an adjusted R² of .742, and an F-statistic of 109.46 (p < .001). How would you interpret his findings?

Correct answer: B. It is a significant model where the four predictors account for 74% of the variance in the children’s scores in the fitness test.
The p-value of the F-statistic is less than 0.05, hence the model is significant.
Coefficient of determination: R² = .739 (adjusted R² = .742).
It measures the proportion of variability in y explained by the predictors. Its value lies between 0 and 1; the greater the value, the better the model. In this case it is about .74, or 74%, hence the model is good.
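
The link between R² and the overall F-statistic can be sketched as F = (R²/k) / ((1 - R²)/(n - k - 1)). The sample size below is hypothetical, because the question does not report n, and the function names are assumptions for illustration.

```python
def f_statistic(r2, n, k):
    """Overall F-statistic for a regression with k predictors and n observations."""
    return (r2 / k) / ((1.0 - r2) / (n - k - 1))


def percent_variance_explained(r2):
    """Express R^2 as a whole-number percentage of variance explained."""
    return round(100 * r2)


# n = 160 is a hypothetical sample size chosen only to illustrate the formula.
print(f"F = {f_statistic(0.739, n=160, k=4):.2f}")
print(f"variance explained = {percent_variance_explained(0.739)}%")
```

An F-statistic is a ratio of explained to unexplained variance per degree of freedom, not a percentage, which is why option C (109%) misreads the output.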

