Question

In: Statistics and Probability

Included in a classification or prediction model, highly correlated values, or variables that are unrelated to...

Included in a classification or prediction model, highly correlated values, or variables that are unrelated to the outcome of interest can lead to overfitting, and reliability can suffer. T/F

It is best to normalize when the units of measurement are common for the variables and when their scale reflects their importance. T/F

This is a useful procedure for reducing the number of predictors in the model by analyzing the input variables. It is intended to be used with quantitative variables. A.correlation Analysis , B Correspoding Anaylysis, C Principle components analysis D, ABC

This provides the sense of how dispersed the data are relative to the mean. A.Standard Deviation, B.Variance, C.Mean D.Average

Solutions

Expert Solution

1) Included in a classification or prediction model, highly correlated values, or variables that are unrelated to the outcome of interest can lead to overfitting, and reliability can suffer.  

True

Because the correlation between such variables is not cause effect relationship.

So it's lead to overfitting, and reliability can suffer.

2) It is best to normalize when the units of measurement are common for the variables and when their scale reflects their importance. T/F

False.

If the units of the measurements are common then we don't need to normalize the variables.

we can compare such variables which are common unit variables .

3) This is a useful procedure for reducing the number of predictors in the model by analyzing the input variables. It is intended to be used with quantitative variables. A.correlation Analysis , B Correspoding Anaylysis, C Principle components analysis D, ABC

The correct option is C) Principle component analysis.

Because: Correlation analysis is useful to investigate the relationship between the quantitative variables.

Correspondence analysis is used to investigate the relationship between two qualitative variables.

Principal component analysis (PCA) is a statistical procedure that uses an orthogonal transformation to convert a set of

observations of possibly correlated variables into a set of values of linearly uncorrelated variables which reduces the number

of predictors. So correct option is C.

4)

This provides the sense of how dispersed the data are relative to the mean. A.Standard Deviation, B.Variance, C.Mean D.Average

The correct option is A) Standard deviation.

Because : it's measure the spread within the data-set and the measurement unit of standard deviation is same as the mean.


Related Solutions

Prediction: A common goal in regression is to use the estimated model for prediction. a) (1...
Prediction: A common goal in regression is to use the estimated model for prediction. a) (1 points) Using your model from 2c, provide a highway driving CO2 emissions prediction for vehicles that have a city CO2 emissions of 550. Show work by hand! You may validate with code provided in R. b) (1 point) A 2017 Ferrari California T has a city driving CO2 Emission of 550 and an observed highway driving CO2 Emission of 389. How far off is...
1. In reality, overall growth and income for the poorest are: a. highly positively correlated, but...
1. In reality, overall growth and income for the poorest are: a. highly positively correlated, but not perfectly. b. uncorrelated. c. highly negatively correlated, but not perfectly. d. slightly positively correlated. 2. Economists like Hernando de Soto feel that a ___________ is the key to economic development. a. strong national defense b. strong currency c. strong titling system d. stable currency 3. Peruvian economist Hernando de Soto claims the ___________ in Latin America results in "dead capital." a. weak titling...
A- It is documented that human capital and inherited wealth are highly correlated. How might this...
A- It is documented that human capital and inherited wealth are highly correlated. How might this influence a redistributive government’s policy action(s)? B- Summarize three facts from the article ”International Comparisons of Economic Mobility” by Julia B. Isaacs (article provided on the class website). C-What do you consider to be a fair tax? What are the two main opposing views on this issue? Support your answer with specific arguments or examples.
1. This is especially valuable when we have subsets of measurements that are highly correlated. In...
1. This is especially valuable when we have subsets of measurements that are highly correlated. In that case it provides few variables that are weighted linear combinations of the original variables that retain the explanatory power of the full original set. A. Correlation analysis B. Affinity analysis C. Variance Analysis D. Principal Component Analysis 2. These are very helpful for data reduction. The information they convey can assist in combining categories, in choosing the variables to remove, and in assessing...
A firm has discovered that its sales are highly correlated to a certain economic parameter. The...
A firm has discovered that its sales are highly correlated to a certain economic parameter. The value for that parameter has been predicted to be 15 for the next year (7th year). Use regression modeling to develop a forecast for the next year. Year 1 2 3 4 5 6 Econ. Parameter 16.2 14.9 18.5 16.1 13.8 15.4 # Sales 120 100 165 140 100 110
Which of the following environmental factors is highly correlated with color vision in fish? water temperature...
Which of the following environmental factors is highly correlated with color vision in fish? water temperature water salinity water depth the coloration of prey
1.These learning algorithms are used in classification and prediction and must have data available in which...
1.These learning algorithms are used in classification and prediction and must have data available in which value of the outcome of interest is known. Simple Linear Regression analysis is an example of this. A. Correlation Analysis B. Supervised Learning C. Unsupervised Learning D. Confusion Matrix 2.This partition is used to assess the performance of each model so that you can compare models and pick the best one. A. Training Partition B. Test Partition C. Validation Partition D. None of the...
Omitted variable bias: a. exists if the omitted variable is correlated with the included regressor but is not a determinant of the dependent variable. b. exists if the omitted variable is correlated
Omitted variable bias:a. exists if the omitted variable is correlated with the included regressor but is not a determinant of the dependent variable.b. exists if the omitted variable is correlated with the included regressor and is a determinant of the dependent variable.c. will always be present as long as the regression R2 < 1.d. is always there but is negligible in almost all economic examples.
1. White dwarfs are not included in the luminosity classification because they are: a) no longer...
1. White dwarfs are not included in the luminosity classification because they are: a) no longer producing energy by nuclear reactions b) too small c) not yet active stars; they have not yet begun nuclear reactions d) no longer radiating energy away 2. Absolute magnitude is defined as the apparent magnitude that a star would have if a) it were located at exactly 10 pc from Earth b) it were located at exactly 10 AU from Earth c) all the...
Suppose that m = 2, and that the p-values for the two tests are positively correlated,...
Suppose that m = 2, and that the p-values for the two tests are positively correlated, so that if one is small then the other will tend to be small as well, and if one is large then the other will tend to be large. How does the family-wise error rate associated with these m = 2 tests qualitatively compare to the answer in (b) with m = 2? In (b), FWER = 1-(1-alpha)^m
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT