Question

In: Statistics and Probability

Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample...

Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample size was about 122 subjects. The independent variable was the level of dentin lead (parts per million). Below are regressions using various dependent variables.

(a) Calculate the t statistic for each slope. From the p-values, which slopes differ from zero at α = .01? (Round your answers to 2 decimal places. Negative values should be indicated by a minus sign.)

Dependent Variable R2 Estimated Slope Std Error tcalculated p-value Differ from 0?
Highest grade achieved 0.050 -0.029 0.010 .004 (Click to select)  No  Yes
Reading grade equivalent 0.126 -0.093 0.022 .000 (Click to select)  Yes  No
Class standing 0.078 -0.005 0.002 .014 (Click to select)  Yes  No
Absence from school 0.073 4.700 1.620 .004 (Click to select)  No  Yes
Grammatical reasoning 0.090 0.132 0.070 .062 (Click to select)  No  Yes
Vocabulary 0.119 -0.196 0.002 .000 (Click to select)  Yes  No
Hand-eye coordination 0.035 0.060 0.027 .028 (Click to select)  Yes  No
Reaction time 0.016 9.300 3.560 .010 (Click to select)  No  Yes
Minor antisocial behavior 0.037 -0.819 0.579 .160 (Click to select)  No  Yes


(b) It would be inappropriate to assume cause and effect without a better understanding of how the study was conducted.


  • Yes

  • No

Solutions

Expert Solution

The detailed calculation of the tstat and pvalue is shown using excel formula.

Also, the significance is indicated for each variable.

the explanation for determining the significance.

For each beta coefficient, we test the following hypothesis.

We check the pvalue associated with that variable,
if the pvalue is less than 0.01(level of significance specified), then we reject the null hypothesis and conclude that the variable is significant or is statistically different from zero. Hence it is a significant predictor of y.

if the pvalue is greater than 0.01(level of significance specified), then we fail to reject the null hypothesis and conclude that the variable is not significant or is not statistically different from zero. Hence it is not a significant predictor of y and can be dropped from the regression.


(b) It would be inappropriate to assume cause and effect without a better understanding of how the study was conducted.
Yes


Explanation:

Based on the correlation of the independent variable with the target variable, we cannot conclude a cause and effect relationship.

Understanding causality
Causality defines a cause and effect between to variables. In other words, it indicates a relationship between two variables (events) where one variable effects the other.

For example: Variable 1: Average number of hours spent preparing for exams
Variable 2: Score or marks in the exams.

Here we have a cause and effect relationship if you do not study you will very poor marks. But you study every day for 3 hours you will get a good score in the exams.

Differentiating correlation from causation

Correlation between two variables defines the strength and the direction of the linear relationship between two variable.
By strength we mean, how strong or weak is the association between the two variables.

The correlation coefficient takes a value between 0 and 1 and it can have a positive or negative sign depending on the relationship.

Higher the value, stronger is the relationship.
A positive sign indicates that as one variables increase or decreases, the other variable also increases or decreases in the same proportion.

A negative sign indicates that as one variable increases the other decreases and vice versa.

For causation, we need to have an empirical relationship between the variable.
But for correlation, we only examine the values and do check for any intrinsic connection between the variables.

The correlation between lung cancer and drinking alcohol. But we cannot conclude that drinking causes lung cancer. There could be many other reasons for lung cancer.

On the other have smoking and lung cancer can have a causal relationship and also a high correlation.

Hence correlation does not indicate causation, correlation only indicates the strength of the relationship, causation can be established only with empirical evidence.

Therefor with out understanding how the study was conducted assuming a cause and effect relationship is incorrect.


Related Solutions

Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample...
Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample size was about 122 subjects. The independent variable was the level of dentin lead (parts per million). Below are regressions using various dependent variables.      (a) Calculate the t statistic for each slope. From the p-values, which slopes differ from zero at α = .01? (Round your answers to 2 decimal places. Negative values should be indicated by a minus sign.)    Dependent Variable...
Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample...
Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample size was about 122 subjects. The independent variable was the level of dentin lead (parts per million). Below are regressions using various dependent variables. (a) Calculate the t statistic for each slope. From the p-values, which slopes differ from zero at α = .01? (Round your answers to 2 decimal places. Negative values should be indicated by a minus sign.) Dependent Variable R2 Estimated...
Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample...
Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample size was about 122 subjects. The independent variable was the level of dentin lead (parts per million). Below are regressions using various dependent variables.      (a) Calculate the t statistic for each slope. From the p-values, which slopes differ from zero at α = .01? (Round your answers to 2 decimal places. Negative values should be indicated by a minus sign.)    Dependent Variable...
Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample...
Simple regression was employed to establish the effects of childhood exposure to lead. The effective sample size was about 122 subjects. The independent variable was the level of dentin lead (parts per million). Below are regressions using various dependent variables. (a) Calculate the t statistic for each slope, at significance level = 0.01. (Negative values should be indicated by a minus sign. Round your answers to 2 decimal places.)    Dependent Variable R2 Estimated Slope Std. Error tcalculated p-value Differ...
From the table below, use a simple linear regression analysis to establish the relationship that may...
From the table below, use a simple linear regression analysis to establish the relationship that may exist between a) number of confirmed cases and deaths; b) number of confirmed cases and number of tests performed; c) number of confirmed cases and number of recoveries; and d) number of deaths and number of recoveries. Briefly discuss these relations. Date Total confirmed Death Recoveries Test 1-Apr 195 5 3 12046 2-Apr 204 5 3 12046 3-Apr 205 5 3 12046 4-Apr 214...
A simple random sample of size n=300 individuals who are currently employed is asked if they...
A simple random sample of size n=300 individuals who are currently employed is asked if they work at home at least once per week. Of the 300 employed individuals​ surveyed, 27 responded that they did work at home at least once per week. Construct a​ 99% confidence interval for the population proportion of employed individuals who work at home at least once per week.
A simple random sample of size n=300 individuals who are currently employed is asked if they...
A simple random sample of size n=300 individuals who are currently employed is asked if they work at home at least once per week. Of the 300 employed individuals​ surveyed, 38 responded that they did work at home at least once per week. Construct a​ 99% confidence interval for the population proportion of employed individuals who work at home at least once per week. The lower bound is The upper bound is ​(Round to three decimal places as​ needed.)
A simple random sample of size n equals 400 individuals who are currently employed is asked...
A simple random sample of size n equals 400 individuals who are currently employed is asked if they work at home at least once per week. Of the 400 employed individuals​ surveyed, 34 responded that they did work at home at least once per week. Construct a​ 99% confidence interval for the population proportion of employed individuals who work at home at least once per week. The upper bound is (Round to three decimal places as​ needed.)
1. A simple random sample of size n equals 350 individuals who are currently employed is...
1. A simple random sample of size n equals 350 individuals who are currently employed is asked if they work at home at least once per week. Of the 350 employed individuals​ surveyed, 32 responded that they did work at home at least once per week. Construct a​ 99% confidence interval for the population proportion of employed individuals who work at home at least once per week. QUESTION- what is the lower and upper bound 2. A simple random sample...
Why would a cluster sample lead to lower data collection costs as compared to a simple...
Why would a cluster sample lead to lower data collection costs as compared to a simple random sample or a stratified probability sample.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT