Question

1) In a study, the final statistical analysis showed that R-squared = 0.35 (p<0.01). Which one of the following interpretations best explains this result?

A) The model explains 65% of the variation in the outcome, because 1.00-0.35=65%.

B) No conclusions can be drawn because it is not apparent whether the estimated coefficients for each covariate were statistically significant.

C) About 35% of the variation in the outcome was explained by the independent variable(s).

D) The model explains about 35% of the variation for independent variable(s).

2) Which of the following describes the multicollinearity problem?

A) Some independent variables are strongly correlated with each other.

B) When the coefficient for a product of two independent variables (X1*X2) is statistically significant (p<0.05).

C) There is an interaction effect between two independent variables.

D) It will occur when linear regression encounters step-wise regression.

3) A PGY1 post-graduate conducted a survey study in her community. Of 10,000 surveyed residents, 200 have diabetes mellitus, 50 have heart disease, and 20 have both diabetes and heart disease. If a selected resident has diabetes mellitus, what is the probability that this same individual also has heart disease? (Hint: calculate the relevant conditional probability.)

A) 10%

B) 20%

C) 0.2%

D) 40%

E) 0.5%

4) A clinical researcher plans to conduct a linear regression analysis to assess the health-related quality of life score, which is the primary outcome and is continuous. The outcome will be regressed on 10 predictors or confounding factors, including age, sex, race, BMI, health education, family income, number of years since disease onset, etc. Based on our discussion in the lecture, what is the minimum number of patients he/she needs to recruit for this linear regression?

A) 50

B) 150

C) 200

D) 1500

E) 30

5) Since you learned multiple linear regression analysis in class, you are given the following linear regression model: Y (female life expectancy) = 82.7 - 0.12 * (fertility number) - 0.24 * (infant mortality per 1000). Please predict the female life expectancy in Ghana, where fertility number = 5.8 and infant mortality per 1000 = 58.3.

A) 80.7

B) 68.0

C) 57.6

D) 69.0

6) A research scientist conducted a factorial ANOVA for her clinical study, which involved 5 different therapy regimens in each of four different hospital settings. In order to assess the therapy effect, she would like to evaluate any interaction effect between hospital and regimen. The degrees of freedom for the interaction are equal to:

A) 7

B) 12

C) 8

D) 20

E) 6

7) There are two kinds of inferential statistics: parametric and non-parametric. Which of the following is NOT a parametric inferential statistic?

A) Student t-test

B) F-test

C) Two-way ANOVA

D) Wilcoxon test

E) ANCOVA

Solutions

Expert Solution

Solution-1:

R-squared = 0.35

So 35% of the variation in Y (the outcome) is explained by the independent variable(s).

C) About 35% of the variation in the outcome was explained by the independent variable(s).
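As a rough illustration of what "variation explained" means, R-squared can be computed as 1 minus the ratio of the residual sum of squares to the total sum of squares. The sketch below uses made-up numbers, not data from the study in the question; the fitted values are the group means from a least-squares fit on a hypothetical two-group indicator.

```python
def r_squared(observed, fitted):
    """Return R-squared = 1 - SS_residual / SS_total."""
    mean_y = sum(observed) / len(observed)
    ss_total = sum((y - mean_y) ** 2 for y in observed)
    ss_residual = sum((y - f) ** 2 for y, f in zip(observed, fitted))
    return 1 - ss_residual / ss_total


# Hypothetical outcome values and fitted values (assumed for illustration only).
observed = [1.0, 2.0, 3.0, 4.0]
fitted = [1.5, 1.5, 3.5, 3.5]

r2 = r_squared(observed, fitted)
print(f"R-squared = {r2:.2f}")  # share of variation in the outcome explained by the model
```

With R-squared = 0.35, the same ratio would say that 35% of the outcome's variation is accounted for by the model and 65% is left unexplained.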

Solution-2:

Multicollinearity means that some of the independent variables are strongly related to each other.

A strong relationship between the dependent variable and an independent variable is not multicollinearity; the term refers only to relationships among the independent variables.

Variance inflation factor: VIF = 1/(1 - R^2), where R^2 comes from regressing one independent variable on all the other independent variables.

As a common rule of thumb, if VIF > 10, the variable is considered multicollinear and is a candidate for exclusion.

A) Some independent variables are strongly correlated with each other.
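The variance inflation factor check can be sketched as follows. This is a minimal illustration: the predictor names and R^2 values are hypothetical, each R^2 is assumed to come from regressing that predictor on the other predictors, and the cutoff of 10 is only the rule of thumb mentioned in this solution.

```python
def variance_inflation_factor(r_squared):
    """VIF = 1 / (1 - R^2), where R^2 is from regressing one predictor on the others."""
    if not 0 <= r_squared < 1:
        raise ValueError("r_squared must be in [0, 1)")
    return 1.0 / (1.0 - r_squared)


# Hypothetical R^2 values, one per predictor (assumed for illustration only).
hypothetical_r_squared = {"age": 0.20, "bmi": 0.50, "family_income": 0.95}

for name, r2 in hypothetical_r_squared.items():
    vif = variance_inflation_factor(r2)
    flag = "possible multicollinearity" if vif > 10 else "ok"
    print(f"{name}: VIF = {vif:.2f} ({flag})")
```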

Solution-3:

P(D) = 200/10000 = 0.02

P(H) = 50/10000 = 0.005

P(D and H) = 20/10000 = 0.002

The resident is known to have diabetes, so the required probability is P(H|D), the probability of heart disease given diabetes.

From the definition of conditional probability:

P(H|D) = P(D and H)/P(D)

= 0.002/0.02

= 0.1

= 0.1*100

= 10%

A) 10%
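The conditional probability asked for in question 3 is P(H|D), heart disease given diabetes. A small sketch using the counts from the question can confirm the arithmetic and show how it differs from the reverse conditional P(D|H); the variable names are illustrative.

```python
from fractions import Fraction

# Counts taken from the question.
total_residents = 10_000
with_diabetes = 200
with_heart_disease = 50
with_both = 20

p_d = Fraction(with_diabetes, total_residents)
p_h = Fraction(with_heart_disease, total_residents)
p_both = Fraction(with_both, total_residents)

# P(H | D): heart disease given diabetes (what the question asks for).
p_h_given_d = p_both / p_d
# P(D | H): diabetes given heart disease (the reverse conditional, for comparison).
p_d_given_h = p_both / p_h

print(f"P(H|D) = {float(p_h_given_d):.0%}")
print(f"P(D|H) = {float(p_d_given_h):.0%}")
```

Because the condition is "has diabetes", the denominator is the 200 residents with diabetes, not the 50 with heart disease.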

Solution-5:

Y (female life expectancy) = 82.7 - 0.12 * (fertility number) - 0.24 * (infant mortality per 1000)

Given:

fertility number = 5.8

infant mortality per 1000 = 58.3

Y = 82.7 - 0.12*5.8 - 0.24*58.3

Y = 82.7 - 0.696 - 13.992

Y = 68.012

B) 68.0
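The prediction in question 5 is a direct substitution into the fitted equation. A minimal sketch, using the coefficients and the Ghana values given in the question (the function name is illustrative):

```python
def predict_female_life_expectancy(fertility, infant_mortality_per_1000):
    """Apply the regression model given in the question."""
    return 82.7 - 0.12 * fertility - 0.24 * infant_mortality_per_1000


# Values for Ghana as stated in the question.
ghana_prediction = predict_female_life_expectancy(5.8, 58.3)
print(f"Predicted female life expectancy: {ghana_prediction:.1f}")
```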

