Question

In: Statistics and Probability

data set will need at least four variables - at least two categorical and at least...

data set will need at least four variables - at least two categorical and at least two quantitative. For example, you might consider the following variables for American participants in a survey: birth month (categorical), state of birth (categorical), average number of bowls of cereal eaten per week (quantitative), and amount spent on groceries (quantitative).

(a) First, formulate a research question relating to two of your quantitative variables along the lines of "how does *quantitative variable 1* relate to *quantitative variable 2*?" For example, you might ask "Does the average height for students relate to the average number of hours slept by students?" Include the question in your Word document.

(b) Create a least-squares regression line that answers the research question posed in part (a). Your answer here will be graded on the following: (i) an appropriate scatterplot related to the two variables (ii) correlation coefficient "r" and coefficient of determination "r2" between the two variables, (iii) a determination of whether the correlation coefficient is significant and (iv) whether your line is correct (with slope and intercept) based on the data provided!

Hours of sleep Height (inches)
4 62
5 65
6 65
6 62
7 63
7 67
7 60
7 74
7 64
7 63
8 73
8 62
8 66
8 70
8 72
8 69
8 63
9 60
9 67
10 73

Solutions

Expert Solution

## ANSWER:

a) research question is

whether the Correlation between number of hours of sleep & height is significant?

b)

i) scatter plot is attached above

ii) Correlation coefficient r= 0.3658

this indicates that there is week positive linear correlation between number of hours of sleep & the height of a person

*Coefficient of determination= 0.1338

this indicates only 13.38% of the variability in y(height)s is explained by the X(hours)

iii)to test the Correlation coefficient is significant or not.

test statistic is t= 1.6677

Df= 18

p-value= 0.1127

let significance level is 0.05 which is smaller than pvalue

we fail to reject the null hypothesis of non significance

we conclude they the there is not significant linear relationship between these two variables

iv) to test the regression equation

height= 57.3529 + 1.1765×number of hours of sleep

intercept= 57.3529

slope= 1.1765

here test statistic F=2.7812

pvalue = 0.1127

since pvalue is less than 0.05 we conclude that the regression model is not significant.

​​​


Related Solutions

Design a correlational study, you will need two variables with at least five sets of data....
Design a correlational study, you will need two variables with at least five sets of data. between these two variables: time spent playing video games and aggression. My question: Assume the study produces a correlation of .56 between the variables. Analyze three possible causal reasons for the relationship.
Pick two variables that could be collected that would produce a set of data that would...
Pick two variables that could be collected that would produce a set of data that would have the mean much higher than the median or much lower than the median. please explain the variables
Question 2 Dummy variables can be used to represent categorical data ___ a) only when the...
Question 2 Dummy variables can be used to represent categorical data ___ a) only when the categorical data is used as a response variable b) only when the categorical data is used as an explanatory variable c) when the categorical is used as either the response or explanatory variable d) Dummy variables can never be used to represent categorical data Question 3 Consider the following OLS regression equation: predicted y = b0 + b1X1 + b2d. The "X1" refers to...
Hello, I am in need of some assistance in interpreting the data for the two variables...
Hello, I am in need of some assistance in interpreting the data for the two variables I did in a t-test for in Excel. Variable 1 is Relationship with Direct Supervisor and Variable 2 is the Workplace Happiness Rating. I am supposed to write a 125- to 175-word summary of my interpretation of the results of the t test. t-Test: Two-Sample Assuming Equal Variances Variable 1 Variable 2 Mean 2.5 7.4 Variance 1.030612245 2 Observations 50 50 Pooled Variance 1.515306122...
There are two variables in this data set. Variable Definition Height Height in inches Weight Weight...
There are two variables in this data set. Variable Definition Height Height in inches Weight Weight in pounds Using Excel, compute the standard deviation and variance (both biased and unbiased) for height and weight. Height weight 53 156 46 131 54 123 44 142 56 156 76 171 87 143 65 135 45 138 44 114 57 154 68 166 65 153 66 140 54 143 66 156 51 173 58 143 49 161 48 131
The family college data set contains a sample of 792 cases with two variables, teen and...
The family college data set contains a sample of 792 cases with two variables, teen and parents, and is summarized in Table below. The teen variable is either college or not, where the teenager is labeled as college if she went to college immediately after high school. The parent variable takes the value degree if at least one parent of the teenager completed a college degree. Parents Degree Parents No Degree Total Teen College 231 214 445 Teen Not college...
Consider two models that you are to fit to a single data set involving three variables:...
Consider two models that you are to fit to a single data set involving three variables: A, B, and C. Model 1 : A ~B Model 2 : A ~B + C (a) When should you say that Simpson’s Paradox is occuring? A. When Model 2 has a lower R2 than Model 1. B. When Model 1 has a lower R2 than Model 2. C. When the coef. on B in Model 2 has the opposite sign to the coef....
Hair Color and Job Title are examples of: continuous variables categorical variables quantitative variables ordinal variables...
Hair Color and Job Title are examples of: continuous variables categorical variables quantitative variables ordinal variables numerical variables discrete variables none of the above
In this question, we will formulate a measure to quantify the level of association between the two categorical variables.
  In this question, we will formulate a measure to quantify the level of association between the two categorical variables. Such a measure is often used in a statistical test called Chi-square test for assessing whether there is an association between two categorical variables. This question is also used to motivate the learning of independence and to connect the concept back to what we have learnt in the course.Let's revisit the example we have looked at in the course. How...
Propose a life event or situation and two categorical variables for it. Complete a chi-square analysis...
Propose a life event or situation and two categorical variables for it. Complete a chi-square analysis of this event or situation and these variables, and share your results. Do you agree with your findings? Why or why not? What other factors, if any, should be considered?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT