In: Statistics and Probability
What is the coefficient of determinaton and cofficient of correlation for this data set
home prices vs square footage.
| 369,000 | 1,372 | 
| 569,000 | 2909 | 
| 439000 | 1837 | 
| 544000 | 2573 | 
| 399000 | 1642 | 
| 499000 | 2216 | 
| 599000 | 1400 | 
| 487000 | 2508 | 
| 410000 | 1800 | 
| 379950 | 1674 | 
| 565000 | 2056 | 
| 659000 | 4171 | 
| 859000 | 4308 | 
| 610000 | 2303 | 
| 450000 | 1698 | 
| 480000 | 1896 | 
| 319900 | 1970 | 
| 385000 | 2320 | 
| 449000 | 2846 | 
| 199000 | 1440 | 
| 369000 | 2295 | 
| 350000 | 1512 | 
| 449000 | 1900 | 
| 525000 | 2080 | 
| 770000 | 3168 | 
| 789000 | 3236 | 
| 675000 | 2784 | 
| 765000 | 2802 | 
Import the data set into R and store it into a data frame named dat.
| Home_Price | Sq_ft | 
| 369,000 | 1,372 | 
| 569,000 | 2909 | 
| 439000 | 1837 | 
| 544000 | 2573 | 
| 399000 | 1642 | 
| 499000 | 2216 | 
| 599000 | 1400 | 
| 487000 | 2508 | 
| 410000 | 1800 | 
| 379950 | 1674 | 
| 565000 | 2056 | 
| 659000 | 4171 | 
| 859000 | 4308 | 
| 610000 | 2303 | 
| 450000 | 1698 | 
| 480000 | 1896 | 
| 319900 | 1970 | 
| 385000 | 2320 | 
| 449000 | 2846 | 
| 199000 | 1440 | 
| 369000 | 2295 | 
| 350000 | 1512 | 
| 449000 | 1900 | 
| 525000 | 2080 | 
| 770000 | 3168 | 
| 789000 | 3236 | 
| 675000 | 2784 | 
| 765000 | 2802 | 
R Code Output-
> cor(dat$Home_Price,dat$Sq_ft)
[1] 0.7610977
> summary(lm(dat$Home_Price~dat$Sq_ft))
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 146970.69 64293.42 2.286 0.0307 *
dat$Sq_ft 158.35 26.47 5.983 2.57e-06 ***
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 104700 on 26 degrees of freedom
Multiple R-squared: 0.5793, Adjusted R-squared: 0.5631
F-statistic: 35.8 on 1 and 26 DF, p-value: 2.572e-06
Hence ,the correlation coefficient =0.7610977 and the coefficient of determination= 0.5793.