Question

In: Statistics and Probability

12. Below is a table detailing the number of days of personal travel over a year...

12. Below is a table detailing the number of days of personal travel over a year paired with annual household income (in 1000's of dollars) for 9 various families.
HH Income ($000's) Travel Days
61 11
32 6
45 13
35 9
22 3
89 21
30 8
74 15
37 9
a. Construct a scatterplot for this data set in the region to the right (with household income as the independent variable, and travel days as the dependent variable.)
b. Based on the scatterplot, does it look like a linear regression model is appropriate for this data?  Why or why not?
c. Add the line of best fit (trend line/linear regression line) to your scatterplot. Give the equation of the trend line below.  Then, give the slope value of the line and explain its meaning to this context.
d. Determine the value of the correlation coefficient.  Explain what this value tells you about the two variables?  
e. Based on the linear regression equation, what is the predicted number of personal travel days a person will take annually if their household income is $85,000?  Show your calculation.

Solutions

Expert Solution

a.

b. As we see that it is increasing trend and also if we draw a line through points, we see many points will fall on it, so linear model is appropriate

c.

Sum of X = 425
Sum of Y = 95
Mean X = 47.2222
Mean Y = 10.5556
Sum of squares (SSX) = 4075.5556
Sum of products (SP) = 894.8889

Regression Equation = ŷ = bX + a

b = SP/SSX = 894.89/4075.56 = 0.2196

For every increase in x, there is corresponding 0.2196 change in y.

a = MY - bMX = 10.56 - (0.22*47.22) = 0.1868

ŷ = 0.2196X + 0.1868

d.

X Values
∑ = 425
Mean = 47.222
∑(X - Mx)2 = SSx = 4075.556

Y Values
∑ = 95
Mean = 10.556
∑(Y - My)2 = SSy = 224.222

X and Y Combined
N = 9
∑(X - Mx)(Y - My) = 894.889

R Calculation
r = ∑((X - My)(Y - Mx)) / √((SSx)(SSy))

r = 894.889 / √((4075.556)(224.222)) = 0.9361

e. For x=85 y=(0.2196*85)+0.1868=18.8528


Related Solutions

The prices of Rawlston, Inc. stock (y) over a period of 12 days, the number of...
The prices of Rawlston, Inc. stock (y) over a period of 12 days, the number of shares (in 100s) of company's stocks sold (x1), the volume of exchange (in millions) on the New York Stock Exchange (x2), and the daily profit of the company (in thousands) (x3) are shown below. day y x1 x2 x3 1 87.50 950 11.00 40 2 86.00 945 11.25 45 3 84.00 940 11.75 27 4 78.00 930 11.75 22 5 84.50 935 12.00 34...
The data below represents the bacterial growth in a liquid culture over a number of days....
The data below represents the bacterial growth in a liquid culture over a number of days. Day 0 4 8 12 16 20 Amount ×106 67 84 98 125 149 185 Find a best-fit equation to the above data trend using polynomial regression with parabolic model given as: ? = ?? + ?1? + ?2?2 After calculating the values of ao, a1, and a2, substitute these values in the above parabolic model. Plot the given data and the obtained parabolic...
Below is a table detailing the age of nine randomly selected houses and the insurance claims...
Below is a table detailing the age of nine randomly selected houses and the insurance claims made on each home during the past five years. Age of houses (years) Insurance claims (1,000 of dollars) 72 10 35 6 45 8 39 5 22 3 100 21 57 8 74 15 37 9 Construct a scatterplot for this data set in the region to the right (with age of houses as the independent variable, and five-year insurance claim as the dependent...
The table below shows the number of deaths in the U.S. in a year due to...
The table below shows the number of deaths in the U.S. in a year due to a variety of causes. For these questions, assume these values are not changing from year to year, and that the population of the United States is 312 million people. Cause Deaths Passenger car occupant (driver or rider) 13,100 Motorcycle (driver or rider) 4,500 Tornado 553 Skydiving 56 a) What is the probability that an American chosen at random died as a passenger car occupant...
The table below shows the number of deaths in the U.S. in a year due to...
The table below shows the number of deaths in the U.S. in a year due to a variety of causes. For these questions, assume these values are not changing from year to year, and that the population of the United States is 312 million people. Cause Deaths Passenger car occupant (driver or rider) 13,100 Motorcycle (driver or rider) 4,500 Tornado 553 Skydiving 56 a) What is the probability that an American chosen at random died as a passenger car occupant...
The table below shows the number of students in each year at a certain university: Year...
The table below shows the number of students in each year at a certain university: Year of study 1 2 3 4 5 6 7 No. of students 300 280 275 175 92 48 30 You would like to select a random sample of 100 students from this university. i. Explain how you would choose a simple random sample. ii. Explain how you would choose a sample using systematic (interval) sampling method. iii. If you use stratified sampling method to...
Table 1 below shows the number of lawn mowers sold by Mangi Enterprises over a period...
Table 1 below shows the number of lawn mowers sold by Mangi Enterprises over a period of three years. Table 1: Sales Data of Lawn Mowers Jan Feb March April May June July Aug Sept Oct Nov Dec 20016 238 220 195 245 345 380 270 220 280 120 110 85 2017 135 145 185 219 240 420 520 410 380 320 290 240 2018 205 230 340 370 395 505 540 500 402 360 310 280 REQUIRED: Based on...
The table below contains various grades for three students obtained over the course of the year....
The table below contains various grades for three students obtained over the course of the year. Assuming that all grades are equally weighted, generate a new table that displays the name of the student, their average grade, and the number of grades they had in their record. import numpy as np np.random.seed(seed=0) Students = ['Samir','Mark','Zoe','Andrew','Rupert'] Grades_Data = [] for i in range(100): Student_Index = np.random.randint(0,len(Students)) Student = Students[Student_Index] Grade = round(np.random.normal(loc=75,scale=15),1) Grades_Data.append([Student,Grade]) import pandas as pd Grades_df = pd.DataFrame(data=Grades_Data,columns=['Student','Grade']) Grades_df.iloc[0:10]
The table shows the number of days on the market for the 36 recent home sales...
The table shows the number of days on the market for the 36 recent home sales in the city of Sonando Hills. 22 13 30 165 104 54 123 38 7 116 176 178 47 156 20 12 165 19 113 53 48 8 46 29 24 31 137 29 2 175 28 55 117 38 70 145 (a1) Construct a frequency distribution and histogram, using nice (round) bin limits. (Round your "percent" answers to 1 decimal place.) Days on...
The data set shown below contains the number of hurricanes that occurred each year over a​...
The data set shown below contains the number of hurricanes that occurred each year over a​ 14-year period. Some scientists claim that there has been an increasean increase in the number of hurricanes as the years progressed. Complete parts​ a) through​ d). Year 11 22 33 44 55 66 77 88 99 1010 1111 1212 1313 1414 ​# 11 22 22 00 22 33 33 11 11 22 44 11 11 00 ​a) Create a histogram of these data. Choose...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT